failure on empty input sequence file #193
Comments
The error is due to the input qza file containing no sequences. In this case, an upstream job failed to produce sequences for conversion to qza and subsequent classification with the q2-classifier. It appears that the q2-classifier assumes at least one sequence in the input qza. Adding a check for sequences might be helpful.
'['grep', '-c', '^>', ...]' failing when q2-cls run on AWS Batch
@nick-youngblut, did you encounter this when running
This seems to be the same issue as #175. I'm going to close this in favor of that issue.
Bug Description
Error:
Why is grep used instead of just loading the file and iterating through the lines? I get that Python is a bit slower than grep, but the file size shouldn't be very big. An advantage of using Python to count sequences instead of grep is that one gets a better stack trace of the issue than with the current code:
The error only occurs on AWS Batch (works when running locally using the same docker image). I've provided >400 Gb of memory, so that is not the issue, and there should be plenty of disk space.
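To illustrate the pure-Python counting suggested above, here is a minimal sketch. The function names `count_fasta_sequences` and `require_sequences` are hypothetical and not part of the q2-classifier code; the sketch assumes the input is a plain-text FASTA file in which every record starts with a `>` header line. For context, `grep -c` exits with status 1 when no line matches, so a zero-sequence file can surface as a subprocess failure rather than as a count of 0.

```python
def count_fasta_sequences(fasta_path):
    """Count FASTA records by counting header lines that start with '>'."""
    count = 0
    with open(fasta_path, "r") as handle:
        for line in handle:
            if line.startswith(">"):
                count += 1
    return count


def require_sequences(fasta_path):
    """Return the sequence count, or raise a descriptive error if it is zero."""
    n_seqs = count_fasta_sequences(fasta_path)
    if n_seqs == 0:
        # Hypothetical message; the point is that the failure names its cause.
        raise ValueError(f"No sequences found in input file: {fasta_path}")
    return n_seqs
```

With something like this, an empty input would raise a `ValueError` that names the file, and an unreadable path would raise the usual `FileNotFoundError` or `PermissionError`, which may also help narrow down the AWS Batch-only failure.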
Steps to reproduce the behavior
I'm using quay.io/qiime2/core:2023.2 for the docker image, and running q2-classifier on AWS Batch via Nextflow (with Fusion & Wave).
Computation Environment
quay.io/qiime2/core:2023.2