Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consolidate algorithm_output.tsv and unassigner_output.tsv #39

Open
Ulthran opened this issue Oct 21, 2024 · 0 comments
Open

Consolidate algorithm_output.tsv and unassigner_output.tsv #39

Ulthran opened this issue Oct 21, 2024 · 0 comments
Assignees

Comments

@Ulthran
Copy link
Contributor

Ulthran commented Oct 21, 2024

Currently running unassign /path/to/input.fasta --output path/to/output will produce four output files with two being:

algorithm_output.tsv

query_id	species_name	typestrain_id	region_mismatches	region_positions	probability_incompatible	mu1	num_references	mu2	nonregion_positions_in_subject	max_nonregion_mismatches
1	Sphingomonas sediminicola	AB258386	48	1409	1	0.03439716312056738	5	0.03439716312056738	31	-12
2	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA
...

unassigner_output.tsv

query_id	species	typestrain_id	region_mismatches	region_positions	probability_incompatible
1	Sphingomonas sediminicola	AB258386	48	1409	1
2	NA	NA	NA	NA	NA
...

where one appears to be completely contained within the other. We should only output one or else make it clearer in the naming convention that one is an expanded version of the other.

@Ulthran Ulthran self-assigned this Oct 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant