Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Select genomes to sequence by NSTI #190

Open
zaneveld opened this issue Oct 19, 2015 · 0 comments
Open

Select genomes to sequence by NSTI #190

zaneveld opened this issue Oct 19, 2015 · 0 comments
Assignees

Comments

@zaneveld
Copy link
Contributor

One enhancement we've discussed for the next PICRUSt release is using NSTI values to inform strain selection in sequencing. This would be a new script inside PICRUSt that greedily selects genomes for sequencing in order to lower NSTI scores for a given environment, represented by an OTU table.

Required Inputs:
-- Tree (assume Greengenes by default?)
-- Count Table (needed only to know which strains have information).

Optional Inputs:
-- Target OTU table. If provided, look to minimize NSTI vs. the organisms actually present in the OTU table.
-- File providing a list of available strain ids [e.g. if someone is looking at a culture collection]. If not provided, assume all tips can be cultured (will be much slower).

Outputs:
-- Greedily selected set of strains for sequencing with estimated cumulative reduction in NSTI per strain.

@zaneveld zaneveld self-assigned this Oct 19, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant