Possible to restrict RAM use with haploflow? #18

Open
chrisgab opened this issue Feb 15, 2023 · 2 comments

@chrisgab

Hi,
I have recently tested haploflow on a complex metagenomics dataset, and it seems to be performing very well compared to other tools in producing correctly assembled viral contigs. For testing, I have used half of my dataset (i.e. only forward reads), with an uncompressed file size of 8 GB, and this has run without issues. I have used the conda installation, and am running this on a Linux system with 250 GB RAM. However, when I try to use the full dataset (16 GB), the RAM usage increases until it is maxed out and the program eventually crashes. Is there any way to control the memory use to avoid these issues?

AlphaSquad self-assigned this Feb 16, 2023
@AlphaSquad
Collaborator

Hi, unfortunately there is currently no way to control memory manually, and the de Bruijn graph implementation is not optimised for large metagenomic datasets (memory scales with the number of distinct k-mers). Improving the memory behaviour is on my list, though, and there may really be a bug or memory leak somewhere, since I don't expect the reverse reads to add so many new k-mers. I will investigate.
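
To illustrate the point about memory scaling with the number of distinct k-mers, here is a minimal Python sketch that counts them for a set of reads. It is not Haploflow's code: the function names and the `canonical` switch are made up for this example, and whether Haploflow merges a k-mer with its reverse complement is an assumption, not something stated in this thread. The sketch only shows why reverse reads add few new k-mers when both strands are merged, and can roughly double the count when they are not.

```python
from typing import Iterable, Set

# Translation table for complementing DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def distinct_kmers(reads: Iterable[str], k: int, canonical: bool = True) -> Set[str]:
    """Collect the distinct k-mers seen in `reads`.

    With canonical=True (an assumption made for illustration), a k-mer and
    its reverse complement are stored once, under the lexicographically
    smaller of the two strings.
    """
    seen: Set[str] = set()
    for read in reads:
        read = read.upper()
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            # Skip k-mers containing ambiguous bases such as N.
            if any(base not in "ACGT" for base in kmer):
                continue
            if canonical:
                rc = reverse_complement(kmer)
                if rc < kmer:
                    kmer = rc
            seen.add(kmer)
    return seen


if __name__ == "__main__":
    forward = ["AAACCCGT"]
    reverse = [reverse_complement(r) for r in forward]
    for mode in (True, False):
        n_fwd = len(distinct_kmers(forward, 3, canonical=mode))
        n_both = len(distinct_kmers(forward + reverse, 3, canonical=mode))
        print(f"canonical={mode}: forward only {n_fwd}, forward + reverse {n_both}")
```

Running something like this on a subsample of the forward and reverse files (with the k used for assembly) would show whether the reverse reads really contribute a large number of new k-mers, or whether the extra memory comes from somewhere else.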

@chrisgab
Author

Thank you for your quick response. Very interested to hear what you find, and if a future version will implement more control over memory usage.
