pmc build & memory corruption #34

Open
cbaakman opened this issue Dec 16, 2015 · 0 comments

Segmentation faults frequently occur while mrs is indexing the pubmed databank, using the raw xml files from ftp://ftp.ncbi.nih.gov/pub/pmc/articles*.tar.gz as input. The location of the segfaults varies, but they always coincide with an allocation or deallocation. Inspection of some of the deallocation segfaults at https://github.com/cmbi/mrs/blob/master/src/M6Lexicon.cpp#L213 shows that the pointers involved are valid.

Facts about the update process:

  • MRS always reports invalid checksums for the pubmed tars, though they are readable by command-line tar.
  • Pubmed updating runs in multiple threads, one thread per archive.
  • MRS uses a custom XML parser for pubmed.
  • The pubmed xml files are much larger than the average pdb file.
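
To narrow down the checksum observation, here is a minimal standalone sketch of how a tar header checksum is normally verified. It is not taken from the MRS sources; every name in it (`TarHeader`, `header_sum`, `stored_checksum`, `checksum_ok`) is hypothetical. It assumes the archive has already been decompressed and that one 512-byte header block has been read into memory. The checksum is the sum of all header bytes with the 8-byte checksum field (offset 148) counted as spaces, stored as an octal number. Some older tar writers summed the bytes as signed chars, so a reader that only accepts the unsigned sum can reject archives that command-line tar still reads.

```cpp
#include <array>
#include <cstddef>

// A tar header block is 512 bytes; the checksum field is 8 bytes at offset 148.
constexpr std::size_t kBlockSize = 512;
constexpr std::size_t kChecksumOffset = 148;
constexpr std::size_t kChecksumLength = 8;

// Hypothetical type for one raw header block (not an MRS type).
using TarHeader = std::array<unsigned char, kBlockSize>;

// Sum all header bytes, counting the checksum field itself as ASCII spaces.
// treat_as_signed = true reproduces the historical signed-char variant.
long header_sum(const TarHeader& h, bool treat_as_signed)
{
    long sum = 0;
    for (std::size_t i = 0; i < kBlockSize; ++i) {
        const bool in_field =
            i >= kChecksumOffset && i < kChecksumOffset + kChecksumLength;
        const unsigned char b = in_field ? ' ' : h[i];
        sum += treat_as_signed ? static_cast<signed char>(b) : b;
    }
    return sum;
}

// Parse the octal number stored in the checksum field.
// Leading spaces are skipped; parsing stops at the first non-octal byte.
long stored_checksum(const TarHeader& h)
{
    long value = 0;
    std::size_t i = kChecksumOffset;
    const std::size_t end = kChecksumOffset + kChecksumLength;
    while (i < end && h[i] == ' ')
        ++i;
    for (; i < end && h[i] >= '0' && h[i] <= '7'; ++i)
        value = value * 8 + (h[i] - '0');
    return value;
}

// Accept the header if the stored value matches either summing convention.
bool checksum_ok(const TarHeader& h)
{
    const long stored = stored_checksum(h);
    return stored == header_sum(h, false) || stored == header_sum(h, true);
}
```

Running a check like this on the first header block of one of the pubmed tars would show whether the stored value matches the unsigned sum, the signed sum, or neither. If it matches neither, the header bytes MRS sees probably differ from those on disk, which might be related to the memory corruption rather than to the archives themselves.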