Skip to content
hrybacki edited this page Mar 20, 2013 · 2 revisions

Sprint Goals:

  1. Refine the crowd surfing engine to be more robust and modular.
  2. Design and implement a skeleton for the journal source listers, fetchers, and parsers.
  3. Design and implement a scalable database solution.
  4. Design a front end API for users to interact with the parsing engine and deposit personal article and citation information.

TODO:

  • Think about database optimization while parsing large groups of citations that relate to each other.
  • Think about creating a Bloom filter vs a Hash table for database queries as it grows.
  • Think about a way to verify/merge similar documents during and after their creation and storage.
  • Think about a journal abbreviation to journal conversion module.
Clone this wiki locally