Skip to content

Conversation

PrathamLearnsToCode
Copy link

bibtex, citations, minor corrections etc

@jingyangcarl
Copy link
Contributor

@PrathamLearnsToCode Thanks for the PR. Is it possible that I can have more details on how the corrections were made? They look nice to me.

@PrathamLearnsToCode
Copy link
Author

I ran logs for all the payloads that were missing the important metadata(this one had bibtex missing). Post that, I made a scraper to fetch the bib from the paper's official site, with a fallback to arxiv/openalex API which provides the missing keys if existed.

@jingyangcarl
Copy link
Contributor

@PrathamLearnsToCode Thanks for the message. I'm trying to build a unified structure scraper bot to get all the data. I think your results look solid. Can I invite you to build that bot together when you are available? My bot is designed to retrieve data from multiple sources and merge them afterwards to ensure the quality and consistency of the results in this repository. The repo also needs a lot of improvement, as there are a lot of compatibility issues across venues and across years. But I think it's valuable to use a unified norm for all the venues across years, as all the papers share the same metadata structure.

Please let me know your thoughts on this.
Best,
Jing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants