Enhancement of Document Merging #4

Open
2 tasks
aditeyabaral opened this issue Oct 19, 2020 · 4 comments
Labels: enhancement (New feature or request), feature_request, hacktoberfest, help wanted (Extra attention is needed)

Comments

@aditeyabaral
Owner

Document merging can be enhanced in a few ways, such as:

  • Adding neural coreference resolution to retain more information while merging and comparing
  • Using better merging algorithms that involve contextual analysis
@aditeyabaral aditeyabaral added enhancement New feature or request help wanted Extra attention is needed hacktoberfest labels Oct 19, 2020
@issue-label-bot

Issue-Label Bot is automatically applying the label feature_request to this issue, with a confidence of 0.99. Please mark this comment with 👍 or 👎 to give our bot feedback!


@sakares
Contributor

sakares commented Oct 8, 2021

Hi @aditeyabaral,

I am interested in this issue but am not sure where to start.
Do you have a list of algorithms or papers worth trying? Thanks

@aditeyabaral
Owner Author

Hey @sakares,
You could look into topic modelling techniques: identify the sentences that belong to each topic and pick those sentences while preserving their original order. You could also cluster the documents to find the unique concepts, pick sentences from each concept, and combine them in order.

This should be done after preprocessing the text with neural coreference resolution.

I am open to any other suggestions as well since I do not have a fixed solution to this issue :)
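
As a starting point, the "cluster into concepts, pick a sentence per concept, keep the order" idea could be sketched roughly as below. This is only an illustration under assumptions, not the project's implementation: it swaps real topic modelling or embedding-based clustering for a simple greedy word-overlap (Jaccard) clustering, it assumes coreference resolution has already been applied to the input, and the names `tokenize`, `jaccard`, `merge_documents` and the `threshold` parameter are all hypothetical.

```python
import re


def tokenize(sentence):
    """Return the set of lowercase words longer than two characters.

    Dropping very short words is a crude stand-in for stop-word removal.
    """
    return {w for w in re.findall(r"[a-z0-9]+", sentence.lower()) if len(w) > 2}


def jaccard(a, b):
    """Jaccard similarity between two token sets (0.0 if either is empty)."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def merge_documents(documents, threshold=0.5):
    """Merge documents by keeping one sentence per 'concept' cluster.

    Assumption: the texts were already preprocessed with coreference
    resolution. Clustering here is greedy word overlap, not a topic model.
    """
    # Flatten all documents into one ordered stream of sentences.
    sentences = []
    for doc in documents:
        for s in re.split(r"(?<=[.!?])\s+", doc.strip()):
            if s:
                sentences.append(s)

    # Greedy clustering: a sentence joins the first cluster whose seed
    # sentence is similar enough, otherwise it starts a new cluster.
    clusters = []  # each cluster is a list of (position, sentence, tokens)
    for pos, s in enumerate(sentences):
        toks = tokenize(s)
        for cluster in clusters:
            if jaccard(toks, cluster[0][2]) >= threshold:
                cluster.append((pos, s, toks))
                break
        else:
            clusters.append([(pos, s, toks)])

    # From each cluster keep the sentence with the most distinct words
    # (ties go to the earliest one). Clusters are already ordered by the
    # first appearance of their concept, so that order is preserved.
    picked = [max(c, key=lambda item: (len(item[2]), -item[0])) for c in clusters]
    return " ".join(s for _, s, _ in picked)
```

Calling `merge_documents` with a list of document strings returns a single merged string. The similarity threshold and the "most distinct words" selection rule are arbitrary choices for the sketch and would likely need tuning or replacing with a proper topic model.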

@sakares
Contributor

sakares commented Oct 9, 2021

Thanks! I was thinking about Huggingface neural coreference as well. I will try it and report back.
