Quote attribution annotation #190
Comments
Not entirely sure. I see why (and that) it makes sense, but I will not -- in the long term -- be able to maintain an annotation tool that is too generic. Quote + speaker + addressee is already very close to general slot filling. Need to think about it. I plan to use CorefAnnotator for a new project soonish, which will also be a different task from coreference and require some adaptations. Maybe there is a way to make this possible, but it will require substantial development effort.
Fair enough. It's indeed a nontrivial feature, e.g. in terms of UI and file format.
If you need a new file format for this tool combination, I'd be happy to implement an exporter. One more thought: I plan to integrate an editor for entity relations and properties (e.g., X is female [property] and mother of Y [relation]). Technically, this could be extended to also allow relations between mentions. If that is done, an entity mention could be related to a speech mention. But we would need a new UI for this ...
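To make the relation idea concrete, here is a minimal Python sketch of how entity properties, entity relations, and mention-to-mention relations could be modelled. All class names, field names, and the `speaker_of` label are hypothetical and chosen for illustration only; none of this reflects CorefAnnotator's actual data model or file format.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass(frozen=True)
class Span:
    """A text span; `kind` distinguishes entity mentions from speech (hypothetical)."""
    begin: int
    end: int
    kind: str  # "mention" or "speech"


@dataclass
class Entity:
    """An entity with free-form properties, e.g. {"gender": "female"}."""
    name: str
    properties: Dict[str, str] = field(default_factory=dict)


@dataclass(frozen=True)
class Relation:
    """A labelled, directed relation between two entity names or two spans."""
    label: str
    source: object
    target: object


def speakers_of(speech: Span, relations: List[Relation]) -> List[Span]:
    """Return all mention spans linked to `speech` by a 'speaker_of' relation."""
    return [r.source for r in relations
            if r.label == "speaker_of" and r.target == speech]


# Entity-level property and relation: X is female and mother of Y.
x = Entity("X", {"gender": "female"})
y = Entity("Y")
entity_relations = [Relation("mother_of", x.name, y.name)]

# Mention-level relation: a mention of X is the speaker of a speech span.
x_mention = Span(begin=40, end=43, kind="mention")
speech = Span(begin=0, end=38, kind="speech")
mention_relations = [Relation("speaker_of", x_mention, speech)]
```

With this layout, quote attribution is just one relation label among many, which is what makes it generic but also what would require a dedicated UI to annotate comfortably.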
The preprocessing I have in mind involves:
Adding this XML format as a new export format is possible, but neither the information to identify possible speakers nor the quotation spans are available within CorefAnnotator, so I guess it makes more sense to do the conversion with an external tool. The idea of doing quote attribution annotation using mention relations could work, but I think it would be messy to treat quotations as regular mentions/entities. It would be better to have quotations as a separate kind of span with its own tag in the XMI file; e.g.
What do you think about adding quote attribution annotation as a feature?
This would involve creating a special/separate type of "mention" for quotes, which has a slot for its speaker mention (and perhaps addressee as well).
It might make sense to do such annotation together with coreference annotation because then you already have the mentions and entities of possible speakers, and it makes sense to annotate both because the tasks can benefit from training data of the other.
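As a rough illustration of what I mean, here is a minimal Python sketch of a separate quote span with speaker and addressee slots that point at existing coreference mentions. The class names, fields, entity ids, and the example sentence are all made up for this sketch and are not taken from CorefAnnotator or from the paper linked below.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Mention:
    """A coreference mention: character offsets plus the id of its entity."""
    begin: int
    end: int
    entity_id: str


@dataclass
class Quote:
    """A quotation span with optional slots for speaker and addressee mentions."""
    begin: int
    end: int
    speaker: Optional[Mention] = None
    addressee: Optional[Mention] = None


def speaker_entity(quote: Quote) -> Optional[str]:
    """Resolve the quote's speaker to an entity id via the coreference layer."""
    return quote.speaker.entity_id if quote.speaker else None


text = '"I am tired," said Mary to John.'
mary = Mention(begin=19, end=23, entity_id="e_mary")
john = Mention(begin=27, end=31, entity_id="e_john")
quote = Quote(begin=0, end=13, speaker=mary, addressee=john)
```

Because the slots reference mentions rather than raw text, the speaker of a quote is resolved to an entity through the existing coreference chains, which is why annotating both layers together is attractive.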
For reference, see http://aclweb.org/anthology/E17-1044, which also comes with an annotation tool.