Skip to content
This repository has been archived by the owner on Jul 25, 2024. It is now read-only.

Get Mariona's algorithms up and running #30

Open
alexhebing opened this issue Jul 3, 2019 · 1 comment
Open

Get Mariona's algorithms up and running #30

alexhebing opened this issue Jul 3, 2019 · 1 comment
Labels
enhancement New feature or request

Comments

@alexhebing
Copy link

Continues from #26. Client choses to go for 1), but wants to see if the algorithm actually performs.
Do minimal updates and try to get the scripts running as they are, so that some basic tests are possible.

@alexhebing
Copy link
Author

alexhebing commented Jul 15, 2019

I got the script up and running by updating to Python 3.4. Most changed were needed to the database connection (handled PyMySQL now instead of MySqlDb). The script do include some very dirty fixes now (all with comments), simply because I wanted them to run.

Tested with the Dutch corpora included with the scripts, and a new (!), tiny corpus, NERred by Stanford (and manually edited to remove 'B-' and 'I-' from the tags), and it actually works!

Will take these results with me to client and see if / how we can establish if the algorithm performs well. (thinking of looking at the texts and the coordinates found)

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant