You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Regex is good for pre-splitting, but I've noticed it behaves weirdly sometimes, when more context is needed in a pre-split sentence. I propose to add an option to use SpaCy as a sentence pre-splitter
The text was updated successfully, but these errors were encountered:
klein-t
changed the title
Alternatives to regex for pre-splitting
Rolling Window splitter: alternatives to regex for pre-splitting
Mar 12, 2024
sometimes, I'd have a short sentence split in two by a colon, which I'd like to keep as one sentence. Current regex seem to split it in two, SpaCy does not.
Don't get me wrong, I like regex, is fast, but I feel having SpaCy in the loop might help deal with more nuanced scenarios.
Regex is good for pre-splitting, but I've noticed it behaves weirdly sometimes, when more context is needed in a pre-split sentence. I propose to add an option to use SpaCy as a sentence pre-splitter
The text was updated successfully, but these errors were encountered: