Spacy recognizes "gonna" as an adverb #691
Comments
Thanks. We'll add this to the tokenizer exceptions. If you want an immediate workaround, you should be able to do this at run-time. See here: https://spacy.io/docs/usage/customizing-tokenizer
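To make the idea of a tokenizer exception concrete, here is a minimal plain-Python sketch. It does not call spaCy and is not how spaCy implements its exceptions; the `SPECIAL_CASES` table and the `tokenize` helper are hypothetical names used only for illustration, and the whitespace splitting is a simplifying assumption (so "I'm" stays in one piece here). In spaCy itself, the run-time hook for this is the tokenizer's special-case mechanism described on the page linked above.

```python
# Hypothetical exception table: surface form -> list of (text, norm) pieces.
# "gonna" is split the way the issue describes Stanford doing it.
SPECIAL_CASES = {
    "gonna": [("gon", "going"), ("na", "to")],
}


def tokenize(text, special_cases=SPECIAL_CASES):
    """Split on whitespace, then expand any chunk listed in special_cases.

    Returns a list of (text, norm) pairs. Chunks without an exception
    are returned unchanged, with norm equal to text.
    """
    tokens = []
    for chunk in text.split():
        pieces = special_cases.get(chunk)  # exact, case-sensitive match
        if pieces is None:
            tokens.append((chunk, chunk))
        else:
            tokens.extend(pieces)
    return tokens


if __name__ == "__main__":
    for text, norm in tokenize("I'm gonna send an invitation"):
        print(text, norm)
```

With "gonna" split into two pieces, a tagger and parser downstream can treat it like "going to" instead of assigning a single adverb tag.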
Adding this on the organize-language-data branch in the new format so it'll be fixed in v2.
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
In sentences like "I'm gonna send an invitation", spaCy recognizes "gonna" as an adverb. This messes up dependency parsing. Stanford instead tokenizes "gonna" as two tokens (gon=VBG, na=TO), so it behaves like "going to". I found this behaviour more appropriate.