Spacy recognizes gonna as an adverb #691

Closed
nilehmann opened this issue Dec 16, 2016 · 3 comments

Comments

@nilehmann

In sentences like "I'm gonna send an invitation", spaCy recognizes "gonna" as an adverb. This messes up dependency parsing. Stanford instead tokenizes "gonna" as two tokens (gon=VBG, na=TO), so it behaves like "going to". I found this behaviour more appropriate.

@honnibal
Member

honnibal commented Dec 16, 2016

Thanks. We'll add this to the tokenizer exceptions.

If you want a workaround immediately, you should be able to do this at run-time. See here: https://spacy.io/docs/usage/customizing-tokenizer
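
As a rough sketch of what such a run-time workaround could look like: the snippet below registers a tokenizer special case that splits "gonna" into "gon" and "na". It assumes a spaCy version that provides `spacy.blank` and `Tokenizer.add_special_case`. The blank English pipeline and the example sentence are illustrative choices, and newer releases may already ship this exception, in which case the call simply re-registers it.

```python
import spacy
from spacy.symbols import ORTH

# Assumption: a blank English pipeline is enough here, since only the
# tokenizer is being customized (no trained model is loaded).
nlp = spacy.blank("en")

# Register a special case: whenever the tokenizer sees the exact string
# "gonna", emit two tokens instead. The ORTH values must concatenate
# back to the original string ("gon" + "na" == "gonna").
nlp.tokenizer.add_special_case("gonna", [{ORTH: "gon"}, {ORTH: "na"}])

doc = nlp("I'm gonna send an invitation")
tokens = [token.text for token in doc]
print(tokens)
```

Special cases match the exact string, so a capitalized "Gonna" would need its own entry. With a trained pipeline, the tagger and parser would then see "gon" and "na" as separate tokens, which is the behaviour the original report asks for.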

ines added the "🌙 nightly" label (Discussion and contributions related to nightly builds) on Dec 17, 2016
ines added this to the "Reorganise language data" milestone on Dec 17, 2016
@ines
Member

ines commented Dec 17, 2016

Adding this on the organize-language-data branch in the new format so it'll be fixed in v2.

ines closed this as completed in d8d50a0 on Dec 18, 2016
ines removed the "🌙 nightly" label on Dec 18, 2016
@lock

lock bot commented May 9, 2018

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

lock bot locked this as resolved and limited conversation to collaborators on May 9, 2018