POS tag of "LANG" assigned to tokens, causing a KeyError #3958
Labels
bug
Bugs and behaviour differing from documentation
Comments
Damn, we threw the symbols table out of alignment. I don't get why our tests didn't catch this :(. Fix forthcoming, sorry!
Fixed, and v2.1.6 uploaded. Wheels coming soon. Thanks again for the quick report.
How to reproduce the behaviour
Hello! I upgraded to v2.1.5 💫 and ran into an issue POS-tagging a text that wasn't present yesterday in v2.1.4. Specifically, the en_core_web_sm model assigns "LANG" as a POS tag for some tokens, which afaik isn't a valid value. This, in turn, raises a KeyError when calling tok.pos_ on the offending tokens, since parts_of_speech.IDS doesn't have "LANG" as a key.
Diving in, I see that the POS tags are nonsensical:
I have no idea what's gone wrong. Here's a full example, using a fresh install of both spacy and the model:
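For readers who want to see the failure mode in isolation, here is a rough sketch of why a tag ID outside the POS table ends in a KeyError. It is not the snippet from this report and it does not import spaCy: the tables POS_IDS and POS_NAMES, the helper functions, and every numeric ID below are hypothetical stand-ins chosen for illustration, not spaCy's real values.

```python
# Hypothetical, simplified lookup tables. The numeric IDs are made up
# for illustration and do not match spaCy's actual symbol values.
POS_IDS = {"ADJ": 84, "NOUN": 92, "VERB": 100}
POS_NAMES = {pos_id: name for name, pos_id in POS_IDS.items()}


def pos_name(pos_id):
    """Strict lookup: raises KeyError when pos_id is not a known POS ID."""
    return POS_NAMES[pos_id]


def safe_pos_name(pos_id, default="<invalid>"):
    """Tolerant lookup: returns a placeholder instead of raising."""
    return POS_NAMES.get(pos_id, default)


def find_invalid(tagged_tokens):
    """Return the (text, pos_id) pairs whose ID is missing from the POS table."""
    return [(text, pos_id) for text, pos_id in tagged_tokens
            if pos_id not in POS_NAMES]


if __name__ == "__main__":
    # 83 stands in for an ID that belongs to some other symbol, not a POS.
    tagged = [("quick", 84), ("fox", 92), ("jumps", 83)]
    for text, pos_id in tagged:
        print(text, safe_pos_name(pos_id))
    print("offending tokens:", find_invalid(tagged))
```

With real tokens, the same idea would be to compare each token's integer tag against the set of known POS IDs before asking for its string name, so the offending tokens can be listed instead of crashing on the first one.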
Your Environment