-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Lemmatization of won't #948
Comments
Thanks! Will fix. |
Just fixed this on master and it should work now – the tokenizer exceptions for the contractions were missing a |
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
can't
andwon't
get tokenized asca n't
andwo n't
respectively.In previous versions of spacy, the lemma of
ca
andwo
wasca
andwo
.Sicne the recent update,
ca
now is correctly lemmatized ascan
, butwo
is stillwo
when it should bewill
.Edit:
Also just noticed the lemma of
sha
insha n't
is not converted toshall
.Your Environment
The text was updated successfully, but these errors were encountered: