Unexpected lemma: "inner" -> "inn" #912

Closed
davechallis opened this issue Mar 23, 2017 · 2 comments
Labels
bug Bugs and behaviour differing from documentation

Comments

davechallis commented Mar 23, 2017

Other lemmatization software I've used gives "inner" -> "inner". I'm not sure what the correct lemma is, only that "inn" seems wrong here.

I noticed there are some planned tasks for improving lemmatization in spaCy, so I can hold off on reporting lemmatization issues if that's not helpful at the moment.

Example case:

import spacy

# Load the English model.
nlp = spacy.load('en')
# Print each token next to its lemma.
for token in nlp("Wearing inners in the innermost inner space in the inn."):
    print(token, token.lemma_)

Output:

Wearing wear
inners inner
in in
the the
innermost innermost
inner inn
space space
in in
the the
inn inn
. .
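
To make the problem case easy to spot, here is a minimal sketch that compares the output above against the lemmas I would expect. It does not call spaCy; the pairs are copied from the output. The names `expected` and `find_mismatches` are hypothetical, and the expected lemma for "inner" is an assumption based on what other lemmatizers return.

```python
# Observed (token, lemma) pairs, copied from the output above.
observed = [
    ("Wearing", "wear"),
    ("inners", "inner"),
    ("in", "in"),
    ("the", "the"),
    ("innermost", "innermost"),
    ("inner", "inn"),
    ("space", "space"),
    ("in", "in"),
    ("the", "the"),
    ("inn", "inn"),
    (".", "."),
]

# Assumption: the lemmas expected for the tokens of interest.
expected = {
    "inners": "inner",
    "innermost": "innermost",
    "inner": "inner",
    "inn": "inn",
}


def find_mismatches(pairs, expected_lemmas):
    """Return (token, observed_lemma, expected_lemma) for each disagreement."""
    return [
        (token, lemma, expected_lemmas[token])
        for token, lemma in pairs
        if token in expected_lemmas and lemma != expected_lemmas[token]
    ]


mismatches = find_mismatches(observed, expected)
for token, got, want in mismatches:
    print(f"{token!r}: got {got!r}, expected {want!r}")
```

Only the "inner" -> "inn" pair is flagged; the other tokens match what I would expect.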

Info about spaCy

  • spaCy version: 1.7.2
  • Platform: Darwin-16.4.0-x86_64-i386-64bit
  • Python version: 3.6.0
  • Installed models: en

honnibal (Member) commented

Thanks, that's definitely wrong.

honnibal added the bug (Bugs and behaviour differing from documentation) and performance labels, then removed the performance label, Mar 25, 2017

lock bot commented May 9, 2018

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

lock bot locked as resolved and limited conversation to collaborators May 9, 2018