Incorrect POS tags when multiple models are loaded #3853
Comments
Thanks for the report! I think I know what might be happening here: The
Thanks for the reply! That makes sense. I realize this is not the most common use case, but it's still a bit unexpected, so if it's not something that can be fixed easily, maybe a warning when you load a conflicting model could be helpful?
We'll at least add a warning in the next version, but I definitely do think this is a bug we should fix. Thanks again for the report.
When we repackage the models, we need to take care that the We could change the value of
Warn-and-continue was kind of a dumb behaviour, since the results for the model loaded first would predictably be bad. We may as well try changing the name. I added a warning pointing people here as well, so that it's easier to find the context if the problem is encountered. We should fix this properly in the v2.2 line of models, by making the vector names more specific.
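For readers trying to picture why "making the vector names more specific" would help, here is a minimal toy sketch of a name collision in a shared lookup table. It is only an illustration under stated assumptions: `SHARED_VECTORS`, `ToyModel`, and the names `"vectors"`, `"md.vectors"` and `"lg.vectors"` are all hypothetical, and none of this is spaCy's actual code or API.

```python
# Toy illustration only: this is NOT spaCy's actual implementation.
from typing import Dict, List

# Hypothetical process-wide table of vector data, keyed by name alone.
SHARED_VECTORS: Dict[str, List[float]] = {}


class ToyModel:
    """Stand-in for a pipeline whose components look vectors up by name."""

    def __init__(self, vectors_name: str, vectors: List[float]) -> None:
        self.vectors_name = vectors_name
        # Registering under an existing name silently replaces the old entry.
        SHARED_VECTORS[vectors_name] = vectors

    def lookup(self) -> List[float]:
        return SHARED_VECTORS[self.vectors_name]


# Both toy models use the same generic name, so the second load
# overwrites the entry that the first model relies on.
md = ToyModel("vectors", [1.0, 2.0])
lg = ToyModel("vectors", [9.0, 9.0, 9.0])
clobbered = md.lookup()  # md now sees lg's vectors

# Model-specific names keep the two entries apart.
SHARED_VECTORS.clear()
md_fixed = ToyModel("md.vectors", [1.0, 2.0])
lg_fixed = ToyModel("lg.vectors", [9.0, 9.0, 9.0])
separate = md_fixed.lookup()  # md_fixed still sees its own vectors
```

In the toy version, the model loaded first is the one that ends up with the wrong data, which matches the direction of the symptom reported here. Whether the real cause works the same way is an assumption.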
@honnibal does this have any effect on custom models loaded one after another?
Something strange is happening when en_core_web_md and en_core_web_lg are loaded at the same time, which leads to many POS tagging errors in the model that was loaded first.

The weird tagging results mentioned in this comment turn out to be an issue when multiple models are loaded at the same time rather than a problem specific to en_core_web_md.

To reproduce:
Output:
Loading en_core_web_sm doesn't seem to cause similar problems, but loading en_core_web_md/en_core_web_lg in either order leads to many incorrect tags (plus obviously cascading errors in the rest of the pipeline) in the model that was loaded first.

Your Environment
spacy 2.0 doesn't seem to have this issue.