v2.1.8: Usability improvements and Serbian alpha tokenization
✨ New features and improvements
- NEW: Alpha tokenization support for Serbian.
- Improve language data for Urdu.
- Support installing and loading model packages in the same session.
🔴 Bug fixes
- Fix issue #4002: Make `PhraseMatcher` work as expected for `NORM` attribute.
- Fix issue #4063: Improve docs on `Matcher` attributes.
- Fix issue #4068: Make Korean work as expected on Python 2.7.
- Fix issue #4069: Add `validate` option to `EntityRuler`.
- Fix issue #4074: Raise error if annotation dict in simple training style has unexpected keys.
- Fix issue #4081: Fix typo in `pyproject.toml`.
- Fix handling of keyword arguments in `Language.evaluate`.
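To illustrate the kind of check described for issue #4074, here is a minimal, library-independent sketch. It is not the code used in the release: the `check_annotations` helper and the `ALLOWED_KEYS` set are hypothetical names chosen for this example, and the exact set of accepted keys is an assumption.

```python
# Hypothetical sketch of validating a "simple training style" annotation dict.
# ALLOWED_KEYS is an assumed set of annotation names, for illustration only.
ALLOWED_KEYS = {"words", "tags", "heads", "deps", "entities", "cats"}


def check_annotations(annotations):
    """Return the dict unchanged, or raise ValueError on unexpected keys."""
    unexpected = sorted(set(annotations) - ALLOWED_KEYS)
    if unexpected:
        raise ValueError(
            "Unexpected annotation keys: {}. Allowed keys: {}".format(
                unexpected, sorted(ALLOWED_KEYS)
            )
        )
    return annotations


# A well-formed training example: (text, annotations).
good = ("Uber blew through $1 million", {"entities": [(0, 4, "ORG")]})
check_annotations(good[1])

# A misspelled key ("entites") is reported instead of being silently ignored.
try:
    check_annotations({"entites": [(0, 4, "ORG")]})
except ValueError as err:
    message = str(err)
```

Failing early on a misspelled key makes typos in training data visible, rather than letting the annotation be dropped without notice.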
📖 Documentation and examples
- Improve `Matcher` attribute docs.
- Fix various typos and inconsistencies.
👥 Contributors
Thanks to @akornilo, @mirfan899, @veer-bains, @seppeljordan, @Pavle992, @svlandeg, @jenojp and @adrianeboyd for the pull requests and contributions.