v2.1.8: Usability improvements and Serbian alpha tokenization

ines released this 08 Aug 09:20

· 5795 commits to master since this release

✨ New features and improvements

NEW: Alpha tokenization support for Serbian
Improve language data for Urdu.
Support installing and loading model packages in the same session.

🔴 Bug fixes

Fix issue #4002: Make PhraseMatcher work as expected for NORM attribute.
Fix issue #4063: Improve docs on Matcher attributes.
Fix issue #4068: Make Korean work as expected on Python 2.7.
Fix issue #4069: Add validate option to EntityRuler.
Fix issue #4074: Raise error if annotation dict in simple training style has unexpected keys.
Fix issue #4081: Fix typo in pyproject.toml.
Fix handling of keyword arguments in Language.evaluate.

📖 Documentation and examples

Improve Matcher attribute docs.
Fix various typos and inconsistencies.

👥 Contributors

Thanks to @akornilo, @mirfan899, @veer-bains, @seppeljordan, @Pavle992, @svlandeg, @jenojp and @adrianeboyd for the pull requests and contributions.

Assets 2