Missing space at end of strings in NUM_WORDS #759

Derek-Jones · 2017-01-20T13:26:41Z

The following code in spacy/orth.pyx

NUM_WORDS = set('zero one two three four five six seven eight nine ten'
'eleven twelve thirteen fourteen fifteen sixteen seventeen'
'eighteen nineteen twenty thirty forty fifty sixty seventy'
'eighty ninety hundred thousand million billion trillion'
'quadrillion gajillion bazillion'.split())

is missing a space character after ten, seventeen, seventy, trillion.

At the moment ten is not recognised as a number, but teneleven is treated as like_number.

ines · 2017-01-20T14:10:13Z

Thanks – will be pushing the fix and regression test in a second! Also, now that I see it, this data should probably be moved to the English language data at some point in the future.

lock · 2018-05-09T04:38:25Z

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

ines added bug Bugs and behaviour differing from documentation lang / en English language data and models labels Jan 20, 2017

ines closed this as completed in 09ecc39 Jan 20, 2017

ines added a commit that referenced this issue Jan 20, 2017

Add regression test for #759

5f6f48e

lock bot locked as resolved and limited conversation to collaborators May 9, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Missing space at end of strings in NUM_WORDS #759

Missing space at end of strings in NUM_WORDS #759

Derek-Jones commented Jan 20, 2017

ines commented Jan 20, 2017

lock bot commented May 9, 2018

Missing space at end of strings in NUM_WORDS #759

Missing space at end of strings in NUM_WORDS #759

Comments

Derek-Jones commented Jan 20, 2017

ines commented Jan 20, 2017

lock bot commented May 9, 2018