
System requirements/performance: 45 secs for the POS example? #763

Closed
larsschwarz opened this issue Jan 21, 2017 · 6 comments

Comments

@larsschwarz

Are there any system requirements for running spaCy, or is something wrong with my system config? The example POS tagging script takes 45 seconds to finish on my 2-core VPS (4 GB RAM, Ubuntu 16.04, Python 2.7, spaCy 1.6.0), using the German model and a test sentence of 9 words.

Is this a general system performance issue, or an issue caused by not using Python 3?
Are there any recommendations (CPU- and memory-wise) for when I would like to use spaCy for "just in time" POS tagging?

@mattmacy

I believe the bottleneck is loading the GloVe vectors. The loader manually deserializes all 1 million 300-dimensional vectors regardless of whether they're used. I'm working on reducing the loading overhead so that it scales with the size of the vocabulary that is actually used.
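
The fix mattmacy describes isn't shown in the thread; as a rough illustration of the idea only (fetch vectors on first use instead of deserializing the whole table upfront), here is a minimal sketch in which a plain dict stands in for the on-disk GloVe store — this is not spaCy's actual loading code:

```python
class LazyVectors:
    """Load word vectors on demand rather than all at startup."""

    def __init__(self, store):
        self._store = store   # full vector table (stand-in for the GloVe file)
        self._cache = {}      # holds only the vectors actually requested

    def __getitem__(self, word):
        # Deserialize a vector the first time it is asked for, then reuse it.
        if word not in self._cache:
            self._cache[word] = list(self._store[word])
        return self._cache[word]

store = {"Haus": [0.1, 0.2], "Baum": [0.3, 0.4]}
vecs = LazyVectors(store)
vecs["Haus"]
# At this point only the "Haus" vector has been materialized;
# startup cost is proportional to usage, not to the 1M-row table.
```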

@honnibal
Member

honnibal commented Jan 22, 2017

The load time is currently a significant problem. You can make things better by setting parser=False.

The good news is that this is all overhead — once loaded the tagger is very fast. So on real usage you'll be able to process a lot of text.
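
The amortization honnibal describes can be sketched without spaCy itself: pay the load cost once, then every subsequent call is cheap. The simulated load and stub tagger below are assumptions for illustration, not spaCy code:

```python
import time
from functools import lru_cache

@lru_cache(maxsize=None)
def get_nlp():
    """Stand-in for spacy.load('de', parser=False): expensive, runs once."""
    time.sleep(0.2)  # simulate the one-time model-loading overhead
    return lambda text: [(w, "X") for w in text.split()]  # stub POS tagger

t0 = time.perf_counter()
get_nlp()("Das ist ein kurzer Testsatz")
first_call = time.perf_counter() - t0

t0 = time.perf_counter()
for _ in range(100):
    get_nlp()("Das ist ein kurzer Testsatz")
next_100_calls = time.perf_counter() - t0

# Tagging 100 sentences costs far less than the single initial load,
# which is why a long-running process sees good throughput.
```

The practical consequence: keep one loaded nlp object alive in a long-running process (a web worker, a daemon) rather than re-running the load script per sentence.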

@larsschwarz
Author

Disabling the parser does not change anything for me.

nlp = spacy.load('de', parser=False)

still takes 41 to 52 seconds for that simple sentence.

@mattmacy

Sorry to hear that. I'm back to focusing on this issue. I hope to have something that @honnibal can use in a few days. Then it's a question of when he can get the time to integrate it.

TL;DR - even if you need GloVe vectors this shouldn't be a problem for too much longer.

@ines
Member

ines commented Mar 18, 2017

Closing this – the new version supports a smaller model for faster loading!

@ines ines closed this as completed Mar 18, 2017
@lock

lock bot commented May 9, 2018

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

@lock lock bot locked as resolved and limited conversation to collaborators May 9, 2018