Why I found that the generation of embedding is very slow? #110

zhuchenxi · 2019-11-26T06:48:27Z

I found that in CPU mode, the speed of the generation of embedding is about 26 sentences per hour. Is that slow or normal?

hoschwenk · 2019-11-28T18:23:57Z

I haven't done speed test on CPU, but 26 sentences per hour is definitely by way too slow.
Are yo using multiple threads and bunch mode ?

hoschwenk · 2019-12-04T13:47:07Z

Any new insights ?

ShunChi100 · 2020-02-04T21:47:07Z

I found that setting max_sentences to some an int number helps, e.g. 128 (depending on your GPU memory) in the SentenceEncoder(). Otherwise, SentenceEncoder() treat one sentence as one batch.

ericmclachlan · 2020-03-11T13:36:48Z

@hoschwenk : I'm fairly new to using LASER (which is awesome!). You mention multiple threads and bunch mode... Is there any documentation you can point me to help me take advantage of these features? The FLASK API provided by the docker deployment seems to be strictly single-threaded.
TIA

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why I found that the generation of embedding is very slow? #110

Why I found that the generation of embedding is very slow? #110

zhuchenxi commented Nov 26, 2019

hoschwenk commented Nov 28, 2019

hoschwenk commented Dec 4, 2019

ShunChi100 commented Feb 4, 2020

ericmclachlan commented Mar 11, 2020

Why I found that the generation of embedding is very slow? #110

Why I found that the generation of embedding is very slow? #110

Comments

zhuchenxi commented Nov 26, 2019

hoschwenk commented Nov 28, 2019

hoschwenk commented Dec 4, 2019

ShunChi100 commented Feb 4, 2020

ericmclachlan commented Mar 11, 2020