Can we train with parallel computing, multithreading, or multiprocessing? #366

Open · feature request (Add new feature)

joytianya opened this issue Jul 12, 2019 · 6 comments

@joytianya

Can we train with parallel computing, multithreading, or multiprocessing?
The goal is to speed up training.
Thank you.

@yutkin

yutkin commented Jul 22, 2019

@joytianya Yes, we can! For example, look at YouTokenToMe. That BPE implementation makes quite efficient use of parallel processing.
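
For reference, a minimal sketch of training with YouTokenToMe's Python bindings (the file names and vocabulary size below are placeholders, and the parameter names should be double-checked against the library's own docs); `n_threads=-1` asks it to use all available cores:

```python
import youtokentome as yttm

# Train a BPE model on a plain-text corpus ("corpus.txt" is a placeholder path).
# n_threads=-1 lets the library use all available CPU cores.
yttm.BPE.train(
    data="corpus.txt",
    model="bpe.model",
    vocab_size=30000,
    n_threads=-1,
)

# Load the trained model and tokenize a sentence into subword pieces.
bpe = yttm.BPE(model="bpe.model")
print(bpe.encode(["parallel BPE training example"], output_type=yttm.OutputType.SUBWORD))
```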

@taku910
Collaborator

taku910 commented Aug 2, 2019

Thank you. I will take a look. Actually, the current BPE algorithm is a bit conservative in how it finds the most frequent pairs.
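
For context, here is a toy sketch of the standard greedy BPE training loop (not SentencePiece's actual implementation): each merge step counts every adjacent symbol pair over the whole corpus and fuses the single most frequent pair, and it is this repeated global counting pass that parallel implementations shard across cores:

```python
from collections import Counter

def bpe_train(words, num_merges):
    """Toy greedy BPE: `words` maps a space-separated symbol sequence to its corpus frequency."""
    merges = []
    for _ in range(num_merges):
        # Count every adjacent symbol pair, weighted by word frequency.
        # This pass touches the whole corpus and is what parallel trainers split over cores.
        pairs = Counter()
        for word, freq in words.items():
            symbols = word.split()
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        # Greedily pick and apply the single most frequent pair (this step is sequential).
        best = max(pairs, key=pairs.get)
        merges.append(best)
        new_words = {}
        for word, freq in words.items():
            symbols, out, i = word.split(), [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    out.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            new_words[" ".join(out)] = new_words.get(" ".join(out), 0) + freq
        words = new_words
    return merges

# Example: the characters of "low" (x5), "lower" (x2), "newest" (x6) as a toy corpus.
print(bpe_train({"l o w": 5, "l o w e r": 2, "n e w e s t": 6}, num_merges=3))
```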

taku910 added the feature request label on Jan 10, 2021
taku910 self-assigned this on May 2, 2023
@taku910
Collaborator

taku910 commented May 2, 2023

Will work on it in the next release.

@lockmatrix

I am really looking forward to parallel training.
Training on Asian-language corpora is extremely slow even on a multi-core machine, so it feels like most of the CPU is going to waste...

@heyaudace

> Will work on it in the next release.

Hi @taku910 - I was wondering whether this feature has been released. Thank you.

@ganeshkrishnan1

Just tagging along: is it possible to use multi-threaded tokenization for multi-CPU training?
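
Not an official SentencePiece feature, but as a stopgap the tokenization side can be spread over multiple processes with Python's standard multiprocessing once a model has been trained. A rough sketch, with placeholder file names, assuming a reasonably recent `sentencepiece` Python package (it accepts `model_file=` in the constructor and `out_type=int` in `encode`):

```python
import multiprocessing as mp
import sentencepiece as spm

MODEL_FILE = "spm.model"   # placeholder: an already-trained SentencePiece model
_sp = None

def _init_worker():
    # Load the model once per worker process rather than once per line.
    global _sp
    _sp = spm.SentencePieceProcessor(model_file=MODEL_FILE)

def _encode(line):
    # Tokenize one line into piece ids.
    return _sp.encode(line, out_type=int)

if __name__ == "__main__":
    with open("corpus.txt", encoding="utf-8") as f:   # placeholder corpus path
        lines = [line.rstrip("\n") for line in f]
    # Spread tokenization over all available CPU cores.
    with mp.Pool(initializer=_init_worker) as pool:
        ids = pool.map(_encode, lines, chunksize=1000)
    print(ids[:2])
```

Training itself (spm_train) still runs as a single process, which is what this issue is asking about.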
