
Multithreading for TNeuralDataLoadingFit.FitLoading #119

Open
gwiesenekker opened this issue Sep 1, 2023 · 5 comments

Labels: question (Further information is requested)

Comments

@gwiesenekker commented Sep 1, 2023

I see from the HypotenuseFitLoading example that GetTrainingPair supports a ThreadId argument. How do you add multi-threading support to the HypotenuseFitLoading example?

@gwiesenekker (Author) commented Sep 1, 2023

I have compiled the HypotenuseFitLoading example, and the output ('Threads') suggests that multithreading is enabled; it also looks like 'batchsize' controls the number of threads. That is not the usual meaning of 'batch size' in the context of NN training. How are 'batchsize' and the number of threads related?

@joaopauloschuler (Owner) commented

@gwiesenekker, regarding "I see from the HypotenuseFitLoading example that GetTrainingPair supports a ThreadId argument. How do you add multi-threading support to the HypotenuseFitLoading example?": this example already uses parallel processing. The class TNeuralDataLoadingFit relies on https://github.com/joaopauloschuler/neural-api/blob/master/neural/neuralthread.pas . The ThreadId parameter is there in case each thread needs to run thread-specific code (for example, writes, rather than pure data loading).
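
For illustration, here is a minimal sketch of a thread-aware training-pair callback. It assumes the (Idx, ThreadId, pInput, pOutput) signature and the TNNetVolume calls used in the HypotenuseFitLoading example; the owning class name and exact volume layout are illustrative:

```pascal
// Each worker thread calls this concurrently with its own ThreadId.
// Pure data generation like this needs no locking.
procedure TMyFitLoading.GetTrainingPair(Idx: integer; ThreadId: integer;
  pInput, pOutput: TNNetVolume);
var
  X, Y: TNeuralFloat;
begin
  X := Random(100);
  Y := Random(100);
  pInput.ReSize(2, 1, 1);          // two inputs: the triangle legs
  pInput.FData[0] := X;
  pInput.FData[1] := Y;
  pOutput.ReSize(1, 1, 1);         // one output: the hypotenuse
  pOutput.FData[0] := Sqrt(X*X + Y*Y);
  // ThreadId matters only when threads touch shared state,
  // e.g. one log file handle per thread instead of a shared one.
end;
```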

@joaopauloschuler (Owner) commented

In this API, each batch is distributed across parallel threads following the principle of data parallelism: https://www.telesens.co/2017/12/25/understanding-data-parallelism-in-machine-learning/ .

I have tested this API with up to 64 parallel threads on 64 CPU cores. It should support more threads, but 64 cores is the most I have tested with. In this API, your batch size should be bigger than the number of cores.
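
As a concrete sketch of the call (the argument order follows my reading of the HypotenuseFitLoading example and should be checked against its source), a batch size of 64 split across 8 threads gives each thread 64 div 8 = 8 samples per step, after which the per-thread gradients are combined:

```pascal
// Hedged sketch: verify the exact FitLoading signature in the
// HypotenuseFitLoading example before use.
NFit.FitLoading(
  NN,
  {TrainingVolumesCount=}10000,
  {ValidationVolumesCount=}1000,
  {TestVolumesCount=}1000,
  {batchsize=}64,   // keep this above the number of CPU cores
  {epochs=}50,
  @GetTrainingPair, @GetValidationPair, @GetTestPair
);
```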

@joaopauloschuler self-assigned this Sep 1, 2023
@joaopauloschuler added the 'question' label Sep 1, 2023
@gwiesenekker (Author) commented Sep 1, 2023

Thanks. So where do you specify the number of threads? From the source code, it appears that the default number of threads is GetSystemThreadCount, correct? Also, why does changing the 'batch size' in HypotenuseFitLoading to 8 change 'Threads' in the output to 8? Perhaps because the example's 'batch size' is 1 (one X, Y pair)?

@joaopauloschuler (Owner) commented

"Looking at the source code it looks like the default number of threads is GetSystemThreadCount, correct?"
In FPC, this is correct. In Delphi, TThread.ProcessorCount is used.
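
In sketch form, the default choice amounts to something like this (a simplification, not the literal code in neuralthread.pas); if you need to cap the thread count yourself, neuralfit exposes a property for that (MaxThreadNum, if I recall the name correctly; treat it as an assumption and check the unit):

```pascal
// Simplified sketch of the default thread-count selection;
// the real logic lives in neuralthread.pas.
function DefaultNeuralThreadCount: integer;
begin
  {$IFDEF FPC}
  Result := GetSystemThreadCount;    // Free Pascal / Lazarus
  {$ELSE}
  Result := TThread.ProcessorCount;  // Delphi
  {$ENDIF}
end;
```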

"Why does changing 'batch size' in HypotenuseFitLoading to 8 change 'Threads' in the output to 8? Perhaps because the 'batch size' is 1 (one X, Y pair)?"

This is a limitation in CAI: the number of threads needs to be equal to or smaller than the batch size. In this API, larger batch sizes lead to better hardware usage (more efficiency). Larger batch sizes also tend to reduce overfitting, but they may make learning harder. My own default for problems I have never touched before is a batch size of 64 elements, and I recommend 64 as a starting point. Some papers report batch sizes of up to 256 and even 512.

There are some experiments with batch size at this link: https://medium.com/mini-distill/effect-of-batch-size-on-training-dynamics-21c14f7a716e .

In this API, just to start "feeling" a new problem, I would try a batch size of 64, a learning rate of 0.001 or smaller, and an inertia of 0.9.
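
Expressed as code (the property names InitialLearningRate and Inertia are my recollection of neuralfit.pas; verify them against the unit), that starting point would look like:

```pascal
// Hedged sketch of a first-attempt configuration.
NFit.InitialLearningRate := 0.001;  // or smaller
NFit.Inertia := 0.9;                // momentum-like term
// ...and pass {batchsize=}64 in the FitLoading call, as above.
```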
