Multi-gpu training degrades performance #95

XiaoXuan42 · 2023-12-07T08:42:51Z

Hi, are there any caveats when using this library to train on multiple GPUs, such as special handling for InnerBatchNorm? I trained two versions of the same network architecture, one on a single GPU and the other on multiple GPUs, and their performance differs a lot.

maxxxzdn · 2024-03-05T10:16:13Z

Just curious, how severe is the degradation? Did you also adjust for batch size and learning rate when training on multiple GPUs?
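
For reference, here is a minimal sketch of the kind of adjustment asked about above. It assumes data-parallel training where each GPU processes its own mini-batch, so the effective batch size is the per-GPU batch size times the number of GPUs, and it uses the linear learning-rate scaling heuristic, which is a common rule of thumb and not something this library prescribes. The function name and all numbers are hypothetical.

```python
def scale_for_multi_gpu(base_lr, base_batch_size, per_gpu_batch_size, num_gpus):
    """Return (effective_batch_size, scaled_lr) under the linear scaling heuristic.

    base_lr and base_batch_size describe the single-GPU run being matched.
    """
    # With data parallelism, every GPU sees per_gpu_batch_size samples per step.
    effective_batch_size = per_gpu_batch_size * num_gpus
    # Heuristic: scale the learning rate in proportion to the effective batch size.
    scaled_lr = base_lr * effective_batch_size / base_batch_size
    return effective_batch_size, scaled_lr


# Hypothetical setup: the single-GPU run used batch size 32 and lr 0.01.
# Keeping 32 samples per GPU on 4 GPUs quadruples the effective batch size.
effective, lr = scale_for_multi_gpu(
    base_lr=0.01, base_batch_size=32, per_gpu_batch_size=32, num_gpus=4
)
print(effective, lr)
```

Alternatively, dividing the per-GPU batch size by the number of GPUs keeps the effective batch size and learning rate unchanged, but each GPU then sees fewer samples per step, which matters for any batch-normalization layer that computes its statistics per device.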
