
Multi-gpu training degrades performance #95

Open
XiaoXuan42 opened this issue Dec 7, 2023 · 2 comments

@XiaoXuan42 commented Dec 7, 2023
Hi, are there any caveats with this lib when using multiple GPUs for training, e.g. anything that needs special handling like InnerBatchnorm? I trained two versions of the same network architecture, one on a single GPU and the other on multiple GPUs, and their performance differs a lot.
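
One possible culprit, assuming the model uses standard `torch.nn.BatchNorm*` layers under the hood (not confirmed here): under `DistributedDataParallel`, each replica normalizes with statistics from its local mini-batch only, so training can behave differently from a single-GPU run. A minimal sketch of converting to synchronized batch norm:

```python
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

# Minimal sketch, assuming the model contains standard torch.nn.BatchNorm*
# layers internally (not confirmed for this library). SyncBatchNorm shares
# normalization statistics across all replicas, i.e. over the global batch,
# instead of each GPU using only its local shard.
dist.init_process_group(backend="nccl")
local_rank = dist.get_rank() % torch.cuda.device_count()
torch.cuda.set_device(local_rank)

model = build_model().cuda()  # build_model() is a hypothetical constructor
model = torch.nn.SyncBatchNorm.convert_sync_batchnorm(model)
model = DDP(model, device_ids=[local_rank])
```

Whether `convert_sync_batchnorm` reaches the normalization layers wrapped inside InnerBatchnorm would need to be checked for this library.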

@maxxxzdn commented Mar 5, 2024

Just curious, how severe is the degradation? Did you also adjust the batch size and learning rate when training on multiple GPUs?
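
For reference (my own illustration, not something established in this thread): under DDP the effective global batch is the per-GPU batch times the number of processes, so a learning rate tuned on one GPU often needs rescaling. A common heuristic is the linear scaling rule:

```python
import torch
import torch.distributed as dist

# Hypothetical values for the sketch; only the scaling logic matters.
base_lr = 1e-3            # learning rate tuned for single-GPU training
per_gpu_batch_size = 32

# Under DDP the effective batch is per_gpu_batch_size * world_size, so
# the linear scaling rule multiplies the base rate by the same factor.
world_size = dist.get_world_size()
global_batch_size = per_gpu_batch_size * world_size
scaled_lr = base_lr * world_size

model = torch.nn.Linear(16, 4)  # placeholder; stands in for the real model
optimizer = torch.optim.SGD(model.parameters(), lr=scaled_lr, momentum=0.9)
```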

@lrenaux-bdai commented
I just posted #103, which could be what you're referring to.
