
Support BF16 for FSDP #963

Open
yuvalkirstain opened this issue Mar 22, 2022 · 8 comments
Assignees
Labels
FSDP FullyShardedDataParallel (zero-3)

Comments

@yuvalkirstain

Feature Request

Please support BF16 mixed-precision

Additional context

Training with BF16 is usually more stable than FP16, which is especially important when training large models. Additionally, many models (e.g. T5) are pretrained in BF16, and continuing to train them with FP16 mixed precision results in NaNs.

Thank you!

@anj-s
Contributor

anj-s commented Mar 22, 2022

Thank you for this issue! We are currently working on adding support for bf16 and hope to have it done very soon :)

I'm assuming you meant bf16 support with FSDP? Or were you thinking of another API?

@anj-s anj-s self-assigned this Mar 22, 2022
@yuvalkirstain
Author

Exactly, bf16 with FSDP!

@anj-s anj-s changed the title Support BF16 Support BF16 for FSDP Mar 22, 2022
@anj-s anj-s added the FSDP FullyShardedDataParallel (zero-3) label Mar 22, 2022
@yuvalkirstain
Author

@anj-s please let me know if there is anything we can do to help; having support for bf16 with FSDP in Fairseq would really help us! :)

@yuvalkirstain
Author

Hi, has there been any progress with resolving this issue? @anj-s
Thank you so much

@anj-s
Contributor

anj-s commented May 25, 2022

> Hi, has there been any progress with resolving this issue? @anj-s Thank you so much

Hi @yuvalkirstain, I think this should work without any issues. Can you try using bfloat16 by passing the right compute_dtype argument when using FSDP? Unfortunately I haven't had a chance to add a unit test, but perhaps someone else on the team has looked into this. cc @anupambhatnagar @min-xu-ai
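
For reference, a minimal sketch of what that suggestion might look like, assuming fairscale's FSDP accepts torch.bfloat16 through its compute_dtype argument as described above (untested; the module below is just a placeholder):

```python
# Minimal sketch (untested): pass bfloat16 as the compute dtype to fairscale FSDP.
# Assumes torch.distributed is already initialized (e.g. launched via torchrun).
import torch
import torch.nn as nn
from fairscale.nn import FullyShardedDataParallel as FSDP

model = nn.Linear(1024, 1024).cuda()  # placeholder module

fsdp_model = FSDP(
    model,
    mixed_precision=True,          # keep full-precision master params, cast for compute
    compute_dtype=torch.bfloat16,  # run forward/backward in bf16 instead of the fp16 default
)
```

With bf16 one would presumably also skip the FP16 loss scaler, since bf16 has the same exponent range as fp32 and should not need loss scaling.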

@wangleiofficial

bfloat16 support with PyTorch Lightning would be better; have you considered this?

@toriving

toriving commented Jul 29, 2022

Is there currently any progress on this issue?
Or would it work if I just applied the branch mentioned above?

@anupambhatnagar

There has been no progress on this so far.

Projects
None yet
Development

No branches or pull requests

5 participants