Pinned
Repositories
Showing 10 of 322 repositories
-
- LOLA-Megatron-DeepSpeed Public Forked from microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
-