Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[AI][Megtron] Train GPT models with NVIDIA Megatron-LM #49

Open
mozhgan-kch opened this issue Jul 8, 2021 · 2 comments
Open

[AI][Megtron] Train GPT models with NVIDIA Megatron-LM #49

mozhgan-kch opened this issue Jul 8, 2021 · 2 comments
Assignees

Comments

@mozhgan-kch
Copy link
Contributor

This Bootcamp is designed to give NLP researchers an overview of the fundamentals of NVIDIA Megatron-LM ( NVIDIA open-source framework for training very large language models). The focus will be on training GPT Megatron models specifically.

It will consist of intro to Megatron-LM code base, converting data to mmap format, understanding model parallel, data-parallel and how to config your training, then train and profile GPT Megatron models.

@Zenodia
Copy link
Collaborator

Zenodia commented Aug 30, 2021

I forked it and created a branch called megatron , i am now ready to create a PR , but i am not sure how you want me to do this. the ready repo can be found here https://github.com/Zenodia/gpubootcamp/tree/megatron

@mozhgan-kch
Copy link
Contributor Author

mozhgan-kch commented Oct 26, 2021

Add a link in the start notebook to our other materials. Check the AI materials for reference. Assigned to @mozhgan-kch

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants