-
Notifications
You must be signed in to change notification settings - Fork 2k
Issues: NVIDIA/Megatron-LM
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Configuring datasets using train-data-path, valid-data-path, and test-data-path results in training errors
#841
opened May 27, 2024 by
Eisenhower
[BUG] Checkpoint saving is slow for zarr backend + distributed optimizer
#834
opened May 22, 2024 by
chotzen
[QUESTION] Why enable
non_blocking=True
when doing synchronous D2H?
#833
opened May 22, 2024 by
raywan-110
[QUESTION] How to Obtain Computation Model Graphs in Megatron-LM?
#832
opened May 19, 2024 by
fwyc0573
[BUG] Modify FLOPs in MFU calculation for casual mask when using FlashAttention.
#831
opened May 17, 2024 by
Yuxin-CV
Question with forward_backward_pipelining_without_interleaving in Megatron-LM Pipeline
#830
opened May 17, 2024 by
Hongjie1Chu
[QUESTION] how to profile bubble time in pipeline parallelism?
#828
opened May 15, 2024 by
starstream
[BUG]:there is a small chance that it will get stuck, If i repeat runing test_serialization.py many times,
#825
opened May 14, 2024 by
starkhu
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.