Skip to content

Navigation Menu

Explore
For
- Enterprise
- Teams
- Startups
- Education
By Solution
Resources
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

NVIDIA / Megatron-LM Public

Notifications
Fork 2k
Star 8.7k

Code
Issues 291
Pull requests 125
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security
Insights

Pull requests: NVIDIA/Megatron-LM

Labels 11 Milestones 0

Labels 11 Milestones 0

New pull request New

125 Open 172 Closed

125 Open 172 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[bug] fix xavier uniform init for output layers

#814 opened May 8, 2024 by hjlee1371

Loading…

Support for Megatron-VLM training

#806 opened May 5, 2024 by 1049451037

Loading…

2

Add dataset packing

#802 opened May 2, 2024 by shamanez

Loading…

fix finalize_model_grads when sp is on

#798 opened Apr 29, 2024 by zhaoyinglia

Loading…

Speed up the creation of attention mask

#797 opened Apr 29, 2024 by yuantailing

Loading…

1

Fix incorrect src argument in broadcast_params function

#796 opened Apr 26, 2024 by Yuxin-CV

Loading…

fix loading distributed checkpoint when enable auto-detect-ckpt-format but disable use-dist-ckpt

#794 opened Apr 24, 2024 by imh966

Loading…

modifed the model parreleized gpt pre-trainign script

#789 opened Apr 22, 2024 by shamanez

Loading…

forward step missing arg

#784 opened Apr 18, 2024 by malay-nagda

Loading…

fix a mistake when check if num_layers dividable by vpp

#781 opened Apr 16, 2024 by constroy

Loading…

Fix llama converter.

#777 opened Apr 12, 2024 by Victarry

Loading…

Update pretrain_bert.py

#772 opened Apr 9, 2024 by ocryptocode

Loading…

[very simple change] Remove duplicated code

#765 opened Apr 3, 2024 by NoelBird

Loading…

1

fix new bucket when param require new bucket

#762 opened Apr 2, 2024 by wangxicoding

Loading…

Updated fused_kernels import path

#760 opened Mar 31, 2024 by Yazeed7

Loading…

use new methods for communication

#758 opened Mar 30, 2024 by mayank31398

Loading…

drop redundant check

#757 opened Mar 30, 2024 by mayank31398

Loading…

Fix typo in README.md

#751 opened Mar 26, 2024 by HashiamKadhim

Loading…

Support S3 checkpointing for the torch strategy in distributed checkpointing

#748 opened Mar 22, 2024 by jrocmar

Loading…

8

[BUG FIX] Fix world_size bug in QuickStart Example

#747 opened Mar 22, 2024 by Mr-Philo

Loading…

Update outdated method name passed to get linear_layer function to match intented method that was imported

#740 opened Mar 18, 2024 by OckermanSethGVSU

Loading…

Replace outdated import path of get_forward_backward_func in eval_utils.py

#734 opened Mar 14, 2024 by OckermanSethGVSU

Loading…

1

fix torch softmax masking

#731 opened Mar 12, 2024 by JRD971000

Loading…

support more general inference case that query length > 1

#730 opened Mar 12, 2024 by yidong72

Loading…

Support S3 data loading

#729 opened Mar 11, 2024 by jrocmar

Loading…

Previous 1 2 3 4 5 Next

Previous Next

ProTip! Exclude everything labeled bug with -label:bug.

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.