Pull requests: karpathy/nanoGPT

Pull requests list

fix: h100-mfu-calculation
#464 opened Mar 24, 2024 by OrenLeung
Fixing eval path in README
#463 opened Mar 24, 2024 by goswamig
Refactor for easier configuration and overrides
#459 opened Mar 20, 2024 by ikeman32
Early stopping
#453 opened Mar 9, 2024 by derekehyatt
Implement ROPE positional encodings
#450 opened Mar 8, 2024 by devinbot
fix: estimate_mfu dt ZeroDivisionError
#446 opened Mar 2, 2024 by HildaM
Generalize encode/decode for datasets
#415 opened Jan 5, 2024 by GMNGeoffrey
Fix BUG when = in CLI value, like: --start="1+1="
#412 opened Jan 4, 2024 by DIYer22
Dockerfile using Nvidia Container Toolkit
#409 opened Dec 27, 2023 by niccolox
Azure deployment
#406 opened Dec 18, 2023 by lakaschus
Update transformer_sizing.ipynb
#402 opened Dec 10, 2023 by Cassini-chris
Update configurator.py
#394 opened Nov 20, 2023 by GeniusPlums
Update bench.py
#393 opened Nov 20, 2023 by GeniusPlums
Fix typo in running instructions for train.py
#387 opened Oct 25, 2023 by psoulos
adding TP inference
#386 opened Oct 24, 2023 by HamidShojanazeri
Cleaner & more verbose & colored output on console
#382 opened Oct 2, 2023 by klezm
10 tasks done
Fix IndexError on val.bin generate
#379 opened Sep 23, 2023 by Jisan09
Added links for other references
#372 opened Sep 17, 2023 by AayushSameerShah
Add streaming output in sample.py
#369 opened Sep 2, 2023 by karmi
Add printing sample output during training
#368 opened Sep 2, 2023 by karmi
Code formatting according to PEP8 standards
#365 opened Aug 24, 2023 by Djacon
Implement modular encoder/decoder class
#364 opened Aug 24, 2023 by bkarab03