Pull requests: karpathy/nanoGPT
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix: conditional use of GradScaler based on device_type and dtype in train.py
#481
opened May 9, 2024 by
BRAINIAC2677
Loading…
PyTorch nn.LayerNorm now takes bias arg - removed custom class
#454
opened Mar 10, 2024 by
calmitchell617
Loading…
Fix a small bug in the attention bias calculation when flash attention is not available
#398
opened Nov 28, 2023 by
tbuthfer
Loading…
Cleaner & more verbose & colored output on console
#382
opened Oct 2, 2023 by
klezm
Loading…
10 tasks done
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-04-25.