Pull requests: karpathy/llm.c
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
(optionally) recompute gelu activations to reduce activation memory
#420
opened May 16, 2024 by
ngc92
Loading…
Include a matmul_backward_bias kernel based on PMPP CoarsenedSumReduction kernel in 10.15
#419
opened May 16, 2024 by
lancerts
Loading…
Remove parallel CUDA streams while keeping main_stream and loss_event(?)
#417
opened May 15, 2024 by
ademeure
Loading…
Free cooperative groups implementation of online softmax forward
#383
opened May 8, 2024 by
KarhouTam
Loading…
Modified version of ademeure's fused gelu_forward kernel
#363
opened May 5, 2024 by
ChrisDryden
Loading…
Experimenting with global instantiation for the layouts
#347
opened May 3, 2024 by
ChrisDryden
•
Draft
Adding GPT2 model evaluation on WikiText-103 with optional preprocessing in dev/model-eval/
#340
opened May 3, 2024 by
joeshmoe0112358
Loading…
Added FlameGraphs for nsys reports and some nsys documentation
#333
opened May 2, 2024 by
PeterZhizhin
Loading…
Rewrite the encoder_forward float4 kernel with pack128
#302
opened Apr 30, 2024 by
lancerts
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.