b-d-e/lightning-gpt

Based on Andrej Karpathy's 'NanoGPT' lecture, which trains a small transformer on a Shakespeare dataset, and refactored to train with PyTorch Lightning.

Character-level tokenizer and decoder-only transformer architecture, trained with masked (causal) self-attention.
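As a rough illustration of the two ideas above, here is a minimal pure-Python sketch. It is not the code from this repository: the sample text and the names `encode`, `decode`, and `causal_attention_weights` are hypothetical, and the real model would do the masking on PyTorch tensors rather than nested lists.

```python
import math

# Stand-in text; the actual project trains on a Shakespeare corpus.
text = "to be, or not to be"

# Character-level tokenizer: every distinct character gets an integer id.
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}
itos = {i: ch for ch, i in stoi.items()}


def encode(s):
    """Map a string to a list of integer token ids."""
    return [stoi[c] for c in s]


def decode(ids):
    """Map a list of integer token ids back to a string."""
    return "".join(itos[i] for i in ids)


def causal_attention_weights(scores):
    """Row-wise softmax over a T x T score matrix with a causal mask.

    Position t may only attend to positions <= t; later positions
    receive a weight of exactly 0.
    """
    weights = []
    for t, row in enumerate(scores):
        visible = row[: t + 1]
        m = max(visible)  # subtract the max for numerical stability
        exps = [math.exp(s - m) for s in visible]
        total = sum(exps)
        weights.append([e / total for e in exps] + [0.0] * (len(row) - t - 1))
    return weights
```

Round-tripping a string through `encode` and `decode` should return it unchanged, and each row of the attention weights sums to 1 with zeros above the diagonal, which is what stops a decoder-only model from looking at future characters.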

Training tested on an A100 (40 GB) and an M2 MacBook.

About

⚡️ Refactoring Karpathy's Nano-GPT to train with PyTorch Lightning.
