TinyGPT

Tiny C++11 GPT-2 inference implementation from scratch, which is mainly based on the project picoGPT.

Accompanying blog post: Write a GPT from scratch (TinyGPT)

Core class

Tensor: Tensor class similar to the numpy interface.
Model: GPT-2 model implementation with reference to gpt2_pico.py.
Tokenizer: BPE tokenizer with exactly the same logic as GPT-2 encoder.py.

Build and Run

1. Get the code

git clone --recurse-submodules https://github.com/keith2018/TinyGPT.git

2. Install Intel MKL(Math Kernel Library)

Official website: Intel®-Optimized Math Library for Numerical Computing on CPUs & GPUs

3. Download GPT-2 model file

python3 tools/download_gpt2_model.py

if success, you'll see the file model_file.data in directory assets/gpt2

4. Build and Run

mkdir build
cmake -B ./build -DCMAKE_BUILD_TYPE=Release
cmake --build ./build --config Release

This will generate the executable file and copy assets to directory app/bin, then you can run the demo:

cd app/bin
./TinyGPT_demo
[DEBUG] TIMER TinyGPT::Model::loadModelGPT2: cost: 800 ms
[DEBUG] TIMER TinyGPT::Encoder::getEncoder: cost: 191 ms
INPUT:Alan Turing theorized that computers would one day become
GPT:the most powerful machines on the planet.
INPUT:exit

Dependencies

GEMM acceleration
- intel-mkl https://www.intel.com/content/www/us/en/developer/tools/oneapi/onemkl.html
Json parser
- json11 https://github.com/dropbox/json11
Tokenizer regular matching
- re2 https://github.com/google/re2
- abseil-cpp https://github.com/abseil/abseil-cpp

License

This code is licensed under the MIT License (see LICENSE).

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
app		app
assets/gpt2		assets/gpt2
src		src
test		test
third_party		third_party
tools		tools
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md

License

keith2018/TinyGPT

Folders and files

Latest commit

History

Repository files navigation

TinyGPT

Core class

Build and Run

1. Get the code

2. Install Intel MKL(Math Kernel Library)

3. Download GPT-2 model file

4. Build and Run

Dependencies

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages