Suchiman/llama2.net

llama2.net

This is a pure C# port of Alfonso² Peterssen's Java port of Andrej Karpathy's awesome llama2.c, a very simple implementation for running inference on models with a Llama2-like, transformer-based LLM architecture.

Build

Requires the .NET 8 SDK.
The code expects tokenizer.bin in the current directory.
The sample stories15M.bin model can be found here

To build and run:

dotnet run -c Release stories15M.bin
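
Since the run depends on tokenizer.bin being in the current directory and on a model file being passed as an argument, a small pre-flight check can catch a missing file before the build starts. The following is only a sketch: the `preflight` function is a hypothetical helper, not part of the repository, and it assumes the model path is given relative to the current directory.

```sh
#!/bin/sh
# Hypothetical helper (not part of llama2.net): confirm that the files the
# README mentions exist before starting a run.
preflight() {
    # Model file to check; defaults to the sample model named in the README.
    model="${1:-stories15M.bin}"
    for f in tokenizer.bin "$model"; do
        if [ ! -f "$f" ]; then
            echo "missing: $f" >&2
            return 1
        fi
    done
    echo "ok: tokenizer.bin and $model found"
}

# Usage, from the project directory:
#   preflight stories15M.bin && dotnet run -c Release stories15M.bin
```

The function only reports whether the files are present; it does not validate their contents.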

About

Inference Llama 2 in one file of pure C#