FinetuneLLMs

A collection of finetuning scripts for all kinds of LLMs.

This repo aims to provide the finest collection of tuning scripts, easily accessible to anyone.

Every training script in this repo is tested across multiple platforms.

Supported finetuning techniques

There is still a lot to implement, so stay tuned.

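| Model | SFT | DPO | ORPO | KTO | PRO |
| --- | --- | --- | --- | --- | --- |
| llama 2 | ✅ | ❌ | ❌ | ❌ | ❌ |
| llama 3 | ✅ | ❌ | ✅ | ❌ | ❌ |
| llama-gguf | ✅ | ❌ | ❌ | ❌ | ❌ |
| phi-3 | ✅ | ❌ | ❌ | ❌ | ❌ |
| Mistral | ✅ | ✅ | ❌ | ❌ | ❌ |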

General Setup

Pull submodules
```
git submodule update --init
```
Install PyTorch

The easiest way to do this is via conda. If you don't have conda, please follow its installation guide first.
```
conda install pytorch torchvision torchaudio pytorch-cuda=11.7 -c pytorch -c nvidia
```
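Once the install finishes, it can help to confirm that PyTorch is importable and whether it can see a CUDA device. The snippet below is a minimal sketch, not part of this repo: the helper name `describe_torch_install` is hypothetical, and it only uses `torch.__version__` and `torch.cuda.is_available()`.

```python
import importlib.util


def describe_torch_install():
    """Summarize the local PyTorch install (hypothetical helper, not part of this repo)."""
    # Check for the package first so the script still runs when torch is missing.
    if importlib.util.find_spec("torch") is None:
        return {"installed": False, "version": None, "cuda_available": False}

    import torch

    return {
        "installed": True,
        "version": torch.__version__,
        # False on CPU-only builds or when no usable NVIDIA driver is found.
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    print(describe_torch_install())
```

If `cuda_available` comes back `False` on a machine with an NVIDIA GPU, the usual suspects are a CPU-only build of PyTorch or a driver that does not match the installed CUDA version.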
If you don't want to use conda, I recommend a separate virtual environment for each LLM, as they have different requirements.

For Linux

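Install CUDA by following the NVIDIA installation guide, then create and activate a virtual environment for the model you want to train (llama2 is used as the example here):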

```
cd llama2
python3 -m venv .llama2
source ./.llama2/bin/activate
pip3 install torch
pip3 install -r requirements.txt
```
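Because each model directory gets its own virtual environment, it is easy to install requirements into the wrong interpreter by accident. As a rough sanity check (a sketch, with `in_virtualenv` being a hypothetical helper name), you can ask Python whether it is running inside a venv before installing anything:

```python
import sys


def in_virtualenv():
    """Return True when the current interpreter runs inside a venv (hypothetical helper)."""
    # Inside a venv, sys.prefix points at the environment directory,
    # while sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
    print("interpreter:", sys.executable)
```

After `source ./.llama2/bin/activate`, the interpreter path printed here should sit inside the `.llama2` directory.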

For Windows (with Nvidia GPU)

Enable WSL2 on your machine.

Install CUDA by following the NVIDIA installation guide.

For Mac

Refer to mac/README.md

Please note that individual training techniques may need additional dependencies. Refer to the README in the corresponding model directory.





jazelly/FinetuneLLMs
