
GPU parallelism for finetuning three different Hugging Face NLP models on the Bittensor dataset

In this repo, the Bittensor CLM finetuning script is converted to support multiple GPUs. The Bittensor version of the script was adapted from Hugging Face's transformers language-modeling example and can be found here: https://github.com/opentensor/clm_model_tuning

The script runs on 2, 4, 6, or 8 GPUs in parallel, which makes it possible to train larger models or to train them faster. It is written to train on the Bittensor dataset and works with the Hugging Face models gpt-neo-2.7B, gpt-j-6B, and gpt-neo-1.3B.
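The sketch below is not the actual contents of finetune_using_clm2.py; it is a minimal illustration, under the assumption that Hugging Face Accelerate is used, of how a single-GPU causal-LM training loop is adapted for multiple GPUs. The placeholder texts stand in for the Bittensor dataset.

```python
# Minimal multi-GPU training sketch with Hugging Face Accelerate (assumption:
# this mirrors the approach, not the exact code, of finetune_using_clm2.py).
import torch
from torch.utils.data import DataLoader
from accelerate import Accelerator
from transformers import AutoModelForCausalLM, AutoTokenizer, default_data_collator

accelerator = Accelerator()  # picks up the processes started by the launcher

model_name = "EleutherAI/gpt-neo-1.3B"  # or EleutherAI/gpt-neo-2.7B, EleutherAI/gpt-j-6B
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)

# Placeholder data; in the real script these texts come from the Bittensor dataset.
texts = ["example text pulled from the Bittensor dataset"] * 8
examples = [tokenizer(t, truncation=True, max_length=128) for t in texts]
for ex in examples:
    ex["labels"] = ex["input_ids"].copy()
train_dataloader = DataLoader(examples, batch_size=2, collate_fn=default_data_collator)

# prepare() wraps the model in DistributedDataParallel and shards the dataloader,
# so the loop below stays unchanged apart from accelerator.backward().
model, optimizer, train_dataloader = accelerator.prepare(model, optimizer, train_dataloader)

model.train()
for batch in train_dataloader:
    outputs = model(**batch)
    accelerator.backward(outputs.loss)  # replaces loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```

Launched with Accelerate (for example, `accelerate launch --num_processes 4 script.py`), the same loop runs on 2, 4, 6, or 8 GPUs without further changes.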

Dataset

Information about the dataset can be found at https://docs.bittensor.com/nested/TheDataset.html
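For orientation, the sketch below shows roughly how raw text can be pulled from the Bittensor (mountain) dataset before tokenization, following the pattern in the reference clm_model_tuning script. The bittensor API names used here (dataset(), no_tokenizer, dataloader()) are assumptions taken from that script and may differ between bittensor versions.

```python
# Rough sketch of loading raw text from the Bittensor dataset (assumed API,
# based on the reference script; check it against your installed bittensor version).
import bittensor
from datasets import Dataset

dataset = bittensor.dataset(no_tokenizer=True)   # assumed: raw text, not pre-tokenized
dataloader = dataset.dataloader(10)              # assumed: number of batches to pull from IPFS

texts = []
for batch in dataloader:
    texts.extend(batch)                          # each batch is a list of text strings

raw_dataset = Dataset.from_dict({"text": texts}) # hand off to the usual HF tokenize/group pipeline
dataset.close()
```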

How to Run

1. Run git clone https://github.com/opentensor/clm_model_tuning.git
2. Replace the contents of the finetuning script in the cloned reference repo with finetune_using_clm2.py from this repo.
3. Run each notebook to finetune the gpt-neo-2.7B, gpt-j-6B, and gpt-neo-1.3B models on the Bittensor dataset (see the launch sketch below).
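Because the finetuning is driven from notebooks, one way to start a multi-GPU run from inside a notebook is Accelerate's notebook_launcher. The sketch below is a hypothetical launch cell: training_function is a stand-in name for the training loop defined in finetune_using_clm2.py, and num_processes should match the number of GPUs you want to use.

```python
# Hypothetical notebook cell: launch the multi-GPU training loop.
from accelerate import notebook_launcher

def training_function():
    ...  # build the model, optimizer, and dataloader, then run the Accelerate training loop

# Set num_processes to the number of GPUs available (2, 4, 6, or 8).
notebook_launcher(training_function, args=(), num_processes=4)
```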