
You are calling save_pretrained to a 4-bit converted model, but your bitsandbytes version doesn't support it. #3951

Open
shripadk opened this issue Mar 3, 2024 · 2 comments


shripadk commented Mar 3, 2024

Describe the bug

I have enabled 4-bit quantization for fine-tuning mistralai/Mistral-7B-v0.1. Ludwig 0.10.1 appears to pin bitsandbytes < 0.41.0, and when I run the trainer I get the following warning:

You are calling `save_pretrained` to a 4-bit converted model, but your `bitsandbytes` version doesn't support it. 
If you want to save 4-bit models, make sure to have `bitsandbytes>=0.41.3` installed.
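For context, the warning is a version gate: saving 4-bit weights requires bitsandbytes >= 0.41.3, while Ludwig's pin keeps the installed version below 0.41.0, so the check can never pass. A minimal sketch of that gate (the version strings below are illustrative assumptions, not read from a real environment):

```python
# Sketch of the version gate behind the warning. REQUIRED and
# `installed` are illustrative values, not queried from pip.
def parse_version(v: str) -> tuple:
    """Turn a version string like '0.41.3' into (0, 41, 3) for comparison."""
    return tuple(int(p) for p in v.split("."))

REQUIRED = "0.41.3"   # minimum bitsandbytes for saving 4-bit models
installed = "0.40.2"  # hypothetical version satisfying Ludwig's < 0.41.0 pin

if parse_version(installed) < parse_version(REQUIRED):
    print("You are calling `save_pretrained` to a 4-bit converted model, "
          "but your `bitsandbytes` version doesn't support it.")
```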

To Reproduce
Steps to reproduce the behavior:

  1. Install Ludwig:

     pip install ludwig[full]

  2. Create the config file (model.yaml):
model_type: llm
base_model: mistralai/Mistral-7B-v0.1

quantization:
  bits: 4

adapter:
  type: lora

prompt:
  template: |
    ### Instruction:
    {instruction}

    ### Input:
    {input}

    ### Response:

input_features:
  - name: prompt
    type: text

output_features:
  - name: output
    type: text

generation:
  temperature: 0.1

trainer:
  type: finetune
  epochs: 3
  optimizer:
    type: paged_adam
  batch_size: 1
  eval_steps: 100
  learning_rate: 0.0002
  eval_batch_size: 2
  steps_per_checkpoint: 1000
  learning_rate_scheduler:
    decay: cosine
    warmup_fraction: 0.03
  gradient_accumulation_steps: 16
  enable_gradient_checkpointing: true

preprocessing:
  sample_ratio: 0.1

  3. Train the model:

     ludwig train --config model.yaml --dataset "ludwig://alpaca"

Expected behavior
The warning about bitsandbytes not supporting save_pretrained for 4-bit models should not appear; Ludwig's dependency pin should allow a bitsandbytes version that can save 4-bit weights.
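As a hedged workaround sketch (not an official Ludwig API), one could check the installed bitsandbytes version up front and fail fast before a long fine-tuning run silently produces unsavable 4-bit weights; the helper names `supports_4bit_save` and `check_bitsandbytes` are hypothetical:

```python
# Hypothetical pre-flight check, assuming bitsandbytes >= 0.41.3 is
# the minimum required to save 4-bit converted models.
from importlib.metadata import version, PackageNotFoundError

def supports_4bit_save(installed: str, required=(0, 41, 3)) -> bool:
    """True if `installed` (e.g. '0.41.3') meets the 4-bit-save minimum."""
    parts = tuple(int(p) for p in installed.split(".")[:3])
    return parts >= required

def check_bitsandbytes() -> bool:
    # False if bitsandbytes is missing or too old to save 4-bit models.
    try:
        return supports_4bit_save(version("bitsandbytes"))
    except PackageNotFoundError:
        return False
```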

Environment (please complete the following information):

  • OS: Linux
  • Version: 6.7.6-arch1-1
  • Python: 3.10.8
  • Ludwig: v0.10.1

@alexsherstinsky


yogeshhk commented Mar 4, 2024

Here is the notebook showing the run: https://colab.research.google.com/drive/1kmZhQKBzpHBJRJvvp9PEdPEUMfMu6dh7?usp=sharing. The first run asked for a RESTART; after restarting and re-running all the cells, the link above shows the output. Just FYI. By the way, the model's output is "","", but that is most likely an issue with the base model. @shripadk @alexsherstinsky
