qlora training on qwen1.5-15b-chat #459

wsp317 · 2024-05-13T09:56:04Z

While training qwen1.5-14b-chat, I hit the error below, with transformers==4.38.2

RuntimeError(
"Unsloth: Tokenizer's pad_token cannot be = eos_token, and we couldn't find a\n"
"replacement of either <|reserved... or <|placeholder..."
)

danielhanchen · 2024-05-13T10:01:01Z

Oh, that is an issue: the pad_token must not be the same as the eos_token, otherwise the finetune will be incorrect. I'll see if I can extend the tokenizer itself.
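One likely reason the two tokens must differ can be shown with a small sketch. It assumes a data collator that builds labels by masking every position whose id equals the pad token id, as some Hugging Face collators do. The token ids and the `mask_padding` helper below are hypothetical and for illustration only; this is not Unsloth's actual code.

```python
IGNORE_INDEX = -100  # label value that PyTorch's cross-entropy loss ignores by default


def mask_padding(input_ids, pad_token_id):
    """Build labels by replacing every pad position with IGNORE_INDEX."""
    return [IGNORE_INDEX if tok == pad_token_id else tok for tok in input_ids]


# Hypothetical token ids, chosen only for this illustration.
EOS_ID = 2
SEPARATE_PAD_ID = 0

# Case 1: pad_token == eos_token.
# The sequence holds one real EOS followed by two pads that share its id,
# so the real EOS label is masked out together with the padding.
shared = [11, 12, 13, EOS_ID, EOS_ID, EOS_ID]
labels_shared = mask_padding(shared, pad_token_id=EOS_ID)

# Case 2: a distinct pad token.
# Only the padding is masked and the EOS label is kept.
distinct = [11, 12, 13, EOS_ID, SEPARATE_PAD_ID, SEPARATE_PAD_ID]
labels_distinct = mask_padding(distinct, pad_token_id=SEPARATE_PAD_ID)

print(labels_shared)
print(labels_distinct)
```

In the first case no EOS label reaches the loss, so under this assumption the model would get no training signal for ending its output.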

wsp317 · 2024-05-13T10:07:35Z

I changed the pad_token from <|endoftext|> to <|im_end|> in qwen's tokenizer_config.json file, and the training seems to work.
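For anyone who wants to script the manual edit described above, here is a minimal sketch. It assumes `pad_token` is stored as a plain string under the top-level `pad_token` key of tokenizer_config.json; the `set_pad_token` helper and the example path are hypothetical, and whether <|im_end|> is a suitable pad token for a given checkpoint is not confirmed here.

```python
import json
from pathlib import Path


def set_pad_token(config_path, new_pad_token):
    """Rewrite the top-level "pad_token" entry of a tokenizer_config.json file.

    Returns the previous value (None if the key was absent) so the change
    can be reverted later.
    """
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))
    old_pad_token = config.get("pad_token")
    config["pad_token"] = new_pad_token
    # ensure_ascii=False keeps any non-ASCII entries readable in the file.
    path.write_text(
        json.dumps(config, indent=2, ensure_ascii=False),
        encoding="utf-8",
    )
    return old_pad_token


# Example usage (hypothetical local path to the downloaded model):
# previous = set_pad_token("Qwen1.5-14B-Chat/tokenizer_config.json", "<|im_end|>")
```

Keeping the returned old value makes it easy to restore the original config once the upstream fix is installed.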

danielhanchen · 2024-05-16T04:41:08Z

@wsp317 I fixed it just then! Sorry for the delay!

If you're on a local machine, please update Unsloth via

pip uninstall unsloth -y
pip install --upgrade --force-reinstall --no-cache-dir git+https://github.com/unslothai/unsloth.git

Colab and Kaggle are fine (just restart the session)

danielhanchen added the "fixed - pending confirmation" label (Fixed, waiting for confirmation from poster) May 16, 2024
