qlora training on qwen1.5-14b-chat #459

Open
wsp317 opened this issue May 13, 2024 · 3 comments
Labels
fixed - pending confirmation Fixed, waiting for confirmation from poster

Comments

@wsp317

wsp317 commented May 13, 2024

While training qwen1.5-14b-chat, I hit the error below (transformers==4.38.2):

RuntimeError(
"Unsloth: Tokenizer's pad_token cannot be = eos_token, and we couldn't find a\n"
"replacement of either <|reserved... or <|placeholder..."
)
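
The message suggests that a spare token whose name starts with `<|reserved` or `<|placeholder` was looked for in the vocabulary and none was found. The snippet below is only a guess at the general shape of such a search, not Unsloth's actual code: the helper name `find_spare_token`, the toy vocabularies, and the token name `<|reserved_0|>` are all made up for illustration.

```python
def find_spare_token(vocab, eos_token, prefixes=("<|reserved", "<|placeholder")):
    """Return the first vocab entry starting with one of the prefixes, or None.

    Hypothetical helper: `vocab` maps token strings to integer IDs.
    """
    for token in sorted(vocab, key=vocab.get):  # scan in token-ID order
        if token != eos_token and token.startswith(prefixes):
            return token
    return None


# Toy vocabularies with invented token names, not Qwen's real vocabulary.
toy_vocab = {"hello": 0, "<eos>": 1, "<|reserved_0|>": 2}
bare_vocab = {"hello": 0, "<eos>": 1}

print(find_spare_token(toy_vocab, "<eos>"))   # a candidate pad token exists
print(find_spare_token(bare_vocab, "<eos>"))  # nothing to fall back on
```

When the search comes back empty, as in the second call, there is no unused token to repurpose as padding, which matches the situation the error describes.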

@danielhanchen
Contributor

Oh that is an issue - the pad_token must not be the same as the eos_token, otherwise the finetune will be incorrect. I'll see if I can extend the tokenizer itself.
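
One way to see why a shared pad/EOS token can hurt finetuning: a common way to build causal-LM labels is to set every position whose ID equals the pad token ID to -100 so the loss ignores padding (Hugging Face's `DataCollatorForLanguageModeling` does this). If pad and EOS share an ID, the genuine EOS is masked too, and the model gets no training signal for when to stop. A minimal pure-Python sketch, assuming made-up token IDs and a hypothetical helper `mask_pad_labels`:

```python
IGNORE_INDEX = -100  # label value that PyTorch's cross-entropy loss skips by default


def mask_pad_labels(input_ids, pad_id):
    """Copy input_ids to labels, masking every position equal to pad_id."""
    return [IGNORE_INDEX if tok == pad_id else tok for tok in input_ids]


# Illustrative IDs only, not Qwen's real vocabulary.
EOS_ID = 2
SEPARATE_PAD_ID = 0

# One sequence: three content tokens, EOS, then two padding positions.
seq_shared = [11, 12, 13, EOS_ID, EOS_ID, EOS_ID]  # pad_token == eos_token
seq_separate = [11, 12, 13, EOS_ID, SEPARATE_PAD_ID, SEPARATE_PAD_ID]

labels_shared = mask_pad_labels(seq_shared, pad_id=EOS_ID)
labels_separate = mask_pad_labels(seq_separate, pad_id=SEPARATE_PAD_ID)

print(labels_shared)    # the genuine EOS at index 3 is masked along with the padding
print(labels_separate)  # the EOS at index 3 remains a training target
```

With a distinct pad token, only the padding positions are masked and the EOS label survives.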

@wsp317
Author

wsp317 commented May 13, 2024

I changed the pad_token from <|endoftext|> to <|im_end|> in Qwen's tokenizer_config.json file, and the training seems to work.
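
For anyone who prefers to script this kind of edit rather than change the file by hand, here is a rough sketch. It assumes `pad_token` and `eos_token` are stored in tokenizer_config.json either as plain strings or as dicts with a `content` key; the helper names and the placeholder tokens `<eos>` and `<pad>` are invented for the demo, and the chosen pad token must already exist in the model's vocabulary. Check the values in your own file before reusing it.

```python
import json
import tempfile
from pathlib import Path


def token_text(value):
    """Special tokens may be a plain string or a dict with a 'content' key."""
    if isinstance(value, dict):
        return value.get("content")
    return value


def set_pad_token(config_path, new_pad):
    """Rewrite pad_token in a tokenizer_config.json, refusing a pad equal to eos_token."""
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))
    if token_text(config.get("eos_token")) == new_pad:
        raise ValueError("new pad_token would equal eos_token")
    config["pad_token"] = new_pad
    path.write_text(json.dumps(config, indent=2, ensure_ascii=False), encoding="utf-8")
    return config


# Demo on a throwaway file with placeholder token names, not a real model directory.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = Path(tmp) / "tokenizer_config.json"
    demo_path.write_text(
        json.dumps({"eos_token": "<eos>", "pad_token": "<eos>"}), encoding="utf-8"
    )
    updated = set_pad_token(demo_path, "<pad>")
    print(updated["pad_token"])
```

The guard against a pad equal to `eos_token` mirrors the constraint in the error message; keep a backup of the original file before overwriting it.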

@danielhanchen
Contributor

@wsp317 I fixed it just now! Sorry for the delay!

If you're on a local machine, please update Unsloth via

pip uninstall unsloth -y
pip install --upgrade --force-reinstall --no-cache-dir git+https://github.com/unslothai/unsloth.git

Colab and Kaggle are fine (just restart the session).

danielhanchen added the "fixed - pending confirmation" label (Fixed, waiting for confirmation from poster) on May 16, 2024