finetuned using llama-13B #13

Open
Huangbukun opened this issue Dec 22, 2023 · 3 comments

Comments

@Huangbukun

Hello, if I want to fine-tune using the LLaMA-13B .pth checkpoint, what changes need to be made to the train.sh script? After fine-tuning with the same parameters as LLaMA-7B, the accuracy is very low.

@ikodoh
Contributor

ikodoh commented Dec 26, 2023

In our experiments, I changed --adapter_layer from 32 to 40, and you may also decrease the learning rate.
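For reference, a minimal sketch of what a 13B variant of train.sh could look like. Only --adapter_layer 40 and the idea of lowering the learning rate come from this thread; the launcher, script name, paths, and every other flag and value below are assumed placeholders and should be matched against the repository's actual 7B script.

```bash
# Minimal sketch of a train.sh adapted for LLaMA-13B.
# Only --adapter_layer 40 and the reduced learning rate come from this thread;
# the script name, paths, and the remaining flags/values are assumptions.

# LLaMA-13B has 40 transformer layers (LLaMA-7B has 32), hence --adapter_layer 40.
ADAPTER_LAYER=40
# Assumption: start below the 7B learning rate and tune from there.
LR=6e-3

torchrun --nproc_per_node 8 finetune.py \
    --llama_model_path ./llama-13B \
    --adapter_layer "$ADAPTER_LAYER" \
    --lr "$LR" \
    --batch_size 4 \
    --epochs 5 \
    --output_dir ./output_13b
```

If the larger model runs out of GPU memory, lowering the per-GPU batch size (and compensating with gradient accumulation, if the training script supports it) is a common workaround.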

@Huangbukun
Author

> In our experiments, I changed --adapter_layer from 32 to 40, and you may also decrease the learning rate.

Thank you!

@Huangbukun
Author

> In our experiments, I changed --adapter_layer from 32 to 40, and you may also decrease the learning rate.

Hello, after I adjusted --adapter_layer to 40 and changed the learning rate to 9e-3, the result in the second row of your table only reaches 65% for me. I don't know what I did wrong.
