Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finetuning Models #562

Open
ak2028 opened this issue Aug 27, 2023 · 2 comments
Open

Finetuning Models #562

ak2028 opened this issue Aug 27, 2023 · 2 comments
Assignees
Labels
bug Something isn't working

Comments

@ak2028
Copy link

ak2028 commented Aug 27, 2023

I followed the tutorial at train/finetune_example/mpt-7b-arc-easy--gpu.yaml and added an additional evaluation using icl_tasks: 'eval/yamls/tasks_light.yaml' in order to evaluate accuracy on ARC Easy. As the model finetuned, training loss decreased, but so did accuracy, which appears to be a bug.

I repeated this using the full ARC Easy training set and the same thing occurred. Is there a reason that finetuning causes training loss to decrease but accuracy on evaluation to decrease?

@ak2028 ak2028 added the bug Something isn't working label Aug 27, 2023
@samhavens
Copy link
Contributor

When you used all of ARC easy, can you share what changes you made to the YAML?

@ak2028
Copy link
Author

ak2028 commented Aug 29, 2023

Sure, I only changed:
data_dir: train/finetune_example/arc-easy/
In arc-easy I have a train.jsonl

I downloaded the data from: https://huggingface.co/datasets/ai2_arc

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants