Skip to content

Latest commit

 

History

History
15 lines (12 loc) · 676 Bytes

baichuan2-7b-chat-acc.md

File metadata and controls

15 lines (12 loc) · 676 Bytes

Due to licensing restrictions, we are unable to release the model. Install lm-eval-harness from source, and the git id 96d185fa6232a5ab685ba7c43e45d1dbb3bb906d.

We used the following command for evaluation.

lm_eval --model hf  --model_args pretrained="./",autogptq=True,gptq_use_triton=True,trust_remote_code=True --device cuda:0 --tasks ceval-valid,cmmlu,mmlu,gsm8k --batch_size 16 --num_fewshot 0
Metric BF16 INT4
Avg. 0.4504 0.4470
mmlu 0.5096 0.5053
cmmlu 0.5486 0.5426
ceval 0.5394 0.5223
gsm8k 0.2039 0.2176