neural-chat-7b-v3-3-acc.md

Install lm-eval-harness from source. We used git commit f3b7917091afba325af3980a35d8a6dcba03dc3f.

lm_eval  --model hf --model_args pretrained="Intel/neural-chat-v3-3-int4-inc",autogptq=True,gptq_use_triton=True --device cuda:0 --tasks lambada_openai,hellaswag,piqa,winogrande,truthfulqa_mc1,openbookqa,boolq,rte,arc_easy,arc_challenge,mmlu  --batch_size 128

| Metric | FP16 | INT4 |
| --- | --- | --- |
| Avg. | 0.6778 | 0.6748 |
| mmlu | 0.5993 | 0.5926 |
| lambada_openai | 0.7303 | 0.7370 |
| hellaswag | 0.6639 | 0.6559 |
| winogrande | 0.7632 | 0.7735 |
| piqa | 0.8101 | 0.8074 |
| truthfulqa_mc1 | 0.4737 | 0.4737 |
| openbookqa | 0.3880 | 0.3680 |
| boolq | 0.8694 | 0.8694 |
| rte | 0.7581 | 0.7509 |
| arc_easy | 0.8266 | 0.8249 |
| arc_challenge | 0.5734 | 0.5691 |
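
The Avg. row appears to be the unweighted mean of the eleven task scores. This interpretation is an assumption, since the source does not say how the average was computed. A minimal Python sketch that recomputes it from the values listed above (the variable names `scores`, `fp16_avg`, and `int4_avg` are illustrative only):

```python
# Per-task accuracies copied from the results above: (FP16, INT4).
scores = {
    "mmlu": (0.5993, 0.5926),
    "lambada_openai": (0.7303, 0.7370),
    "hellaswag": (0.6639, 0.6559),
    "winogrande": (0.7632, 0.7735),
    "piqa": (0.8101, 0.8074),
    "truthfulqa_mc1": (0.4737, 0.4737),
    "openbookqa": (0.3880, 0.3680),
    "boolq": (0.8694, 0.8694),
    "rte": (0.7581, 0.7509),
    "arc_easy": (0.8266, 0.8249),
    "arc_challenge": (0.5734, 0.5691),
}

# Assumption: "Avg." is a plain arithmetic mean over tasks (no weighting).
fp16_avg = sum(fp16 for fp16, _ in scores.values()) / len(scores)
int4_avg = sum(int4 for _, int4 in scores.values()) / len(scores)

print(f"FP16 avg: {fp16_avg:.4f}")
print(f"INT4 avg: {int4_avg:.4f}")

# Per-task change from FP16 to INT4 (negative means INT4 scored lower).
for task, (fp16, int4) in scores.items():
    print(f"{task:16s} {int4 - fp16:+.4f}")
```

Under that assumption the recomputed means round to the reported 0.6778 (FP16) and 0.6748 (INT4).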