Very slow alpaca-eval times for OLMo models #143

Open
dwadden opened this issue Apr 8, 2024 · 3 comments · May be fixed by #151
dwadden (Contributor) commented Apr 8, 2024

Alpaca-eval on OLMo models is very slow -- possibly because OLMo can't use vLLM, and Hugging Face generation is slow in general. Here's an example Beaker job; based on the tqdm log it will take 100 hours (~4 days) to evaluate 800 examples. This isn't really a workable solution. Potential options:

  • Get OLMo to play nicely with vLLM. We'd probably need to recruit an engineer to help with this.
  • Evaluate on a subset of maybe 100 instances for OLMo models. Still not ideal, but better than nothing.

@hamishivi @yizhongw any thoughts?

hamishivi (Collaborator) commented:
My understanding is that vLLM integration for OLMo was being looked at by @AkshitaB at some point, although I'm not sure of the current status.
As for the second option, feel free to add a subset flag, since it might be useful for debugging anyway. Just reducing the prompt set used should work naively with the existing code (I do this myself for debugging); see the sketch below.
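
A minimal sketch of what such a flag could look like, assuming the eval script holds its prompts in a list; the flag name and helper function are hypothetical, not part of the existing code:

```python
import argparse
import random

def parse_args():
    parser = argparse.ArgumentParser()
    # Hypothetical flag name; match it to the eval script's existing CLI.
    parser.add_argument("--subset_size", type=int, default=None,
                        help="If set, evaluate on a random subset of this many prompts.")
    parser.add_argument("--seed", type=int, default=42)
    return parser.parse_args()

def maybe_subsample(prompts, subset_size, seed=42):
    """Return a fixed random subset of the prompts, or all of them if no limit is set."""
    if subset_size is None or subset_size >= len(prompts):
        return prompts
    rng = random.Random(seed)  # seeded so reruns evaluate the same subset
    return rng.sample(prompts, subset_size)
```

Sampling with a fixed seed keeps the subset stable across runs, so scores stay comparable while debugging.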

AkshitaB commented Apr 9, 2024

@dwadden vLLM already supports OLMo in its latest version, so you should be able to use it directly. You'll need to convert the OLMo checkpoint to HF format using the conversion script.

Also, make sure to use the latest vLLM, since they fixed a bug in the tensor-parallel case in this commit, after their last pip release.
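
For reference, a minimal sketch of generating with vLLM once the checkpoint is converted, using vLLM's `LLM`/`SamplingParams` API; the checkpoint path and generation settings are placeholders, not values from this thread:

```python
from vllm import LLM, SamplingParams

# Point this at the output of the OLMo -> HF conversion script.
llm = LLM(
    model="/path/to/olmo-hf-checkpoint",  # placeholder path
    tensor_parallel_size=2,               # relies on the post-release fix mentioned above
)

# Greedy decoding as an example; tune for the actual eval.
sampling_params = SamplingParams(temperature=0.0, max_tokens=512)

prompts = ["Write a haiku about evaluation speed."]
outputs = llm.generate(prompts, sampling_params)

for output in outputs:
    print(output.outputs[0].text)
```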

hamishivi (Collaborator) commented:

Thanks, Akshita!
