Can you report the running time on hardware? #57

qiuzh20 · 2024-01-23T02:41:38Z

Thank you to the authors for providing a method for transforming a dense model into MoEs for more efficient inference!

MoEfication provides acceleration results for the transformed model on CPU and GPU, while the current technical report for Llama MoE does not contain this information. Can the authors provide the relevant reference information?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can you report the running time on hardware? #57

Can you report the running time on hardware? #57

qiuzh20 commented Jan 23, 2024

Can you report the running time on hardware? #57

Can you report the running time on hardware? #57

Comments

qiuzh20 commented Jan 23, 2024