You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you to the authors for providing a method for transforming a dense model into MoEs for more efficient inference!
MoEfication provides acceleration results for the transformed model on CPU and GPU, while the current technical report for Llama MoE does not contain this information. Can the authors provide the relevant reference information?
The text was updated successfully, but these errors were encountered:
Thank you to the authors for providing a method for transforming a dense model into MoEs for more efficient inference!
MoEfication provides acceleration results for the transformed model on CPU and GPU, while the current technical report for Llama MoE does not contain this information. Can the authors provide the relevant reference information?
The text was updated successfully, but these errors were encountered: