Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can you report the running time on hardware? #57

Open
qiuzh20 opened this issue Jan 23, 2024 · 0 comments
Open

Can you report the running time on hardware? #57

qiuzh20 opened this issue Jan 23, 2024 · 0 comments

Comments

@qiuzh20
Copy link

qiuzh20 commented Jan 23, 2024

Thank you to the authors for providing a method for transforming a dense model into MoEs for more efficient inference!

MoEfication provides acceleration results for the transformed model on CPU and GPU, while the current technical report for Llama MoE does not contain this information. Can the authors provide the relevant reference information?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant