Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

想问一下在A800上测试的吞吐量,换算到推理速度的话有多少tokens/s? #138

Open
5 tasks done
HJT9328 opened this issue Nov 21, 2023 · 0 comments
Open
5 tasks done
Labels
question Further information is requested

Comments

@HJT9328
Copy link

HJT9328 commented Nov 21, 2023

Required prerequisites

Questions

7B模型实现了A800 上单卡吞吐的情况下实现了 70tokens/s 比较怀疑,

Checklist

  • I have provided all relevant and necessary information above.
  • I have chosen a suitable title for this issue.
@HJT9328 HJT9328 added the question Further information is requested label Nov 21, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

1 participant