-
Notifications
You must be signed in to change notification settings - Fork 61
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Would this work to train flan-t5-xxl on multiple GPUs? #46
Comments
Bump |
@experimarketing : Right now, it only works with T5 and mT5 on single GPU. I was away from the development for a couple of months. So, I didn't upgrade it to support FlanT5 and multi GPU. But, I will integrate it ASAP. |
Would it work with the -xxl version? I believe model parrellism would be required to run it. As it is too large to run on a single GPU. |
@experimarketing : I'm afraid, It won't! |
Thanks for looking into this. Kindly let us know after completion. one more thing really many thanks for developing this library. It simplified the usage of the T5 model. |
No description provided.
The text was updated successfully, but these errors were encountered: