Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What's the minimal device requirement on this project? #19

Open
leeaction opened this issue Jan 15, 2024 · 4 comments
Open

What's the minimal device requirement on this project? #19

leeaction opened this issue Jan 15, 2024 · 4 comments

Comments

@leeaction
Copy link

I'm trying this model , please provide the minimal device requirement information , such as GPU(type/memory) CPU SSD

@leeaction leeaction changed the title What's minimal device requirement on this project? What's the minimal device requirement on this project? Jan 15, 2024
@jpthu17
Copy link
Member

jpthu17 commented Jan 16, 2024

If you intend to perform full parameter fine-tuning, it should be carried out on the 8*A100 (80G). If opting for Lora, the tuning process is feasible on 4*V100 (32G).

To mitigate hardware demands, consider utilizing a smaller language model, like phi-2. Fine-tuning for phi-2 is achievable on 3090 GPUs. I intend to enhance the code in the upcoming weeks to facilitate fine-tuning for phi-2.

@lucasjinreal
Copy link

Hi,any plan to support some decent Chinese LLM models? There are now some hight quality Chinese video instruct data now.

@jpthu17
Copy link
Member

jpthu17 commented Jan 17, 2024

Hi,any plan to support some decent Chinese LLM models? There are now some hight quality Chinese video instruct data now.

We will support fine-tuning of the Qwen and mixture of experts in the update.

@lucasjinreal
Copy link

lucasjinreal commented Jan 17, 2024

@jpthu17 Hoping for it! Please pin me once you guys got some upates.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants