Is a custom vision encoder supported (llava-llama3)? #668

Open
Yanllan opened this issue May 9, 2024 · 2 comments

Comments

@Yanllan

Yanllan commented May 9, 2024

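Is a custom vision encoder supported (llava-llama3)?
For example, replacing CLIP with SigLIP?
How would this be implemented, and which code needs to be modified?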
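
For context on what a CLIP to SigLIP swap usually touches, here is a minimal sketch. It is not XTuner's implementation: `VisionEncoderSpec`, `num_patch_tokens`, `patch_slice` and `projector_shape` are hypothetical names made up for illustration. The checkpoint ids and sizes are those of the public Hugging Face models `openai/clip-vit-large-patch14-336` and `google/siglip-so400m-patch14-384`, and the 4096 LLM width assumes Llama-3-8B. The sketch only does the bookkeeping: which `transformers` classes load the tower, how many visual tokens come out, whether a leading CLS token has to be dropped, and what input width the projector needs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VisionEncoderSpec:
    """Hypothetical description of a vision tower (not an XTuner class)."""
    model_class: str       # transformers class used to load the tower
    processor_class: str   # matching transformers image processor
    checkpoint: str        # Hugging Face Hub model id
    image_size: int
    patch_size: int
    hidden_size: int
    has_cls_token: bool    # CLIP prepends a CLS token, SigLIP does not


CLIP_L_336 = VisionEncoderSpec(
    model_class="CLIPVisionModel",
    processor_class="CLIPImageProcessor",
    checkpoint="openai/clip-vit-large-patch14-336",
    image_size=336, patch_size=14, hidden_size=1024, has_cls_token=True,
)

SIGLIP_SO400M_384 = VisionEncoderSpec(
    model_class="SiglipVisionModel",
    processor_class="SiglipImageProcessor",
    checkpoint="google/siglip-so400m-patch14-384",
    image_size=384, patch_size=14, hidden_size=1152, has_cls_token=False,
)


def num_patch_tokens(spec: VisionEncoderSpec) -> int:
    """Patch tokens per image; the patch conv has no padding, so floor-divide."""
    return (spec.image_size // spec.patch_size) ** 2


def patch_slice(spec: VisionEncoderSpec) -> slice:
    """Slice over the token axis that keeps only patch tokens."""
    return slice(1, None) if spec.has_cls_token else slice(None)


def projector_shape(spec: VisionEncoderSpec, llm_hidden_size: int) -> tuple:
    """(in_features, out_features) of the first projector layer."""
    return (spec.hidden_size, llm_hidden_size)


if __name__ == "__main__":
    LLAMA3_8B_HIDDEN = 4096  # assumption: Llama-3-8B as the language model
    for spec in (CLIP_L_336, SIGLIP_SO400M_384):
        print(spec.checkpoint,
              num_patch_tokens(spec),                  # 576 for CLIP, 729 for SigLIP
              patch_slice(spec),
              projector_shape(spec, LLAMA3_8B_HIDDEN))
```

Two consequences follow from the numbers. A projector trained against CLIP (1024 input features) cannot be reused with this SigLIP checkpoint (1152 input features), so it would need retraining. Any code that unconditionally drops the first token of the vision output would also discard a real patch when the tower is SigLIP.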

@hhaAndroid
Collaborator

We are already refactoring the vision part; it should be ready soon.

@ztfmars

ztfmars commented May 17, 2024

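Is a custom vision encoder supported (llava-llama3)? For example, replacing CLIP with SigLIP? How would this be implemented, and which code needs to be modified?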

Wow, bro, did you also read about Google's PaliGemma? SigLIP really does work better than the ViT-based CLIP.


3 participants