Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add distiluse-base-multilingual-cased-v2 transformer for embeddings #4975

Open
1 task done
jmservera opened this issue May 18, 2024 · 0 comments
Open
1 task done

Add distiluse-base-multilingual-cased-v2 transformer for embeddings #4975

jmservera opened this issue May 18, 2024 · 0 comments

Comments

@jmservera
Copy link

Describe your feature request

I need to create a multilingual vector database for searching for topics in a multilingual library of pdf books.

I've been searching for the multilingual sentence transformer that works best for my case (mainly Spanish, French and Catalan), and found this one that seems better suited for my needs. I've already built a custom container, but I would love to have it as an option.
It is the sentence-transformers/distiluse-base-multilingual-cased-v2.

Here's an example comparison of the sentence similarity:
image

Code of Conduct

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant