Use arbitrary open-source llm #297

Open
TribeDH opened this issue Dec 19, 2023 · 3 comments

TribeDH commented Dec 19, 2023

Hi everyone, is it possible (or will it be in the future) to set the -model parameter to an arbitrary LLM outside the models list? For example, this could be useful for extracting entities from documents written in a different language, or for using a domain-specific fine-tuned LLM. Thanks for your time.

caufieldjh (Member) commented

Thanks @TribeDH. There are definite plans to support this.

caufieldjh added the enhancement (New feature or request) label Dec 20, 2023

TribeDH commented Dec 22, 2023

That's awesome! After some tests on an Italian crime news dataset, I found that llama-2-7b-chat gives the best results in a foreign language, and that adding some simple prompt engineering adjustments in the class's prompt section (for example "act as an italian speaker") really improves the results. I hope this helps the research while the LLM upgrade is in progress.
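
A minimal sketch of the kind of prompt adjustment described above. It assumes the class's prompt is available as a plain string; `with_language_hint` and the example prompt text are hypothetical names used only for illustration and are not part of the tool.

```python
def with_language_hint(prompt: str, language: str = "Italian") -> str:
    """Prepend a persona instruction to an existing extraction prompt.

    Hypothetical helper for illustration only; not part of any library.
    """
    # Pick "a" or "an" so the instruction reads naturally.
    article = "an" if language[0].lower() in "aeiou" else "a"
    hint = f"Act as {article} {language} speaker."
    return f"{hint} {prompt.strip()}"


# Example prompt text is made up for this sketch.
original = "the name of a person mentioned in the text"
print(with_language_hint(original))
```

The hint is kept as a separate sentence in front of the original prompt, so the original wording stays unchanged and the language can be swapped without editing it.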

caufieldjh (Member) commented

Fantastic! I've also seen some very exciting results with llama2-7b and mistral-7b so I want to be sure we support those.

caufieldjh self-assigned this Jan 24, 2024