Use arbitrary open-source llm #297

TribeDH · 2023-12-19T10:58:08Z

Hi everyone, is it possible (or will it be in the future) to set the -model parameter to an arbitrary LLM outside the models list? For example, this would be useful for extracting entities from documents written in a different language, or for using a domain-specific fine-tuned LLM. Thanks for your time.

caufieldjh · 2023-12-20T15:44:39Z

Thanks @TribeDH. There are definite plans to support this.

TribeDH · 2023-12-22T08:51:09Z

That's awesome! After some tests on an Italian crime news dataset, I found that llama-2-7b-chat gets the best results in a foreign language, and that adding some simple prompt engineering adjustments to the class's prompt section (for example, "act as an Italian speaker") really improves the results. I hope this helps the research in the meantime, while the LLM upgrade is in progress.
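As a rough illustration of the kind of prompt adjustment described above, here is a minimal sketch. It is not the project's template code: the function name `add_language_hint`, the example prompt text, and the exact wording of the hint are all hypothetical, and it only shows the idea of prepending a role instruction to an existing class prompt.

```python
def add_language_hint(class_prompt: str, language: str = "Italian") -> str:
    """Prepend a role instruction to an existing extraction prompt.

    Hypothetical helper for illustration only; the hint wording is an
    assumption modeled on the "act as an Italian speaker" example.
    """
    hint = f"Act as a native {language} speaker."
    # Strip stray whitespace so the hint and the prompt join cleanly.
    return f"{hint} {class_prompt.strip()}"


# Made-up class prompt, standing in for whatever the template already contains.
base_prompt = "A semicolon-separated list of people named in the news report."
print(add_language_hint(base_prompt))
```

Keeping the hint separate from the class prompt makes it easy to compare results with and without it on the same dataset.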

caufieldjh · 2023-12-22T15:41:52Z

Fantastic! I've also seen some very exciting results with llama2-7b and mistral-7b so I want to be sure we support those.

caufieldjh added the enhancement (New feature or request) label Dec 20, 2023

caufieldjh self-assigned this Jan 24, 2024
