[FEATURE] Use local LLM deployment as Judge #694

pascal-pfeiffer · 2024-05-03T16:09:01Z

🚀 Feature

Additional to the hardcoded models, add support to use local models as judges for evaluation. Can be simplified to require the OpenAI API.
Should be basically an endpoint selection, probably the Azure hosted pipeline can be extended to cover this. If already working, add documentation on how this can be done.

Motivation

Local development and evals

pascal-pfeiffer · 2024-05-07T13:37:02Z

One way this is already supported in the current version:

Have an endpoint running that supports the OpenAI API format, specifically chat.completions.
Start LLM Studio with environment variable to point to that endpoint: OPENAI_API_BASE="http://111.111.111.111:8000/v1"
Validate correct usage in "Settings page". Note that "Use Azure" must be off, and the environment variable that was set above should be visible below. Changing it here has no effect! This is only for testing the correct setting of the environment variable.
Run an experiment with GPT metric and use the correct model name at your endpoint:
Calls to the LLM judge are now directed to your own LLM endpoint

pascal-pfeiffer added the type/feature Feature request label May 3, 2024

pascal-pfeiffer self-assigned this May 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Use local LLM deployment as Judge #694

[FEATURE] Use local LLM deployment as Judge #694

pascal-pfeiffer commented May 3, 2024

pascal-pfeiffer commented May 7, 2024

[FEATURE] Use local LLM deployment as Judge #694

[FEATURE] Use local LLM deployment as Judge #694

Comments

pascal-pfeiffer commented May 3, 2024

🚀 Feature

Motivation

pascal-pfeiffer commented May 7, 2024