CodeUltraFeedback: aligning large language models to coding preferences (Python, updated Mar 17, 2024)
Repository for the survey of Bias and Fairness in Information Retrieval (IR) with LLMs.
Code and data for ACL ARR 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"
Notebooks for evaluating LLM-based applications using the LLM-as-a-judge pattern.
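The LLM-as-a-judge pattern referenced above can be sketched roughly as follows: a judge model is prompted with a rubric, the question, and a candidate answer, and returns a numeric score. This is a minimal illustration, not code from any of the listed repositories; `call_judge` is a hypothetical stand-in for a real LLM API call.

```python
# Minimal sketch of the LLM-as-a-judge pattern: a judge model scores a
# candidate answer against a rubric prompt and returns a number.

JUDGE_PROMPT = (
    "You are an impartial judge. Rate the answer to the question on a "
    "scale of 1-10 and reply with the number only.\n"
    "Question: {question}\nAnswer: {answer}\nScore:"
)

def call_judge(prompt: str) -> str:
    # Hypothetical placeholder: a real implementation would call an LLM
    # provider's API here. A fixed reply keeps the sketch runnable.
    return "8"

def judge_answer(question: str, answer: str) -> int:
    """Build the rubric prompt, query the judge, and parse the score."""
    prompt = JUDGE_PROMPT.format(question=question, answer=answer)
    reply = call_judge(prompt)
    return int(reply.strip())

score = judge_answer("What is 2 + 2?", "4")
```

In practice the judge's reply is parsed defensively (models do not always return the number alone), and scores are often averaged over multiple judge calls to reduce variance.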