Highlights
- Pro
Popular repositories Loading
-
-
trlx-drrlhf
trlx-drrlhf PublicForked from CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Python 1
-
HD-Classify
HD-Classify PublicA Survey of Effectiveness of Classification Methods on Heart Disease Data
-
-
RL4LMs_DR
RL4LMs_DR PublicForked from allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
Python
-
general-preference-model
general-preference-model PublicForked from general-preference/general-preference-model
Official implementation of paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https://arxiv.org/abs/2410.02197)
Python
If the problem persists, check the GitHub status page or contact support.


