
RLHFlow

Code for the Workflow of Reinforcement Learning from Human Feedback (RLHF)

Hi there 👋

Popular repositories

  1. RLHF-Reward-Modeling (Public)

    A recipe to train reward models for RLHF.

    Python · 208 stars · 13 forks

  2. Online-RLHF (Public)

    A recipe for online RLHF.

    Python · 135 stars · 11 forks

  3. Directional-Preference-Alignment (Public)

    Directional Preference Alignment

    31 stars · 1 fork

  4. RLHFlow.github.io (Public)

    Webpage for RLHFlow

    HTML 7

  5. .github (Public)

Repositories

Showing 5 of 5 repositories

Top languages

Python, HTML