noncollapse

Follow

Kai Ye noncollapse

Follow

I am a Phd student at LSE focusing on RL and POMDP

6 followers · 3 following

Achievements

Achievements

Highlights

Pro

Popular repositories Loading

LLM_short_course LLM_short_course Public

PowerShell 5 2
trlx-drrlhf trlx-drrlhf Public

Forked from CarperAI/trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 1
HD-Classify HD-Classify Public

A Survey of Effectiveness of Classification Methods on Heart Disease Data
DRRLHF DRRLHF Public
RL4LMs_DR RL4LMs_DR Public

Forked from allenai/RL4LMs

A modular RL library to fine-tune language models to human preferences

Python
general-preference-model general-preference-model Public

Forked from general-preference/general-preference-model

Official implementation of paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https://arxiv.org/abs/2410.02197)

Python