xf-zhao/README.md

Hi there 👋

  • 👋 Hi, I'm Xufeng Zhao
  • 📫 I'm now a third-year PhD student at the University of Hamburg (UHH)
  • 💼 Previously worked for 2 years at JD.COM
  • 👀 I'm interested in Robotics, Large Language Models (LLMs), and Reinforcement Learning (RL)
  • 💬 Contact me for any discussion about LLMs + RL + Robotics...
  • 🌱 Check out some of my recent publications and implementations below :)

Pinned

  1. Matcha-agent (Public)

    Official implementation of Matcha-agent, https://arxiv.org/abs/2303.08268

    Python · 18 stars · 2 forks

  2. mengdi-li/awesome-RLAIF (Public)

    A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

    76 stars · 2 forks

  3. LoT (Public)

    Official implementation of the LoT paper: "Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic"

    Python · 8 stars

  4. ISCM (Public)

    Official implementation of the paper "Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and Explorations"

    Python · 5 stars · 1 fork