Skip to content
View yushengsu-thu's full-sized avatar

Highlights

  • Pro

Organizations

@thunlp
Block or Report

Block or report yushengsu-thu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
yushengsu-thu/README.md

Hi there ๐Ÿ‘‹

About Me:

I'm Yusheng (Ethan) Su. (Personal Website, Google Scholar).

Research:

My research spans the areas of natural language processing and machine learning, specifically focusing on large language models (LLMs). I am particularly interested in how to better pre-train, fine-tune/instruction-tune, evaluate LLMs, and advance them in real-world scenarios. Thus, my research broadly covers the following topics:

  1. LLM Pre-training
  2. LLM SFT/HLHF, Alignment (Talk2)
  3. LLM-based agent (Talk3)

Open Source Projects:

Contact:

Github Stats:

yusheng's github stats

Pinned

  1. OpenBMB/AgentVerse OpenBMB/AgentVerse Public

    ๐Ÿค– AgentVerse ๐Ÿช is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

    JavaScript 3.7k 346

  2. thunlp/Prompt-Transferability thunlp/Prompt-Transferability Public

    On Transferability of Prompt Tuning for Natural Language Processing

    Python 87 11

  3. thunlp/CokeBERT thunlp/CokeBERT Public

    CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models

    Python 32 9

  4. yushengsu-thu.github.io yushengsu-thu.github.io Public

    Forked from academicpages/academicpages.github.io

    Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

    JavaScript 1

  5. llm.c llm.c Public

    Forked from karpathy/llm.c

    LLM training in simple, raw C/CUDA

    Cuda 1

  6. pre-training_cook pre-training_cook Public

    pre-training_cook

    1