XuandongZhao

Follow

😎

study

Xuandong Zhao XuandongZhao

😎

study

Follow

CS PhD@UCSB

106 followers · 237 following

Achievements

BetaSend feedback

Achievements

BetaSend feedback

Highlights

Pro

Block or Report

Block or report XuandongZhao

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

WatermarkAttacker WatermarkAttacker Public

Invisible Image Watermarks Are Provably Removable Using Generative AI

Python 134 24
Unigram-Watermark Unigram-Watermark Public

[ICLR 2024] Provable Robust Watermarking for AI-Generated Text

Python 20 5
weak-to-strong weak-to-strong Public

Weak-to-Strong Jailbreaking on Large Language Models

Python 46 6
pf-decoding pf-decoding Public

Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs

Python 7
NPPrompt NPPrompt Public

[ACL 2023] NPPrompt: Pre-trained Language Models Can be Fully Zero-Shot Learners

Python 7 3
DRW DRW Public

[EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP

Python 10 2