Skip to content

Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"

License

Notifications You must be signed in to change notification settings

MadryLab/journey-TRAK

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

The Journey, Not the Destination: How Data Guides Diffusion Models

arXiv Code style: black

Check out our blog post!

In our paper, we introduce a data attribution framework for diffusion models, together with an efficent method fo computing attribution scores. Given a generated image X and a diffusion model of interest, you can use our library to identify training examples which strongly guide the diffusion model towards generating X.

In particular, we provide attribution scores for each step of the diffusuion process:

MSCOCO Attributions GIF CIFAR10 Attributions GIF

Usage

Check out the examples. There, we:

  • provide pre-computed attribution features so you can quickly score your generated images
  • showcase how to compute the final scores using pre-computed features
  • provide scripts to compute attribution features

Our code is based on the TRAK API.

Citation

If you use this code in your work, please cite using the following BibTeX entry:

@inproceedings{georgiev2023journey,
      title={The Journey, Not the Destination: How Data Guides Diffusion Models}, 
      author={Kristian Georgiev and Joshua Vendrow and Hadi Salman and Sung Min Park and Aleksander Madry},
      booktitle = {Arxiv preprint arXiv:2312.06205},
      year={2023},
}