Make-A-Scene - PyTorch

Unofficial PyTorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

Figure 1 from the paper.

Note: this is a work in progress.

We are at the training stage! Progress can be followed in the Discord channel on the LAION Discord: https://discord.gg/DghvZDKu. Data preprocessing and VQ-SEG training are finished, and we are currently training VQ-IMG. Training checkpoints and demos will be released soon. The transformer implementation is in progress, and we hope to start training it as soon as VQ-IMG finishes.

Demo

VQ-IMG: https://colab.research.google.com/drive/1SPyQ-epTsAOAu8BEohUokN4-b5RM_TnE?usp=sharing

Paper Description

Make-A-Scene modifies the VQGAN framework. It relies heavily on semantic segmentation maps as extra conditioning, which gives more control over the generation process. Moreover, it also conditions on text. The main improvements are the following:

Segmentation conditioning: a separate VQ-VAE (VQ-SEG) is trained, with the loss modified to a weighted binary cross-entropy (Section 3.4).
VQGAN training (VQ-IMG) is extended with a face loss and an object loss (Sections 3.3 and 3.5).
Classifier-free guidance for the autoregressive transformer (Section 3.7).
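
To make two of the items above more concrete, here is a minimal sketch in plain Python (no PyTorch, so it runs anywhere). It is not the code from this repository. `weighted_bce` illustrates the idea of a weighted binary cross-entropy, and `guided_logits` illustrates the usual classifier-free guidance formula, which is how we read Section 3.7 of the paper. The function names, the flat-list inputs, and the example weights are all assumptions made for illustration.

```python
import math


def weighted_bce(probs, targets, weights, eps=1e-7):
    """Mean weighted binary cross-entropy over equally long flat lists.

    probs   -- predicted probabilities in [0, 1]
    targets -- ground-truth labels (0.0 or 1.0)
    weights -- per-element weights (hypothetical; e.g. larger for the
               segmentation channels that should be emphasized)
    """
    if not probs or not (len(probs) == len(targets) == len(weights)):
        raise ValueError("inputs must be non-empty and of equal length")
    total = 0.0
    for p, y, w in zip(probs, targets, weights):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= w * (y * math.log(p) + (1.0 - y) * math.log(1.0 - p))
    return total / len(probs)


def guided_logits(cond_logits, uncond_logits, scale):
    """Classifier-free guidance on next-token logits.

    Moves from the unconditional logits toward the conditional ones:
    scale=0 returns uncond_logits, scale=1 returns cond_logits, and
    scale>1 extrapolates past the conditional prediction.
    """
    return [u + scale * (c - u) for c, u in zip(cond_logits, uncond_logits)]


if __name__ == "__main__":
    # Up-weighting the first element doubles its contribution to the loss.
    print(weighted_bce([0.5, 0.9], [1.0, 1.0], [2.0, 1.0]))
    # Guidance scale of 3 pushes logits further in the conditional direction.
    print(guided_logits([2.0, 0.0], [1.0, 1.0], 3.0))
```

In a real training setup both functions would operate on tensors (per-pixel class maps for the loss, vocabulary-sized logits for guidance); the arithmetic per element is the same as in this sketch.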

Training Pipeline

Figure 6 from the paper.

What needs to be done?

See the individual folders for details.

Citation

@misc{https://doi.org/10.48550/arxiv.2203.13131,
  doi = {10.48550/ARXIV.2203.13131},
  url = {https://arxiv.org/abs/2203.13131},
  author = {Gafni, Oran and Polyak, Adam and Ashual, Oron and Sheynin, Shelly and Parikh, Devi and Taigman, Yaniv},
  title = {Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors},
  publisher = {arXiv},
  year = {2022},
  copyright = {arXiv.org perpetual, non-exclusive license}
}

CasualGANPapers/Make-A-Scene