Skip to content

Multi-modal Photo Upsampling via Latent Space Exploration of StyleGAN2

License

Notifications You must be signed in to change notification settings

sarperturann/multi-modal-pulse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MM-PULSE: Multi-modal Photo Upsampling via Latent Space Exploration of StyleGAN2

Paper

PULSE (Photo Upsampling via Latent Space Exploration) produces high-resolution images with high accuracy; however, it outputs only one image. MM-PULSE aims to address and solve the limitations of PULSE by first integrating StyleGAN2 into the PULSE architecture, enhancing the quality of generated images and then using different random latents achieving multi-modality. Additionally, it employs an InterFaceGAN-based approach to determine semantic directions for attributes like ’smile’, ’bangs’ and ’hair color’ and utilizes them to perform targeted edits on the generated images.

To simplify, MM-PULSE performs two main tasks: it takes a low-resolution input image and generates multiple high-resolution images that are visually realistic and properly downscaled, while also enabling the editing of specific attributes in images by manipulating the latent space.

PULSE and MM-PULSE comparisons MM-PULSE Semantic Directions


Architecture

MM-PULSE Architecture


Built on top of PULSE, InterfaceGAN