A Compact and Semantic Latent Space for Disentangled and Controllable Image Editing

Gwilherm Lesné, Yann Gousseau, Saïd Ladjal, Alasdair Newson

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recent advances in the field of generative models and in particular generative adversarial networks (GANs) have lead to substantial progress for controlled image editing. Despite their powerful ability to apply realistic modifications to an image, these methods often lack properties such as disentanglement (the capacity to edit attributes independently). In this paper, we propose an auto-encoder which re-organizes the latent space of StyleGAN, so that each attribute which we wish to edit corresponds to an axis of the new latent space, and furthermore that the latent axes are decorrelated, encouraging disentanglement. We work in a compressed version of the latent space, using Principal Component Analysis, meaning that the parameter complexity of our autoencoder is reduced, leading to short training times (∼45 mins). Qualitative and quantitative results demonstrate the editing capabilities of our approach, with greater disentanglement than competing methods, while maintaining fidelity to the original image with respect to identity. Our autoencoder architecture is simple and straightforward, facilitating implementation.

Original languageEnglish
Title of host publicationProceedings - CVMP 2023
Subtitle of host publication20th ACM SIGGRAPH European Conference on Visual Media Production
EditorsStephen N. Spencer
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9798400704260
DOIs
Publication statusPublished - 30 Nov 2023
Event20th ACM SIGGRAPH European Conference on Visual Media Production, CVMP 2023 - London, United Kingdom
Duration: 30 Nov 20231 Dec 2023

Publication series

NameProceedings - CVMP 2023: 20th ACM SIGGRAPH European Conference on Visual Media Production

Conference

Conference20th ACM SIGGRAPH European Conference on Visual Media Production, CVMP 2023
Country/TerritoryUnited Kingdom
CityLondon
Period30/11/231/12/23

Keywords

  • Disentanglement
  • GANs
  • Generative Models
  • Image Editing
  • Latent navigation
  • Neural Networks

Fingerprint

Dive into the research topics of 'A Compact and Semantic Latent Space for Disentangled and Controllable Image Editing'. Together they form a unique fingerprint.

Cite this