RIVQ-VAE: Discrete Rotation-Invariant 3D Representation Learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Building local surface representations has recently attracted significant attention in 3D vision, allowing to structure complex 3D shapes as sequences of simpler local geometries. Inspired by advances in 2D discrete representation learning, recent approaches have proposed to break up 3D shapes into regular grids, where each cell is associated with a discrete code sampled from a learnable codebook. Unfortunately, existing methods ignore both the local rigid self-similarities as well as the ambiguities inherent to 3D geometry related to possible changes in orientation. As a result, such techniques require very large codebooks to capture all possible variability in both geometry and pose. In this work, we propose a novel generative model that improves the generation quality by compactly embedding local geometries in a rotation- and translation-invariant manner. This strategy allows our codebook of discrete codes to express a larger range of geometric structures by avoiding local and global redundancies. Crucially, we demonstrate via a careful architecture design that our approach allows to recover meaningful shapes from local embeddings, while ensuring global consistency. The conducted experiments show that our approach outperforms baseline methods by a large margin under similar settings.

Original languageEnglish
Title of host publicationProceedings - 2024 International Conference on 3D Vision, 3DV 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1382-1391
Number of pages10
ISBN (Electronic)9798350362459
DOIs
Publication statusPublished - 1 Jan 2024
Event11th International Conference on 3D Vision, 3DV 2024 - Davos, Switzerland
Duration: 18 Mar 202421 Mar 2024

Publication series

NameProceedings - 2024 International Conference on 3D Vision, 3DV 2024

Conference

Conference11th International Conference on 3D Vision, 3DV 2024
Country/TerritorySwitzerland
CityDavos
Period18/03/2421/03/24

Keywords

  • Generative modeling
  • Local representation
  • Point cloud completion
  • Single-view reconstruction

Fingerprint

Dive into the research topics of 'RIVQ-VAE: Discrete Rotation-Invariant 3D Representation Learning'. Together they form a unique fingerprint.

Cite this