Passer à la navigation principale Passer à la recherche Passer au contenu principal

RIVQ-VAE: Discrete Rotation-Invariant 3D Representation Learning

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Building local surface representations has recently attracted significant attention in 3D vision, allowing to structure complex 3D shapes as sequences of simpler local geometries. Inspired by advances in 2D discrete representation learning, recent approaches have proposed to break up 3D shapes into regular grids, where each cell is associated with a discrete code sampled from a learnable codebook. Unfortunately, existing methods ignore both the local rigid self-similarities as well as the ambiguities inherent to 3D geometry related to possible changes in orientation. As a result, such techniques require very large codebooks to capture all possible variability in both geometry and pose. In this work, we propose a novel generative model that improves the generation quality by compactly embedding local geometries in a rotation- and translation-invariant manner. This strategy allows our codebook of discrete codes to express a larger range of geometric structures by avoiding local and global redundancies. Crucially, we demonstrate via a careful architecture design that our approach allows to recover meaningful shapes from local embeddings, while ensuring global consistency. The conducted experiments show that our approach outperforms baseline methods by a large margin under similar settings.

langue originaleAnglais
titreProceedings - 2024 International Conference on 3D Vision, 3DV 2024
EditeurInstitute of Electrical and Electronics Engineers Inc.
Pages1382-1391
Nombre de pages10
ISBN (Electronique)9798350362459
Les DOIs
étatPublié - 1 janv. 2024
Evénement11th International Conference on 3D Vision, 3DV 2024 - Davos, Suisse
Durée: 18 mars 202421 mars 2024

Série de publications

NomProceedings - 2024 International Conference on 3D Vision, 3DV 2024

Une conférence

Une conférence11th International Conference on 3D Vision, 3DV 2024
Pays/TerritoireSuisse
La villeDavos
période18/03/2421/03/24

Empreinte digitale

Examiner les sujets de recherche de « RIVQ-VAE: Discrete Rotation-Invariant 3D Representation Learning ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation