TY - GEN
T1 - Unsupervised Representation Learning for Diverse Deformable Shape Collections
AU - Hahner, Sara
AU - Attaiki, Souhaib
AU - Garcke, Jochen
AU - Ovsjanikov, Maks
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024/1/1
Y1 - 2024/1/1
N2 - We introduce a novel learning-based method for encoding and manipulating 3D surface meshes. Our method is specifically designed to create an interpretable embedding space for deformable shape collections. Unlike previous 3D mesh autoencoders that require meshes to be in a 1-to-1 correspondence, our approach is trained on diverse meshes in an unsupervised manner. Central to our method is a spectral pooling technique that establishes a universal latent space, breaking free from traditional constraints of mesh connectivity and shape categories. The entire process consists of two stages. In the first stage, we employ the functional map paradigm to extract point-to-point (p2p) maps between a collection of shapes in an unsupervised manner. These p2p maps are then utilized to construct a common latent space, which ensures straightforward interpretation and independence from mesh connectivity and shape category. Through extensive experiments, we demonstrate that our method achieves excellent reconstructions and produces more realistic and smoother interpolations than baseline approaches. Our code can be found online: https: //github.com/Fraunhofer-SCAI/DISCO-AE/
AB - We introduce a novel learning-based method for encoding and manipulating 3D surface meshes. Our method is specifically designed to create an interpretable embedding space for deformable shape collections. Unlike previous 3D mesh autoencoders that require meshes to be in a 1-to-1 correspondence, our approach is trained on diverse meshes in an unsupervised manner. Central to our method is a spectral pooling technique that establishes a universal latent space, breaking free from traditional constraints of mesh connectivity and shape categories. The entire process consists of two stages. In the first stage, we employ the functional map paradigm to extract point-to-point (p2p) maps between a collection of shapes in an unsupervised manner. These p2p maps are then utilized to construct a common latent space, which ensures straightforward interpretation and independence from mesh connectivity and shape category. Through extensive experiments, we demonstrate that our method achieves excellent reconstructions and produces more realistic and smoother interpolations than baseline approaches. Our code can be found online: https: //github.com/Fraunhofer-SCAI/DISCO-AE/
KW - Mesh Autoencoder
KW - Mesh Pooling
KW - Representation Learning
KW - Surface Mesh Autoencoder
KW - Unsupervised Learning
U2 - 10.1109/3DV62453.2024.00158
DO - 10.1109/3DV62453.2024.00158
M3 - Conference contribution
AN - SCOPUS:85196741350
T3 - Proceedings - 2024 International Conference on 3D Vision, 3DV 2024
SP - 1594
EP - 1604
BT - Proceedings - 2024 International Conference on 3D Vision, 3DV 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 11th International Conference on 3D Vision, 3DV 2024
Y2 - 18 March 2024 through 21 March 2024
ER -