TY - GEN
T1 - Deep Multi-View Stereo Gone Wild
AU - Darmon, Francois
AU - Bascle, Benedicte
AU - Devaux, Jean Clement
AU - Monasse, Pascal
AU - Aubry, Mathieu
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/1/1
Y1 - 2021/1/1
N2 - Deep multi-view stereo (MVS) methods have been developed and extensively compared on simple datasets,where they now outperform classical approaches. In this paper,we ask whether the conclusions reached in controlled scenarios are still valid when working with Internet photo collections. We propose a methodology for evaluation and explore the influence of three aspects of deep MVS methods: network architecture,training data,and supervision. We make several key observations,which we extensively validate quantitatively and qualitatively,both for depth prediction and complete 3D reconstructions. First,complex unsupervised approaches cannot train on data in the wild. Our new approach makes it possible with three key elements: upsampling the output,softmin based aggregation and a single reconstruction loss. Second,supervised deep depthmap-based MVS methods are state-of-the art for reconstruction of few internet images. Finally,our evaluation provides very different results than usual ones. This shows that evaluation in uncontrolled scenarios is important for new architectures.
AB - Deep multi-view stereo (MVS) methods have been developed and extensively compared on simple datasets,where they now outperform classical approaches. In this paper,we ask whether the conclusions reached in controlled scenarios are still valid when working with Internet photo collections. We propose a methodology for evaluation and explore the influence of three aspects of deep MVS methods: network architecture,training data,and supervision. We make several key observations,which we extensively validate quantitatively and qualitatively,both for depth prediction and complete 3D reconstructions. First,complex unsupervised approaches cannot train on data in the wild. Our new approach makes it possible with three key elements: upsampling the output,softmin based aggregation and a single reconstruction loss. Second,supervised deep depthmap-based MVS methods are state-of-the art for reconstruction of few internet images. Finally,our evaluation provides very different results than usual ones. This shows that evaluation in uncontrolled scenarios is important for new architectures.
KW - Dataset
KW - Deep Learning
KW - MVS
KW - Multi View Stereo
UR - https://www.scopus.com/pages/publications/85125012554
U2 - 10.1109/3DV53792.2021.00058
DO - 10.1109/3DV53792.2021.00058
M3 - Conference contribution
AN - SCOPUS:85125012554
T3 - Proceedings - 2021 International Conference on 3D Vision, 3DV 2021
SP - 484
EP - 493
BT - Proceedings - 2021 International Conference on 3D Vision, 3DV 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th International Conference on 3D Vision, 3DV 2021
Y2 - 1 December 2021 through 3 December 2021
ER -