TY - GEN
T1 - UNIT
T2 - 12th International Conference on 3D Vision, 3DV 2025
AU - Sautier, Corentin
AU - Puy, Gilles
AU - Boulch, Alexandre
AU - Marlet, Renaud
AU - Lepetit, Vincent
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025/1/1
Y1 - 2025/1/1
N2 - Online object segmentation and tracking in Lidar point clouds enables autonomous agents to understand their surroundings and make safe decisions. Unfortunately, manual annotations for these tasks are prohibitively costly. We tackle this problem with the task of class-agnostic unsupervised online instance segmentation and tracking. To that end, we leverage an instance segmentation backbone and propose a new training recipe that enables the online tracking of objects. Our network is trained on pseudo-labels, eliminating the need for manual annotations. We conduct an evaluation using metrics adapted for temporal instance segmentation. Computing these metrics requires temporally-consistent instance labels. When unavailable, we construct these labels using the available 3D bounding boxes and semantic labels in the dataset. We compare our method against strong baselines and demonstrate its superiority across two different outdoor Lidar datasets. Project page: csautier.github.io/unit
AB - Online object segmentation and tracking in Lidar point clouds enables autonomous agents to understand their surroundings and make safe decisions. Unfortunately, manual annotations for these tasks are prohibitively costly. We tackle this problem with the task of class-agnostic unsupervised online instance segmentation and tracking. To that end, we leverage an instance segmentation backbone and propose a new training recipe that enables the online tracking of objects. Our network is trained on pseudo-labels, eliminating the need for manual annotations. We conduct an evaluation using metrics adapted for temporal instance segmentation. Computing these metrics requires temporally-consistent instance labels. When unavailable, we construct these labels using the available 3D bounding boxes and semantic labels in the dataset. We compare our method against strong baselines and demonstrate its superiority across two different outdoor Lidar datasets. Project page: csautier.github.io/unit
KW - lidar
KW - self-supervision
KW - unsupervised object segmentation
KW - unsuperviser instance segmentation
UR - https://www.scopus.com/pages/publications/105016250277
U2 - 10.1109/3DV66043.2025.00124
DO - 10.1109/3DV66043.2025.00124
M3 - Conference contribution
AN - SCOPUS:105016250277
T3 - Proceedings - 2025 International Conference on 3D Vision, 3DV 2025
SP - 1307
EP - 1316
BT - Proceedings - 2025 International Conference on 3D Vision, 3DV 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 25 March 2025 through 28 March 2025
ER -