Abstract
Accurate trajectory forecasting is crucial for the performance of various systems, such as advanced driver-assistance systems and self-driving vehicles. These forecasts allow us to anticipate events that lead to collisions and, therefore, to mitigate them. Deep Neural Networks have excelled in motion forecasting, but overconfidence and weak uncertainty quantification persist. Deep Ensembles address these concerns, yet applying them to multimodal distributions remains challenging. In this paper, we propose a novel approach named Hierarchical Light Transformer Ensembles (HLT-Ens) aimed at efficiently training an ensemble of Transformer architectures using a novel hierarchical loss function. HLT-Ens leverages grouped fully connected layers, inspired by grouped convolution techniques, to capture multimodal distributions effectively. We demonstrate that HLT-Ens achieves state-of-the-art performance levels through extensive experimentation, offering a promising avenue for improving trajectory forecasting techniques. We make our code available at github.com/alafage/hlt-ens.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 1682-1691 |
| Number of pages | 10 |
| ISBN (Electronic) | 9798331510831 |
| DOIs | |
| Publication status | Published - 1 Jan 2025 |
| Externally published | Yes |
| Event | 2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025 - Tucson, United States Duration: 28 Feb 2025 → 4 Mar 2025 |
Publication series
| Name | Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025 |
|---|
Conference
| Conference | 2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025 |
|---|---|
| Country/Territory | United States |
| City | Tucson |
| Period | 28/02/25 → 4/03/25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- ensembling methods
- motion forecasting
Fingerprint
Dive into the research topics of 'Hierarchical Light Transformer Ensembles for Multimodal Trajectory Forecasting'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver