PANO-ECHO: PANOramic depth prediction enhancement with ECHO features

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

Panoramic depth estimation is gaining importance as 360° images become widely available. However, traditional mono-to-depth approaches, optimized for a limited field of view, show subpar performance when naively adapted. Methods tailored to process panoramic input improve predictions but cannot overcome the ambiguous visual information and scale uncertainty inherent to the task. In this paper, we show the benefits of leveraging sound for improved panoramic depth estimation. Specifically, we harness audible echoes from emitted chirps, as they contain rich geometric and material cues about the surrounding environment. We show that these auditory cues can enhance a state-of-the-art panoramic depth prediction framework. By integrating sound information, we improve this vision-only baseline by ≈12%. Our approach requires minimal modifications to the underlying architecture, making it easily applicable to other baseline models. We validate its efficacy on the Matterport3D and Replica datasets, demonstrating remarkable improvements in depth estimation accuracy. Our code is available here: https://github.com/peter12398/PANO-ECHO
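
To illustrate why echoes of an emitted chirp carry geometric information, the following is a minimal sketch of classical time-of-flight ranging, not the method of the paper (which fuses learned echo features into a depth network). It assumes a single ideal reflector, a speed of sound of 343 m/s, and a 3 ms linear chirp sampled at 44.1 kHz; the function names and all parameter values are hypothetical choices made for this example only.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s in air at roughly 20 °C (assumed value)


def linear_chirp(duration_s, f0_hz, f1_hz, sample_rate):
    """Return a linear frequency sweep from f0_hz to f1_hz."""
    t = np.arange(int(duration_s * sample_rate)) / sample_rate
    sweep_rate = (f1_hz - f0_hz) / duration_s
    return np.sin(2 * np.pi * (f0_hz * t + 0.5 * sweep_rate * t**2))


def simulate_echo(chirp, distance_m, sample_rate, total_samples, attenuation=0.3):
    """Toy recording: one attenuated copy of the chirp after the round trip."""
    delay = int(round(2 * distance_m / SPEED_OF_SOUND * sample_rate))
    if delay + len(chirp) > total_samples:
        raise ValueError("recording too short for this distance")
    recording = np.zeros(total_samples)
    recording[delay:delay + len(chirp)] += attenuation * chirp
    return recording


def estimate_distance(chirp, recording, sample_rate):
    """Locate the echo by cross-correlation and convert delay to distance."""
    corr = np.correlate(recording, chirp, mode="valid")
    delay = int(np.argmax(np.abs(corr)))
    # Halve the round-trip path to get the one-way distance.
    return delay / sample_rate * SPEED_OF_SOUND / 2


if __name__ == "__main__":
    sr = 44100
    chirp = linear_chirp(0.003, 20.0, 20000.0, sr)
    recording = simulate_echo(chirp, distance_m=3.0, sample_rate=sr, total_samples=4096)
    print(f"estimated distance: {estimate_distance(chirp, recording, sr):.3f} m")
```

In a real room the recording contains many overlapping reflections whose strength also depends on surface material, which is why a learned representation of the echo is a more plausible input to a depth network than a single delay estimate.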

Original language: English
Title of host publication: Proceedings - 2024 IEEE Conference on Artificial Intelligence, CAI 2024
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1063-1070
Number of pages: 8
ISBN (Electronic): 9798350354096
Publication status: Published - 1 Jan 2024
Externally published: Yes
Event: 2nd IEEE Conference on Artificial Intelligence, CAI 2024 - Singapore, Singapore
Duration: 25 Jun 2024 to 27 Jun 2024

Publication series

Name: Proceedings - 2024 IEEE Conference on Artificial Intelligence, CAI 2024

Conference

Conference: 2nd IEEE Conference on Artificial Intelligence, CAI 2024
Country/Territory: Singapore
City: Singapore
Period: 25/06/24 to 27/06/24

Keywords

  • Audio-Visual learning
  • multi-modal fusion
  • panoramic depth estimation
