Physically Informed Spatial Regularization for Sound Event Localization and Detection

  • Haocheng Liu
  • , Diego Di Carlo
  • , Aditya Arie Nugraha
  • , Kazuyoshi Yoshii
  • , Gaël Richard
  • , Mathieu Fontaine

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Building Sound Event Localization and Detection (SELD) models that are robust to diverse acoustic environments remains one of the major challenges in multichannel signal processing, as reflections and reverberation can significantly confuse both the source direction and event detection. Introducing priors such as microphone geometry or room impulse response (RIR) into the model has proven effective in addressing this issue. Existing methods typically incorporate such priors in a deterministic way, often through data augmentation to enlarge data diversity. However, the uncertainty arising from the complex nature of audio acoustics remains largely underexplored in the SELD literature and naturally call for incorporating a stochastic modeling of acoustic prior. In this paper, we propose regularizing deep learning based SELD models with a physically constructed spatial covariance matrix (SCM) based on the estimated direction of arrival (DOA) and sound event detection (SED).

Original languageEnglish
Title of host publicationProceedings of the 2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331537456
DOIs
Publication statusPublished - 1 Jan 2025
Event2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2025 - Tahoe City, United States
Duration: 12 Oct 202515 Oct 2025

Publication series

NameIEEE Workshop on Applications of Signal Processing to Audio and Acoustics
ISSN (Print)1931-1168
ISSN (Electronic)1947-1629

Conference

Conference2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2025
Country/TerritoryUnited States
CityTahoe City
Period12/10/2515/10/25

Fingerprint

Dive into the research topics of 'Physically Informed Spatial Regularization for Sound Event Localization and Detection'. Together they form a unique fingerprint.

Cite this