Physically Informed Spatial Regularization for Sound Event Localization and Detection

  • Haocheng Liu
  • Diego Di Carlo
  • Aditya Arie Nugraha
  • Kazuyoshi Yoshii
  • Gaël Richard
  • Mathieu Fontaine

Research output: Chapter in a book, report, anthology, or collection; Conference contribution; Peer-reviewed

Abstract

Building Sound Event Localization and Detection (SELD) models that are robust to diverse acoustic environments remains one of the major challenges in multichannel signal processing, as reflections and reverberation can significantly degrade both source direction estimation and event detection. Introducing priors such as microphone geometry or room impulse responses (RIRs) into the model has proven effective in addressing this issue. Existing methods typically incorporate such priors deterministically, often through data augmentation to increase data diversity. However, the uncertainty arising from the complex nature of acoustics remains largely underexplored in the SELD literature and naturally calls for stochastic modeling of acoustic priors. In this paper, we propose regularizing deep-learning-based SELD models with a physically constructed spatial covariance matrix (SCM) based on the estimated direction of arrival (DOA) and sound event detection (SED) outputs.
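The abstract does not say how the SCM is constructed, so the following is only a minimal illustrative sketch and not the authors' method. It assumes a free-field, far-field, rank-1 model in which the SCM at one frequency is the outer product of a steering vector, scaled by an event activity value, plus a small diagonal noise floor. The function names, the 4-microphone geometry, the speed of sound, and the `noise_floor` parameter are all hypothetical choices made for illustration.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at room temperature


def doa_unit_vector(azimuth, elevation):
    """Unit vector pointing toward the source (angles in radians)."""
    return np.array([
        np.cos(elevation) * np.cos(azimuth),
        np.cos(elevation) * np.sin(azimuth),
        np.sin(elevation),
    ])


def steering_vector(mic_positions, azimuth, elevation, freq_hz):
    """Far-field, free-field steering vector for one frequency.

    mic_positions: (M, 3) array of microphone coordinates in meters.
    """
    u = doa_unit_vector(azimuth, elevation)
    # Microphones closer to the source receive the wavefront earlier.
    delays = -(mic_positions @ u) / SPEED_OF_SOUND
    return np.exp(-2j * np.pi * freq_hz * delays)


def physical_scm(mic_positions, azimuth, elevation, activity, freq_hz,
                 noise_floor=1e-3):
    """Rank-1 SCM from a DOA and an SED activity in [0, 1] (assumed model)."""
    a = steering_vector(mic_positions, azimuth, elevation, freq_hz)
    # Outer product a a^H scaled by activity, plus a diagonal noise floor.
    return activity * np.outer(a, a.conj()) + noise_floor * np.eye(len(a))


# Hypothetical 4-microphone array with 5 cm offsets from the origin.
mics = np.array([
    [0.05, 0.0, 0.0],
    [-0.05, 0.0, 0.0],
    [0.0, 0.05, 0.0],
    [0.0, -0.05, 0.0],
])
scm = physical_scm(mics, azimuth=0.3, elevation=0.1, activity=1.0,
                   freq_hz=1000.0)
```

Under these assumptions the resulting matrix is Hermitian and positive definite, and it collapses to the noise floor when the event is inactive. A regularizer could, for example, compare such a matrix with one estimated from the observed multichannel signal, but the abstract does not state which comparison the paper uses.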

Original language: English
Host publication title: Proceedings of the 2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2025
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (electronic): 9798331537456
Status: Published - 1 Jan 2025
Event: 2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2025 - Tahoe City, United States
Duration: 12 Oct 2025 to 15 Oct 2025

Publication series

Name: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
ISSN (print): 1931-1168
ISSN (electronic): 1947-1629

Conference

Conference: 2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2025
Country/Territory: United States
City: Tahoe City
Period: 12 Oct 2025 to 15 Oct 2025
