Passer à la navigation principale Passer à la recherche Passer au contenu principal

Unified Variational and Physics-aware Model for Room Impulse Response Estimation

  • Institut Polytechnique de Paris

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

Room impulse response estimation is essential for tasks like speech dereverberation, which improves automatic speech recognition. Most existing methods rely on either statistical signal processing or deep neural networks designed to replicate signal processing principles. However, combining statistical and physical modeling for RIR estimation remains largely unexplored. This paper proposes a novel approach integrating both aspects through a theoretically grounded model. The RIR is decomposed into interpretable parameters: white Gaussian noise filtered by a frequency-dependent exponential decay (e.g. modeling wall absorption) and an autoregressive filter (e.g. modeling microphone response). A variational free-energy cost function enables practical parameter estimation. As a proof of concept, we show that given dry and reverberant speech signals, the proposed method outperforms classical deconvolution in noisy environments, as validated by objective metrics.

langue originaleAnglais
Pages (de - à)3818-3822
Nombre de pages5
journalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Les DOIs
étatPublié - 1 janv. 2025
Evénement26th Interspeech Conference 2025 - Rotterdam, Pays-Bas
Durée: 17 août 202521 août 2025

Empreinte digitale

Examiner les sujets de recherche de « Unified Variational and Physics-aware Model for Room Impulse Response Estimation ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation