
A Lightweight Dual-Stage Framework for Personalized Speech Enhancement Based on DeepFilterNet2

  • Thomas Serre
  • Mathieu Fontaine
  • Éric Benhaim
  • Geoffroy Dutour
  • Slim Essid
  • Signal Processing Laboratory
  • Institut Polytechnique de Paris

Research output: Chapter in book, report, anthology, or collection > Conference contribution > Peer-reviewed

Abstract

Isolating the desired speaker's voice amidst multiple speakers in a noisy acoustic context is a challenging task. Personalized speech enhancement (PSE) endeavours to achieve this by leveraging prior knowledge of the speaker's voice. Recent research efforts have yielded promising PSE models, but these often rely on computationally intensive architectures that are unsuitable for resource-constrained embedded devices. In this paper, we introduce a novel method to personalize a lightweight dual-stage Speech Enhancement (SE) model and implement it within DeepFilterNet2, an SE model renowned for its state-of-the-art performance. We seek an optimal integration of speaker information within the model, exploring different positions at which the speaker embeddings can be integrated into the dual-stage enhancement architecture. We also investigate a tailored training strategy for adapting DeepFilterNet2 to a PSE task. We show that our personalization method greatly improves the performance of DeepFilterNet2 while adding only minimal computational overhead.
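To make the idea of "different positions" for the speaker embedding concrete, here is a minimal illustrative sketch. It is not the authors' implementation: the two stages are toy placeholders (not DeepFilterNet2's actual networks), concatenation is only one assumed way of injecting the embedding, and the names `concat_speaker`, `toy_stage`, `enhance`, and `inject_at` are hypothetical.

```python
from typing import List

Frame = List[float]


def concat_speaker(frames: List[Frame], spk_emb: List[float]) -> List[Frame]:
    """Append the same speaker embedding to every frame's feature vector."""
    return [frame + spk_emb for frame in frames]


def toy_stage(frames: List[Frame], out_dim: int) -> List[Frame]:
    """Placeholder for a learned stage (assumption): it averages each
    input vector and repeats that value out_dim times."""
    return [[sum(f) / len(f)] * out_dim for f in frames]


def enhance(frames: List[Frame], spk_emb: List[float], inject_at: str) -> List[Frame]:
    """Run two stages in sequence, concatenating the speaker embedding
    to the input of stage 1, stage 2, or both."""
    if inject_at not in {"stage1", "stage2", "both"}:
        raise ValueError("inject_at must be 'stage1', 'stage2', or 'both'")
    feat_dim = len(frames[0])
    x = frames
    if inject_at in {"stage1", "both"}:
        x = concat_speaker(x, spk_emb)  # condition the first stage
    x = toy_stage(x, feat_dim)
    if inject_at in {"stage2", "both"}:
        x = concat_speaker(x, spk_emb)  # condition the second stage
    return toy_stage(x, feat_dim)


if __name__ == "__main__":
    noisy = [[1.0, 3.0], [2.0, 4.0]]  # two frames, two features each
    speaker = [0.0, 2.0]              # hypothetical speaker embedding
    for position in ("stage1", "stage2", "both"):
        print(position, enhance(noisy, speaker, position))
```

In this sketch only the input width of the conditioned stage grows (by the embedding size), which is one plausible reason such conditioning can stay cheap. Which position works best is an empirical question that the paper investigates.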

Original language: English
Title of host publication: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 780-784
Number of pages: 5
ISBN (Electronic): 9798350374513
Publication status: Published - 1 Jan 2024
Externally published: Yes
Event: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Seoul, South Korea
Duration: 14 Apr 2024 to 19 Apr 2024

Publication series

Name: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings

Conference

Conference: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024
Country/Territory: South Korea
City: Seoul
Period: 14/04/24 to 19/04/24
