Skip to main navigation Skip to search Skip to main content

A Lightweight Dual-Stage Framework for Personalized Speech Enhancement Based on Deepfilternet2

  • Thomas Serre
  • , Mathieu Fontaine
  • , Éric Benhaim
  • , Geoffroy Dutour
  • , Slim Essid
  • Signal Processing Laboratory
  • Institut Polytechnique de Paris

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Isolating the desired speaker's voice amidst multiple speakers in a noisy acoustic context is a challenging task. Personalized speech enhancement (PSE) endeavours to achieve this by leveraging prior knowledge of the speaker's voice. Recent research efforts have yielded promising PSE models, albeit often accompanied by computationally intensive architectures, unsuitable for resource-constrained embedded devices. In this paper, we introduce a novel method to personalize a lightweight dual-stage Speech Enhancement (SE) model and implement it within DeepFilterNet2, a SE model renowned for its state-of-the-art performance. We seek an optimal integration of speaker information within the model, exploring different positions for the integration of the speaker embeddings within the dual-stage enhancement architecture. We also investigate a tailored training strategy when adapting DeepFilterNet2 to a PSE task. We show that our personalization method greatly improves the performances of DeepFilterNet2 while preserving minimal computational overhead.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages780-784
Number of pages5
ISBN (Electronic)9798350374513
DOIs
Publication statusPublished - 1 Jan 2024
Externally publishedYes
Event2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Seoul, Korea, Republic of
Duration: 14 Apr 202419 Apr 2024

Publication series

Name2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings

Conference

Conference2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024
Country/TerritoryKorea, Republic of
CitySeoul
Period14/04/2419/04/24

Keywords

  • Target speech extraction
  • real-time
  • speech enhancement

Fingerprint

Dive into the research topics of 'A Lightweight Dual-Stage Framework for Personalized Speech Enhancement Based on Deepfilternet2'. Together they form a unique fingerprint.

Cite this