Skip to main navigation Skip to search Skip to main content

Soft Disentanglement in Frequency Bands for Neural Audio Codecs

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In neural-based audio feature extraction, ensuring that representations capture disentangled information is crucial for model interpretability. However, existing disentanglement methods often rely on assumptions that are highly dependent on data characteristics or specific tasks. In this work, we introduce a generalizable approach for learning disentangled features within a neural architecture. Our method applies spectral decomposition to time-domain signals, followed by a multi-branch audio codec that operates on the decomposed components. Empirical evaluations demonstrate that our approach achieves better reconstruction and perceptual performance compared to a state-of-the-art baseline while also offering potential advantages for inpainting tasks.

Original languageEnglish
Title of host publication2025 33rd European Signal Processing Conference, EUSIPCO 2025 - Proceedings
PublisherEuropean Signal Processing Conference, EUSIPCO
Pages11-15
Number of pages5
ISBN (Electronic)9789464593624
DOIs
Publication statusPublished - 1 Jan 2025
Event33rd European Signal Processing Conference, EUSIPCO 2025 - Palermo, Italy
Duration: 8 Sept 202512 Sept 2025

Publication series

NameEuropean Signal Processing Conference
ISSN (Print)2219-5491

Conference

Conference33rd European Signal Processing Conference, EUSIPCO 2025
Country/TerritoryItaly
CityPalermo
Period8/09/2512/09/25

Keywords

  • Disentanglement
  • Frequency Decomposition
  • Inpainting
  • Neural Audio Codec

Fingerprint

Dive into the research topics of 'Soft Disentanglement in Frequency Bands for Neural Audio Codecs'. Together they form a unique fingerprint.

Cite this