BLIND ESTIMATION OF AUDIO EFFECTS USING AN AUTO-ENCODER APPROACH AND DIFFERENTIABLE DIGITAL SIGNAL PROCESSING

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Blind Estimation of Audio Effects (BE-AFX) aims at estimating the audio effects (AFXs) applied to an original, unprocessed audio sample solely based on the processed audio sample. To train such a system traditional approaches optimize a loss between ground truth and estimated AFX parameters. This involves knowing the exact implementation of the AFXs used for the process. In this work, we propose an alternative solution that eliminates the requirement for knowing this implementation. Instead, we introduce an auto-encoder approach, which optimizes an audio quality metric. We explore, suggest, and compare various implementations of commonly used mastering AFXs, using differential signal processing or neural approximations. Our findings demonstrate that our auto-encoder approach yields superior estimates of the audio quality produced by a chain of AFXs, compared to the traditional parameter-based approach, even if the latter provides a more accurate parameter estimation.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages856-860
Number of pages5
ISBN (Electronic)9798350344851
DOIs
Publication statusPublished - 1 Jan 2024
Event2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Seoul, Korea, Republic of
Duration: 14 Apr 202419 Apr 2024

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)1520-6149

Conference

Conference2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
Country/TerritoryKorea, Republic of
CitySeoul
Period14/04/2419/04/24

Keywords

  • audio effects
  • deep learning
  • differentiable digital signal processing
  • neural proxy

Fingerprint

Dive into the research topics of 'BLIND ESTIMATION OF AUDIO EFFECTS USING AN AUTO-ENCODER APPROACH AND DIFFERENTIABLE DIGITAL SIGNAL PROCESSING'. Together they form a unique fingerprint.

Cite this