Skip to main navigation Skip to search Skip to main content

Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport

  • Institut Polytechnique de Paris

Research output: Contribution to journalConference articlepeer-review

Abstract

In neural audio signal processing, pitch conditioning has been used to enhance the performance of synthesizers. However, jointly training pitch estimators and synthesizers is a challenge when using standard audio-to-audio reconstruction loss, leading to reliance on external pitch trackers. To address this issue, we propose using a spectral loss function inspired by optimal transportation theory that minimizes the displacement of spectral energy. We validate this approach through an unsupervised autoencoding task that fits a harmonic template to harmonic signals. We jointly estimate the fundamental frequency and amplitudes of harmonics using a lightweight encoder and reconstruct the signals using a differentiable harmonic synthesizer. The proposed approach offers a promising direction for improving unsupervised parameter estimation in neural audio applications.

Original languageEnglish
Pages (from-to)1176-1180
Number of pages5
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
DOIs
Publication statusPublished - 1 Jan 2024
Event2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Seoul, Korea, Republic of
Duration: 14 Apr 202419 Apr 2024

Keywords

  • differentiable signal processing
  • frequency estimation
  • machine learning
  • optimal transport

Fingerprint

Dive into the research topics of 'Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport'. Together they form a unique fingerprint.

Cite this