Using Random Codebooks for Audio Neural AutoEncoders

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Latent representation learning has been an active field of study for decades in numerous applications. Inspired, among other things, by tokenization in Natural Language Processing and motivated by the search for a simple data representation, recent works have introduced a quantization step into feature extraction. In this work, we propose a novel strategy to build the neural discrete representation by means of random codebooks. These codebooks are obtained by randomly sampling a large, predefined fixed codebook. We experimentally show the merits and potential of our approach in an audio compression and reconstruction task.
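
The abstract does not specify how the random codebooks are drawn or used, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes uniform sampling without replacement from the large fixed codebook and nearest-neighbour quantization under squared Euclidean distance. The names `sample_codebook`, `quantize`, `base_codebook`, and all sizes are hypothetical.

```python
import numpy as np

def sample_codebook(base_codebook, size, rng):
    """Draw `size` distinct codewords (rows) from the large fixed codebook."""
    # Assumption: uniform sampling without replacement.
    idx = rng.choice(base_codebook.shape[0], size=size, replace=False)
    return base_codebook[idx], idx

def quantize(latents, codebook):
    """Map each latent vector to its nearest codeword (squared Euclidean distance)."""
    # dists[i, j] = ||latents[i] - codebook[j]||^2
    dists = ((latents[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    codes = dists.argmin(axis=1)
    return codebook[codes], codes

rng = np.random.default_rng(0)
base_codebook = rng.standard_normal((4096, 8))  # hypothetical large fixed codebook
latents = rng.standard_normal((16, 8))          # stand-in for encoder outputs

codebook, idx = sample_codebook(base_codebook, size=256, rng=rng)
quantized, codes = quantize(latents, codebook)
```

Because the base codebook is fixed, a decoder that knows the sampled indices `idx` (or the seed that produced them) can rebuild the same random codebook and recover `quantized` from the integer `codes` alone.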

Original language: English
Title of host publication: 32nd European Signal Processing Conference, EUSIPCO 2024 - Proceedings
Publisher: European Signal Processing Conference, EUSIPCO
Pages: 311-315
Number of pages: 5
ISBN (Electronic): 9789464593617
DOIs
Publication status: Published - 1 Jan 2024
Event: 32nd European Signal Processing Conference, EUSIPCO 2024 - Lyon, France
Duration: 26 Aug 2024 → 30 Aug 2024

Publication series

Name: European Signal Processing Conference
ISSN (Print): 2219-5491

Conference

Conference: 32nd European Signal Processing Conference, EUSIPCO 2024
Country/Territory: France
City: Lyon
Period: 26/08/24 → 30/08/24

Keywords

  • audio reconstruction
  • feature extraction
  • quantization
  • random codebooks
