Music retiler: Using NMF2D source separation for audio mosaicing

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Musaicing (music mosaicing) aims at reconstructing a target music track by superimposing audio samples selected from a collection. This selection is based on their acoustic similarity to the target. The baseline technique to perform this is concatenative synthesis in which the superposition only occurs in time. Non-Negative Matrix Factorization has also been proposed for this task. In this, a target spectrogram is factorized into an activation matrix and a predefined basis matrix which represents the sample collection. The superposition therefore occurs in time and frequency. However, in both methods the samples used for the reconstruction represent isolated sources (such as bees) and remain unchanged during the musaicing (samples need to be pre-pitch-shifted). This reduces the applicability of these methods. We propose here a variation of the musaicing in which the samples used for the reconstruction are obtained by applying a NMF2D separation algorithm to a music collection (such as a collection of Reggae tracks). Using these separated samples, a second NMF2D algorithm is then used to automatically find the best transposition factors to represent the target. We performed an online perceptual experiment of our method which shows that it outperforms the NMF algorithm when the sources are polyphonic and multi-source.

Original languageEnglish
Title of host publicationAudio Mostly - A Conference on Interaction with Sound - 2018 Sound in Immersion and Emotion, AM 2018 - Conference Proceedings
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450366090
DOIs
Publication statusPublished - 12 Sept 2018
Event2018 International Audio Mostly Conference - A Conference on Interaction with Sound: Sound in Immersion and Emotion, AM 2018 - Wrexham, United Kingdom
Duration: 12 Sept 201814 Sept 2018

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2018 International Audio Mostly Conference - A Conference on Interaction with Sound: Sound in Immersion and Emotion, AM 2018
Country/TerritoryUnited Kingdom
CityWrexham
Period12/09/1814/09/18

Keywords

  • Mosaicing
  • Music information retrieval (MIR)
  • NMF2D
  • Sound synthesis

Fingerprint

Dive into the research topics of 'Music retiler: Using NMF2D source separation for audio mosaicing'. Together they form a unique fingerprint.

Cite this