Sequence representation of music structure using higher-order similarity matrix and maximum-likelihood approach

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we present a novel method for the automatic estimation of the structure of music tracks using a sequence representation. A set of timbre-related (MFCC and Spectral Contrast) and pitch-related (Pitch Class Profile) features are first extracted from the signal leading to three similarity matrices which are then combined. We then introduce the use of higher-order (2nd and 3rd order) similarity matrices in order to reinforce the diagonals corresponding to common repetitions and reduce the background noise. Segments are then detected and a maximum-likelihood approach is proposed in order to derive simultaneously the underlying sequence representation of the music track and the most representative segment of each sequence. The proposed method is evaluated positively on the MPEG-7 "melody repetition" test set.

Original languageEnglish
Title of host publicationProceedings of the 8th International Conference on Music Information Retrieval, ISMIR 2007
Pages35-40
Number of pages6
Publication statusPublished - 1 Dec 2007
Externally publishedYes
Event8th International Conference on Music Information Retrieval, ISMIR 2007 - Vienna, Austria
Duration: 23 Sept 200727 Sept 2007

Publication series

NameProceedings of the 8th International Conference on Music Information Retrieval, ISMIR 2007

Conference

Conference8th International Conference on Music Information Retrieval, ISMIR 2007
Country/TerritoryAustria
CityVienna
Period23/09/0727/09/07

Fingerprint

Dive into the research topics of 'Sequence representation of music structure using higher-order similarity matrix and maximum-likelihood approach'. Together they form a unique fingerprint.

Cite this