Supervised and unsupervised sequence modelling for DRUM transcription

Olivier Gillet, Gäel Richard

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We discuss in this paper two post-processings for drum transcription systems, which aim to model typical properties of drum sequences. Both methods operate on a symbolic representation of the sequence, which is obtained by quantizing the onsets of drum strokes on an optimal tatum grid, and by fusing the posterior probabilities produced by the drum transcription system. The first proposed method is a generalization of the N-gram model. We discuss several training and recognition strategies (style-dependent models, local models) in order to maximize the reliability and the specificity of the trained models. Alternatively, we introduce a novel unsupervised algorithm based on a complexity criterion, which finds the most regular and wellstructured sequence compatible with the acoustic scores produced by the transcription system. Both approaches are evaluated on a subset of the ENST-drums corpus, and yield performance improvements.

Original languageEnglish
Title of host publicationProceedings of the 8th International Conference on Music Information Retrieval, ISMIR 2007
Pages219-224
Number of pages6
Publication statusPublished - 1 Dec 2007
Externally publishedYes
Event8th International Conference on Music Information Retrieval, ISMIR 2007 - Vienna, Austria
Duration: 23 Sept 200727 Sept 2007

Publication series

NameProceedings of the 8th International Conference on Music Information Retrieval, ISMIR 2007

Conference

Conference8th International Conference on Music Information Retrieval, ISMIR 2007
Country/TerritoryAustria
CityVienna
Period23/09/0727/09/07

Fingerprint

Dive into the research topics of 'Supervised and unsupervised sequence modelling for DRUM transcription'. Together they form a unique fingerprint.

Cite this