Skip to main navigation Skip to search Skip to main content

Drum track transcription of polyphonic music using noise subspace projection

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper presents a novel drum transcription system for polyphonic music. The use of a band-wise harmonic/noise decomposition allows the suppression of the deterministic part of the signal, which is mainly contributed by nonrhythmic instruments. The transcription is then performed on the residual noise signal, which contains most of the rhythmic information. This signal is segmented, and the events associated to each onset are classified by support vector machines (SVM) with probabilistic outputs. The features used for classification are directly extracted from the sub-band signals. An additional pre-processing stage in which the instances are reclassified using a localized model was also tested. This transcription method is evaluated on ten test sequences, each of them being performed by two drummers and being available with different mixing settings. The whole system achieves precision and recall rates of 84% for the bass drum and snare drum detection tasks.

Original languageEnglish
Title of host publicationISMIR 2005 - 6th International Conference on Music Information Retrieval
PublisherQueen Mary, University of London
Pages92-99
Number of pages8
ISBN (Print)9780955117909
Publication statusPublished - 1 Jan 2005
Event6th International Conference on Music Information Retrieval, ISMIR 2005 - London, United Kingdom
Duration: 11 Sept 200515 Sept 2005

Publication series

NameISMIR 2005 - 6th International Conference on Music Information Retrieval

Conference

Conference6th International Conference on Music Information Retrieval, ISMIR 2005
Country/TerritoryUnited Kingdom
CityLondon
Period11/09/0515/09/05

Keywords

  • Drum transcription
  • Highresolution methods
  • Rhythm analysis

Fingerprint

Dive into the research topics of 'Drum track transcription of polyphonic music using noise subspace projection'. Together they form a unique fingerprint.

Cite this