Visual analysis for drum sequence transcription

  • Kevin McGuinness
  • , Olivier Gillet
  • , Noel E. O'Connor
  • , Gaël Richard

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A system is presented for analysing drum performance video sequences. A novel ellipse detection algorithm is introduced that automatically locates drum tops. This algorithm fits ellipses to edge clusters, and ranks them according to various fitness criteria. A background/foreground segmentation method is then used to extract the silhouette of the drummer and drum sticks. Coupled with a motion intensity feature, this allows for the detection of 'hits' in each of the extracted regions. In order to obtain a transcription of the performance, each of these regions is automatically labeled with the corresponding instrument class. A partial audio transcription and color cues are used to measure the compatibility between a region and its label, the Kuhn-Munkres algorithm is then employed to find the optimal labeling. Experimental results demonstrate the ability of visual analysis to enhance the performance of an audio drum transcription system.

Original languageEnglish
Title of host publication15th European Signal Processing Conference, EUSIPCO 2007 - Proceedings
Pages312-316
Number of pages5
Publication statusPublished - 1 Dec 2007
Externally publishedYes
Event15th European Signal Processing Conference, EUSIPCO 2007 - Poznan, Poland
Duration: 3 Sept 20077 Sept 2007

Publication series

NameEuropean Signal Processing Conference
ISSN (Print)2219-5491

Conference

Conference15th European Signal Processing Conference, EUSIPCO 2007
Country/TerritoryPoland
CityPoznan
Period3/09/077/09/07

Fingerprint

Dive into the research topics of 'Visual analysis for drum sequence transcription'. Together they form a unique fingerprint.

Cite this