Abstract:
This paper presents a novel drum transcription system for polyphonic music. A band-wise harmonic/noise decomposition suppresses the deterministic part of the signal, which is contributed mainly by non-rhythmic instruments. Transcription is then performed on the residual noise signal, which contains most of the rhythmic information. This signal is segmented, and the events associated with each onset are classified by support vector machines (SVM) with probabilistic outputs. The features used for classification are extracted directly from the sub-band signals. An additional pre-processing stage, in which the instances are reclassified using a localized model, was also tested. The transcription method is evaluated on ten test sequences, each performed by two drummers and available with different mixing settings. The whole system achieves precision and recall rates of 84% for the bass drum and snare drum detection tasks.
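As an illustration of how precision and recall figures of this kind are typically obtained for an onset-based transcription, the following minimal sketch matches detected drum events against reference annotations within a time tolerance. The function name `precision_recall`, the 50 ms tolerance, and the greedy one-to-one matching are assumptions made for this example; the abstract does not specify the evaluation protocol actually used.

```python
def precision_recall(detected, reference, tolerance=0.05):
    """Compare detected onset times with reference onset times (in seconds).

    Assumption: a detection counts as correct when it lies within
    `tolerance` seconds of a not-yet-matched reference onset. The 50 ms
    default is a hypothetical value, not one taken from the paper.
    """
    detected = sorted(detected)
    reference = sorted(reference)
    matched = 0
    i = j = 0
    # Walk both sorted lists in parallel, pairing each onset at most once.
    while i < len(detected) and j < len(reference):
        delta = detected[i] - reference[j]
        if abs(delta) <= tolerance:
            matched += 1
            i += 1
            j += 1
        elif delta < 0:
            i += 1  # detection precedes any usable reference: false positive
        else:
            j += 1  # reference onset has no detection: missed event
    precision = matched / len(detected) if detected else 0.0
    recall = matched / len(reference) if reference else 0.0
    return precision, recall


# Made-up onset times for a single instrument (e.g. bass drum).
detected_onsets = [0.51, 1.02, 1.80, 2.49]
reference_onsets = [0.50, 1.00, 1.50, 2.00, 2.50]
precision, recall = precision_recall(detected_onsets, reference_onsets)
```

In this toy case three of the four detections fall within the tolerance of a reference onset, and three of the five reference onsets are found. Computing the two rates separately for each instrument class mirrors the way the abstract reports them for the bass drum and snare drum tasks.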