Title :
Polyphonic music transcription employing max-margin classification of spectrograhic features
Author :
Gang, Ren ; Bocko, Mark F. ; Headlam, Dave ; Lundberg, Justin
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Rochester, Rochester, NY, USA
Abstract :
In this paper we present a transcription method for polyphonic music. The short time Fourier transform is used first to decompose an acoustic signal into sonic partials in a time-frequency representation. In general the segmented partials exhibit distinguishable features if they originate from different ldquovoicesrdquo in the polyphonic mix. We define feature vectors and utilize a max-margin classification algorithm to produce classification labels to serve as grouping cues, i.e., to decide which partials should be assigned to each voice. These classification labels are then used in statistical optimal grouping decisions and confidence levels are assigned to each decision. This classification algorithm shows promising results for the musical source separation.
Keywords :
Fourier transforms; acoustic signal processing; music; signal classification; source separation; spectroscopy; time-frequency analysis; Fourier transform; acoustic signal; confidence levels; max-margin classification; musical source separation; polyphonic music transcription; spectrograhic features; statistical optimal grouping decision; time-frequency representation; Acoustical engineering; Classification algorithms; Feature extraction; Fourier transforms; Instruments; Multiple signal classification; Music; Pattern classification; Signal processing algorithms; Time frequency analysis; classification; feature extraction; polyphonic music transcription; segmentation; short time Fourier transform;
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. IEEE Workshop on
Conference_Location :
New Paltz, NY
Print_ISBN :
978-1-4244-3678-1
Electronic_ISBN :
1931-1168
DOI :
10.1109/ASPAA.2009.5346475