Title :
Reliable onset detection scheme for singing voices based on enhanced difference filtering and combined features
Author :
Lin, Bor-Shen ; Huang, Hsin-Jung
Author_Institution :
Dept. of Inf. Manage., Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan
Abstract :
Onset detection for singing voices is an important but difficult problem for note segmentation in music search or music transcription. It is primarily because the spectral envelops as well as the shapes of vocal tracks vary frequently so as to produce fake peaks in the detection function that degrade the precision rate for onset detection. This paper proposes an onset detection scheme which utilizes the enhanced difference filter to improve the detection function by accentuating the peaks at the uprising margins of energy, and the adjustment approach for onset locations to compensate the peak shifts. A binary classifier based on GMM is further used to combine relevant features of adjacent peaks so as to make more reliable final decisions. The proposed scheme can improve the detection performance significantly, and achieve 78.9% of precision rate and 76% of recall rate at 77.5% of F-measure on the QBSH database.
Keywords :
fast Fourier transforms; information retrieval; signal detection; binary classifier; enhanced difference filtering; music information retrieval; note segmentation; reliable onset detection scheme; singing voices; Decision making; Degradation; Information filtering; Information filters; Information management; Instruments; Multiple signal classification; Music information retrieval; Shape; Spatial databases; music information retrieval; note segmentation; onset detection; query by humming; query by singing;
Conference_Titel :
Wireless Communications & Signal Processing, 2009. WCSP 2009. International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-4856-2
Electronic_ISBN :
978-1-4244-5668-0
DOI :
10.1109/WCSP.2009.5371537