Title :
Noise robust features for speech/music discrimination in real-time telecommunication
Author :
Fu, Zhong-hua ; Wang, Jhing-Fa ; Xie, Lei
Author_Institution :
Sch. of Comput. Sci., Northwestern Polytech. Univ., Xi´´an, China
fDate :
June 28 2009-July 3 2009
Abstract :
While many efforts have been made in the audio signal classification field, the noise interruption problem is seldom concerned so far, especially in many telecommunication applications, where a real-time and noise robust approach is needed. This paper addresses this problem by proposing two novel robust features: average pitch density (APD) and relative tonal power density (RTPD). APD refers to the differences in tone characteristics of music and speech signals, and RTPD especially focuses on the distinct properties of the percussion instruments. The comparison experiments are implemented on two databases. The first one is reorganized from the corpus collected,. The second one consists of data collected from various recording situations. The novel features are compared with several state-of-the-art features and are found to achieve significant robustness.
Keywords :
music; real-time systems; signal classification; spectral analysis; speech processing; audio signal classification field; average pitch density; music discrimination; noise interruption problem; noise robust features; percussion instruments; real-time telecommunication; relative tonal power density; speech signal; Cepstral analysis; Cepstrum; Hidden Markov models; Instruments; Mel frequency cepstral coefficient; Multiple signal classification; Music; Noise robustness; Pattern classification; Speech enhancement; Audio classification; musical system; real cepstrum; support vector machine;
Conference_Titel :
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
978-1-4244-4290-4
Electronic_ISBN :
1945-7871
DOI :
10.1109/ICME.2009.5202561