DocumentCode :
699196
Title :
Several features for discrimination between vocal sounds and other environmental sounds
Author :
Shi Yuan-Yuan ; Wen Xue ; She Bin
Author_Institution :
HCI Lab., Samsung Adv. Inst. of Technol., Yongin, South Korea
fYear :
2004
fDate :
6-10 Sept. 2004
Firstpage :
2099
Lastpage :
2102
Abstract :
Several features are found to discriminate between the vocals sounds and other environmental sounds. The vocal sounds include speaking, laughter, etc., 23 kinds of nonverbal and verbal sounds; and the environmental sounds are recorded in domestic environments. The discriminative features are selected from 22 kinds of features. They are the speech recognition features of LPCC and MFCC, time-spectral features from FFT, statistics of pitch values and contour, ratio of voiced and unvoiced segments, and spectrum of pitch contour. The 9 features calculated from pitch contours perform much better than the features calculated from spectrums, which show no discriminability. The classification is performed simply by a neural network to evaluate the performance of the 9 features. They are tested on a 21 CDs environmental sound database. And the hit rate of 9 8.7 3% with the false alarm rate of 11 % are obtained. The classification result confirms the effectiveness and efficiency of the features.
Keywords :
fast Fourier transforms; feature extraction; neural nets; speech recognition; FFT; LPCC; MFCC; environmental sounds; neural network; pitch contours; speech recognition features; time-spectral features; vocal sounds; Abstracts; Artificial neural networks; Inductors; Manganese; Mel frequency cepstral coefficient; Pediatrics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2004 12th European
Conference_Location :
Vienna
Print_ISBN :
978-320-0001-65-7
Type :
conf
Filename :
7079726
Link To Document :
بازگشت