DocumentCode
699196
Title
Several features for discrimination between vocal sounds and other environmental sounds
Author
Shi Yuan-Yuan ; Wen Xue ; She Bin
Author_Institution
HCI Lab., Samsung Adv. Inst. of Technol., Yongin, South Korea
fYear
2004
fDate
6-10 Sept. 2004
Firstpage
2099
Lastpage
2102
Abstract
Several features are found to discriminate between the vocals sounds and other environmental sounds. The vocal sounds include speaking, laughter, etc., 23 kinds of nonverbal and verbal sounds; and the environmental sounds are recorded in domestic environments. The discriminative features are selected from 22 kinds of features. They are the speech recognition features of LPCC and MFCC, time-spectral features from FFT, statistics of pitch values and contour, ratio of voiced and unvoiced segments, and spectrum of pitch contour. The 9 features calculated from pitch contours perform much better than the features calculated from spectrums, which show no discriminability. The classification is performed simply by a neural network to evaluate the performance of the 9 features. They are tested on a 21 CDs environmental sound database. And the hit rate of 9 8.7 3% with the false alarm rate of 11 % are obtained. The classification result confirms the effectiveness and efficiency of the features.
Keywords
fast Fourier transforms; feature extraction; neural nets; speech recognition; FFT; LPCC; MFCC; environmental sounds; neural network; pitch contours; speech recognition features; time-spectral features; vocal sounds; Abstracts; Artificial neural networks; Inductors; Manganese; Mel frequency cepstral coefficient; Pediatrics;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2004 12th European
Conference_Location
Vienna
Print_ISBN
978-320-0001-65-7
Type
conf
Filename
7079726
Link To Document