DocumentCode
2857842
Title
Acoustic Feature Optimization for Emotion Affected Speech Recognition
Author
Sun, Yanqing ; Zhou, Yu ; Zhao, Qingwei ; Yan, Yonghong
Author_Institution
ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
fYear
2009
fDate
19-20 Dec. 2009
Firstpage
1
Lastpage
4
Abstract
This paper tries to deal with the problem of performance degradation in emotion affected speech recognition. The F-ratio analysis method in statistics is utilized to analyze the significance of different frequency bands for speech unit classification. The result is then used to optimize filter bank design for Mel-frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP) features respectively in emotion affected speech recognition. Under comparable conditions, the modified features get a relative 40.14% decrease for MFCC and 34.93% for PLP in sentence error rate.
Keywords
acoustic signal processing; emotion recognition; feature extraction; optimisation; speech recognition; statistical analysis; F-ratio analysis method; Mel-frequency cepstral coefficients; acoustic feature optimization; emotion affected speech recognition; filter bank design; perceptual linear prediction; performance degradation; speech unit classification; Acoustics; Algorithm design and analysis; Cepstral analysis; Emotion recognition; Filter bank; Mel frequency cepstral coefficient; Pattern recognition; Speech analysis; Speech recognition; Sun;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-4994-1
Type
conf
DOI
10.1109/ICIECS.2009.5365821
Filename
5365821
Link To Document