DocumentCode :
506888
Title :
Emotion Recognition and Conversion for Mandarin Speech
Author :
Zhou, Yu ; Zhang, Jianping ; Wang, Ling ; Yan, Yonghong
Author_Institution :
ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
Volume :
1
fYear :
2009
fDate :
14-16 Aug. 2009
Firstpage :
179
Lastpage :
183
Abstract :
This study introduces research on expressive speech recognition and conversion. A database covering five speech emotions (happiness, sadness, surprise, anger, and neutral) is used. Besides traditional features such as MFCC, PLP, and pitch, a new feature extraction method based on Fisher's F-Ratio is proposed and reported. In our experiments, various combinations of these features, including their high-order features, are applied with GMM modeling for Mandarin expressive speech recognition. We also present some results on emotional speech conversion with a pitch target model.
Keywords :
emotion recognition; natural language processing; speech recognition; Fisher F-Ratio; Mandarin speech; expressive speech conversion; expressive speech recognition; feature extraction; speech emotions; Acoustics; Application software; Emotion recognition; Feature extraction; Fuzzy systems; Loudspeakers; Man machine systems; Spatial databases; Speech analysis; Speech recognition
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
Type :
conf
DOI :
10.1109/FSKD.2009.474
Filename :
5358615