Title :
Emotion Recognition and Conversion for Mandarin Speech
Author :
Zhou, Yu ; Zhang, Jianping ; Wang, Ling ; Yan, Yonghong
Author_Institution :
ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
Abstract :
This study introduces research on expressive speech recognition and conversion. A database covering five speech emotions (happiness, sadness, surprise, anger, and neutral) is used. In addition to traditional features such as MFCC, PLP, and pitch, a new feature extraction method based on Fisher's F-ratio is proposed and reported. In the experiments, various combinations of these features, including their high-order features, are applied with GMM modeling for Mandarin expressive speech recognition. Results from emotional speech conversion with a pitch target model are also presented.
Keywords :
emotion recognition; natural language processing; speech recognition; Fisher F-Ratio; Mandarin speech; expressive speech conversion; expressive speech recognition; feature extraction; speech emotions; Acoustics; Application software; Fuzzy systems; Loudspeakers; Man machine systems; Spatial databases; Speech analysis
Conference_Title :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
DOI :
10.1109/FSKD.2009.474