DocumentCode
437348
Title
A preliminary speech analysis for recognizing emotion
Author
Razak, Aishah Abd ; Abidin, Mohd Izani Zainal ; Komiya, Ryoichi
Author_Institution
Fac. of Inf. Technol., Multimedia Univ., Malaysia
fYear
2003
fDate
25-26 Aug. 2003
Firstpage
49
Lastpage
54
Abstract
Some speech analysis to extract emotion from voice is discussed. An emotional Malay and English voice database has been developed, consisting six basic emotions namely happiness, sadness, disgust, fear, anger and surprise. As the target is content independent emotion recognition, 4 short sentences that have the most natural meaning is adopted for the illustration and analysis. A study on speech prosody is done to identify the emotional features of voice. Variation on the sample´s energy, duration, and pitch for different emotions is compared. Spectrogram analysis is done on some samples to observe the effect of formant. It is found that duration, average energy and pitch can provide some indication of emotional content of a speech, but it is not enough to correctly represent the emotions. Even though there are slightly different pattern for English and Malay samples, it is still reasonable to assume that there are standard acoustic configurations in expressing particular emotions.
Keywords
emotion recognition; speech processing; speech recognition; English voice database; Malay voice database; average energy; emotion extraction; independent emotion recognition; pitch contour; spectrogram analysis; speech analysis; speech prosody; standard acoustic configurations; Automatic speech recognition; Emotion recognition; Humans; Information technology; Man machine systems; Mobile handsets; Speech analysis; Speech processing; Speech recognition; Stress;
fLanguage
English
Publisher
ieee
Conference_Titel
Research and Development, 2003. SCORED 2003. Proceedings. Student Conference on
Print_ISBN
0-7803-8173-4
Type
conf
DOI
10.1109/SCORED.2003.1459662
Filename
1459662
Link To Document