Title :
Comparing one and two-stage acoustic modeling in the recognition of emotion in speech
Author :
Schuller, Björn ; Vlasenko, Bogdan ; Minguez, Ricardo ; Rigoll, Gerhard ; Wendemuth, Andreas
Author_Institution :
Tech. Univ. Munchen, Munich
Abstract :
In the search for a standard unit for use in recognition of emotion in speech, a whole turn, that is the full section of speech by one person in a conversation, is common. Within applications such turns often seem favorable. Yet, high effectiveness of sub-turn entities is known. In this respect a two-stage approach is investigated to provide higher temporal resolution by chunking of speech-turns according to acoustic properties, and multi-instance learning for turn-mapping after individual chunk analysis. For chunking fast pre-segmentation into emotionally quasi-stationary segments by one-pass Viterbi beam search with token passing basing on MFCC is used. Chunk analysis is realized by brute-force large feature space construction with subsequent subset selection, SVM classification, and speaker normalization. Extensive tests reveal differences compared to one-stage processing. Alternatively, syllables are used for chunking.
Keywords :
emotion recognition; pattern classification; speech recognition; support vector machines; SVM classification; brute-force large feature space construction; emotion recognition; multiinstance learning; one-pass Viterbi beam search; speaker normalization; speech recognition; subsequent subset selection; token passing; two-stage acoustic modeling; Acoustic beams; Emotion recognition; Loudspeakers; Mel frequency cepstral coefficient; Speech analysis; Speech recognition; Structural beams; Support vector machine classification; Support vector machines; Viterbi algorithm; Affective Computing; Emotion Recognition; Segmentation;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
DOI :
10.1109/ASRU.2007.4430180