DocumentCode :
3071912
Title :
Application of neural networks in emotional speech recognition
Author :
Bojanic, Milana ; Crnojevic, Vladimir ; Delic, Vlado
Author_Institution :
Fac. of Tech. Sci., Univ. of Novi Sad, Novi Sad, Serbia
fYear :
2012
fDate :
20-22 Sept. 2012
Firstpage :
223
Lastpage :
226
Abstract :
Emotional speech recognition (ESR) from the aspect of human-machine interaction (HCI) is a prerequisite for the framework of interacting partners within the HCI. This paper addresses the application of neural network (NN) in ESR. The performance of NN is tested using three different feature sets which are basis for ESR: prosodic features, spectral features and a set of their combination. The results of these feature sets are compared using several network topologies and two training algorithms. It has been shown that using joint prosodic-spectral feature set as input to three layer feed-forward NN trained with back-propagation algorithm has the best performance in 5-class emotional speech recognition task.
Keywords :
backpropagation; emotion recognition; human computer interaction; neural nets; speech recognition; ESR; HCI; NN; backpropagation algorithm; emotional speech recognition; human-machine interaction; network topologies; neural network application; prosodic features; spectral features; Accuracy; Emotion recognition; Feature extraction; Network topology; Neurons; Speech; Speech recognition; emotional speech recognition; neural network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Network Applications in Electrical Engineering (NEUREL), 2012 11th Symposium on
Conference_Location :
Belgrade
Print_ISBN :
978-1-4673-1569-2
Type :
conf
DOI :
10.1109/NEUREL.2012.6420016
Filename :
6420016
Link To Document :
بازگشت