Title :
Composite feature set for mood recognition in dialectal Assamese speech
Author :
Agarwalla, Swapna ; Sarma, Kandarpa Kumar
Author_Institution :
Dept. of Electron. & Commun. Eng., Gauhati Univ., Guwahati, India
Abstract :
Speech is a rich source of information. The speech samples can not only retain what is being spoken but also the emotional state of the speaker. In this paper, the dynamics of the prosodic features and the spectral features have been used to encode the mood content of speakers of Assamese language with dialectal components. A composite feature set has been created by fusing the spectral and the prosodic features. The performance of the system has been evaluated using two classifiers namely Recurrent Neural Network (RNN) and Feed Forward Time Delay Neural Network (FFTDNN). A comparative analysis has been made on their computational speed and recognition rates. The performance of the proposed mood verification system has also been evaluated by varying the background noise conditions.
Keywords :
emotion recognition; feedforward neural nets; natural language processing; recurrent neural nets; sensor fusion; speech coding; speech recognition; Assamese language; FFTDNN classifiers; RNN classifiers; background noise conditions; composite feature set; dialectal Assamese speech; dialectal components; feed forward time delay neural network classifiers; mood recognition; prosodic feature fusion; recurrent neural network classifiers; speaker mood content encoding; spectral feature fusion; speech samples; Artificial neural networks; Emotion recognition; Feature extraction; Mel frequency cepstral coefficient; Mood; Speech; Training; FFTDNN; MFCC; RNN; dialect; prosody; telephonic;
Conference_Titel :
Signal Processing and Integrated Networks (SPIN), 2015 2nd International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-5990-7
DOI :
10.1109/SPIN.2015.7095290