Title :
Sinusoidal model-based analysis and classification of stressed speech
Author :
Ramamohan, S. ; Dandapat, S.
Author_Institution :
Intel Technol. India Pvt. Ltd., Bangalore, India
fDate :
5/1/2006 12:00:00 AM
Abstract :
In this paper, a sinusoidal model has been proposed for characterization and classification of different stress classes (emotions) in a speech signal. Frequency, amplitude and phase features of the sinusoidal model are analyzed and used as input features to a stressed speech recognition system. The performances of sinusoidal model features are evaluated for recognition of different stress classes with a vector-quantization classifier and a hidden Markov model classifier. To find the effectiveness of these features for recognition of different emotions in different languages, speech signals are recorded and tested in two languages, Telugu (an Indian language) and English. Average stressed speech index values are proposed for comparing differences between stress classes in a speech signal. Results show that sinusoidal model features are successful in characterizing different stress classes in a speech signal. Sinusoidal features perform better compared to the linear prediction and cepstral features in recognizing the emotions in a speech signal.
Keywords :
hidden Markov models; speech coding; speech recognition; vector quantisation; amplitude features; hidden Markov model classifier; phase features; sinusoidal model-based analysis; stressed speech classification; stressed speech recognition system; vector-quantization classifier; Cepstral analysis; Emotion recognition; Frequency; Hidden Markov models; Natural languages; Performance evaluation; Speech analysis; Speech recognition; Stress; Testing; Emotion characterization; sinusoidal model; speech recognition; stressed speech; stressed speech index;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TSA.2005.858071