DocumentCode :
3571026
Title :
Floating to Fixed-Point Translation with Its Application to Speech-Based Emotion Recognition
Author :
Kabi, Bibek ; Sahoo, Subhasmita ; Samantaray, Amiya Kumar ; Routray, Aurobinda
Author_Institution :
Adv. Technol. Dev. Centre, Indian Inst. of Technol. Kharagpur, Kharagpur, India
fYear :
2014
Firstpage :
21
Lastpage :
26
Abstract :
Speech-based emotion recognition is one of the latest challenges in speech processing. The algorithms are developed using floating-point arithmetic because of its wide dynamic range and constant relative accuracy. However, they are finally implemented in hand held devices which are required to consume less power, time and have a lower market price. Fixed-point arithmetic with proper determination of integer and fractional bitwidths can help in satisfying these requirements. Therefore we have made an attempt to develop a fixed-point speech-based emotion recognition system using Mel frequency cepstral coefficients (MFCCs) and hidden Markov model (HMM). Accurate range and precision analysis has been carried out to compute optimum integer and fractional word lengths. The speech emotion engine has been evaluated using Berlin emotional speech database, EMO-DB. A speaker independent emotion recognition accuracy of 71.02% and 67.42% for floating-point and fixed-point formats with optimized wordlenghs respectively was achieved. Finite wordlength effect like quantization with range of relative errors and its effect on emotion recognition task has been analyzed.
Keywords :
cepstral analysis; emotion recognition; floating point arithmetic; hidden Markov models; speech recognition; Berlin emotional speech database; EMO-DB; HMM; MFCCs; Mel frequency cepstral coefficients; finite wordlength effect; fixed-point arithmetic; fixed-point speech-based emotion recognition system; fixed-point translation; floating-point arithmetic; hidden Markov model; precision analysis; speaker independent emotion recognition accuracy; speech emotion engine; speech processing; Accuracy; Emotion recognition; Hidden Markov models; Mel frequency cepstral coefficient; Quantization (signal); Speech; Speech recognition; Fixed-point arithmetic; hidden Markov model (HMM); mel-frequency cepstral coeffcients (MFCCs); quantization; range estimation; speech-based emotion recognition; wordlength optimization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Emerging Applications of Information Technology (EAIT), 2014 Fourth International Conference of
Type :
conf
DOI :
10.1109/EAIT.2014.57
Filename :
7052017
Link To Document :
بازگشت