DocumentCode :
1712375
Title :
Analysis of lombard and angry speech using Gaussian Mixture Models and KL divergence
Author :
Mittal, Shubham ; Vyas, Swati ; Prasanna, SRM
Author_Institution :
Electronics and Communication Engineering, Indian Institute of Technology Guwahati, India
fYear :
2013
Firstpage :
1
Lastpage :
5
Abstract :
Recognition of expressions from speech has emerged as an important research area in the recent past. However, the scientific community still faces problems in differentiating between angry and lombard speech. The objective of this work is to analyze the differences between the Lombard and angry speech using the features representing the excitation source of speech production. The instantaneous fundamental frequency, the strength of excitation and loudness measure, reflecting the sharpness of the impulse-like excitation around the epochs are used as excitation source features. The distributions curves of these three parameters are next plotted. We employ the concept of Gaussian Mixture Models (GMMs) and KL divergence (a measure of relative entropy) to calculate an exact measure of difference between angry, lombard and neutral speech with context to the aforementioned parameters and successfully show differences among the Lombard and angry speech signals at the excitation source level.
Keywords :
Frequency measurement; Gaussian mixture model; Production; Resonant frequency; Speech; Speech recognition; Angry; Divergence; GMM; KL; Lombard; Source Features;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications (NCC), 2013 National Conference on
Conference_Location :
New Delhi, India
Print_ISBN :
978-1-4673-5950-4
Electronic_ISBN :
978-1-4673-5951-1
Type :
conf
DOI :
10.1109/NCC.2013.6487985
Filename :
6487985
Link To Document :
بازگشت