Title :
Speaker height estimation combining GMM and linear regression subsystems
Author :
Williams, K.A. ; Hansen, John H. L.
Author_Institution :
Center for Robust Speech Syst., Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
There are both scientific and technology based motivations for establishing effective speech processing algorithms that estimate speaker traits. Estimating speaker height can assist in voice forensic analysis [1], as well as provide additional side knowledge to improve speaker ID systems, or acoustic model selection for improved speech recognition. In this study, two distinct approaches for height estimation are explored. The first approach is statistical based and incorporates acoustic models within a GMM structure, while the second is a direct speech analysis approach that employs linear regression to obtain the height directly. The accuracy and trade-offs of these systems are explored as well a fusion of the two systems using data from the TIMIT corpus (which includes ground truth on speaker height).
Keywords :
Gaussian processes; acoustic signal processing; forensic science; regression analysis; speaker recognition; GMM structure; TIMIT corpus; acoustic model selection; direct speech analysis; improved speech recognition; linear regression subsystems; speaker ID system improvement; speaker height estimation; speaker trait estimation; speech processing algorithm; statistical approach; voice forensic analysis; Accuracy; Equations; Estimation; Mathematical model; Speech; Speech processing; Training; GMM; formants; height estimation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639131