Title :
A joint acoustic and phonological approach to speech intelligibility assessment
Author :
Nemala, Sridhar Krishna ; Elhilali, Mounya
Author_Institution :
Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
While current models of speech intelligibility rely on intricate acoustic analyses of speech attributes, they are limited by the lack of any linguistic information; they therefore fail to capture the natural variability of speech sounds, and their applicability is confined to average intelligibility assessments. Another important limitation is that existing models rely on reference clean speech templates (or average profiles). In this work, we propose a novel approach to speech intelligibility that combines a biologically-inspired acoustic analysis of peripheral and cortical processing with phonological statistical models of speech using a hybrid GMM-SVM system. The model yields a scheme for speech intelligibility assessment that does not require reference clean speech templates, and its predictions correlate strongly with scores obtained from human listeners under a variety of realistic listening environments. We further show that the proposed model enables local-level tracking of intelligibility and generalizes well to multiple speech corpora.
Keywords :
hearing; linguistics; speech; speech intelligibility; biologically-inspired acoustic analysis; cortical processing; human listeners; hybrid GMM-SVM system; joint acoustic-phonological approach; linguistic information; local level tracking; multiple speech corpora; peripheral processing; phonological statistical model; reference clean speech templates; speech attributes; speech intelligibility assessment; Biological system modeling; Brain modeling; Filters; Frequency; Humans; Predictive models; Psychoacoustic models; Speech analysis; Speech processing; Tensile stress; Speech intelligibility; hybrid GMM-SVM; psychoacoustic; spectro-temporal; statistical model
Conference_Title :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495170