DocumentCode
11238
Title
Objective Intelligibility Measures Based on Mutual Information for Speech Subjected to Speech Enhancement Processing
Author
Taghia, Jalil; Martin, Rainer
Author_Institution
Inst. of Communication Acoust., Ruhr-Univ. Bochum, Bochum, Germany
Volume
22
Issue
1
fYear
2014
fDate
Jan. 2014
Firstpage
6
Lastpage
16
Abstract
We propose a novel method for objective speech intelligibility prediction which can be useful in many application domains such as hearing instruments and forensics. Most objective intelligibility measures available in the literature employ some kind of signal-to-noise ratio (SNR) or a correlation-based comparison between the spectro-temporal representations of clean and processed speech. In this paper, we investigate speech intelligibility prediction from the viewpoint of information theory and introduce novel objective intelligibility measures based on the estimated mutual information between the temporal envelopes of clean speech and processed speech in the subband domain. Mutual information makes it possible to account for higher-order statistics and hence to consider dependencies beyond conventional second-order statistics. Using data from three different listening tests, we show that the proposed objective intelligibility measures provide promising results for speech intelligibility prediction in different speech enhancement scenarios where speech is processed by nonlinear modification strategies.
Keywords
higher order statistics; speech enhancement; speech intelligibility; SNR; estimated mutual information; hearing instruments; information theory; listening tests; nonlinear modification strategy; objective intelligibility measures; objective speech intelligibility prediction; second order statistics; signal-to-noise ratio; spectro-temporal representations; speech enhancement processing; subband domain; temporal envelopes; Correlation; Mutual information; Noise measurement; Random variables; Speech; objective measures; speech intelligibility prediction
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher
IEEE
ISSN
2329-9290
Type
jour
DOI
10.1109/TASL.2013.2281574
Filename
6600965