Title :
Objective Quality and Intelligibility Prediction for Users of Assistive Listening Devices: Advantages and limitations of existing tools
Author :
Falk, Tiago H. ; Parsa, Vijay ; Santos, Joao F. ; Arehart, Kathryn ; Hazrati, Oldooz ; Huber, Rainer ; Kates, James M. ; Scollie, Susan
Author_Institution :
Energie Mater. Telecommun. Res. Centre, Inst. Nat. de la Rech. Sci., Montreal, QC, Canada
Abstract :
This article presents an overview of 12 existing objective speech quality and intelligibility prediction tools. Two classes of algorithms are presented?intrusive and nonintrusive?with the former requiring the use of a reference signal, while the latter does not. Investigated metrics include both those developed for normal hearing (NH) listeners, as well as those tailored particularly for hearing impaired (HI) listeners who are users of assistive listening devices [i.e., hearing aids (HAs) and cochlear implants (CIs)]. Representative examples of those optimized for HI listeners include the speech-to-reverberation modulation energy ratio (SRMR), tailored to HAs (SRMR-HA) and to CIs (SRMR-CI); the modulation spectrum area (ModA); the HA speech quality (HASQI) and perception indices (HASPI); and the perception-model-based quality prediction method for hearing impairments (PEMO-Q-HI). The objective metrics are tested on three subjectively rated speech data sets covering reverberation-alone, noise-alone, and reverberation-plus-noise degradation conditions, as well as degradations resultant from nonlinear frequency compression and different speech enhancement strategies. The advantages and limitations of each measure are highlighted and recommendations are given for suggested uses of the different tools under specific environmental and processing conditions.
Keywords :
hearing aids; speech enhancement; speech intelligibility; HASQI; ModA; PEMO-Q-HI; SRMR-CI; SRMR-HA; assistive listening devices; cochlear implants; hearing aids; hearing impaired listeners; hearing impairments; modulation spectrum area; noise-alone condition; nonlinear frequency compression; normal hearing listeners; objective quality; perception indices; perception-model-based quality prediction method; reverberation-alone condition; reverberation-plus-noise degradation conditions; speech enhancement strategies; speech intelligibility prediction; speech quality prediction; speech-to-reverberation modulation energy ratio; Acoustic signal processing; Assistive devices; Auditory system; Filter banks; Modulation; Quality of service; Speech processing;
Journal_Title :
Signal Processing Magazine, IEEE
DOI :
10.1109/MSP.2014.2358871