Title :
Evaluation of objective measures for intelligibility prediction of HMM-based synthetic speech in noise
Author :
Valentini-Botinhao, Cassia ; Yamagishi, Junichi ; King, Simon
Author_Institution :
Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
Abstract :
In this paper we evaluate four objective measures of speech with regards to intelligibility prediction of synthesized speech in diverse noisy situations. We evaluated three intelligibility measures, the Dau measure, the glimpse proportion and the Speech Intelligibility Index (SII) and a quality measure, the Perceptual Evaluation of Speech Quality (PESQ). For the generation of synthesized speech we used a state of the art HMM-based speech synthesis system. The noisy conditions comprised four additive noises. The measures were compared with subjective intelligibility scores obtained in listening tests. The results show the Dau and the glimpse measures to be the best predictors of intelligibility, with correlations of around 0.83 to subjective scores. All measures gave less accurate predictions of intelligibility for synthetic speech than have previously been found for natural speech; in particular the SII measure. In additional experiments, we processed the synthesized speech by an ideal binary mask before adding noise. The Glimpse measure gave the most accurate intelligibility predictions in this situation.
Keywords :
hidden Markov models; speech intelligibility; speech synthesis; HMM-based speech synthesis system; HMM-based synthetic speech; PESQ; SII ideal binary mask; additive noises; intelligibility prediction; Correlation; Hidden Markov models; Noise; Noise measurement; Root mean square; Speech; Speech synthesis; HMM-based speech synthesis; objective measures for speech intelligibility;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947507