• DocumentCode
    1693003
  • Title

    Improving intelligibility in noise of HMM-generated speech via noise-dependent and -independent methods

  • Author

    Valentini-Botinhao, Cassia ; Godoy, Elizabeth ; Stylianou, Yannis ; Sauert, Bastian ; King, Simon ; Yamagishi, Junichi

  • Author_Institution
    Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
  • fYear
    2013
  • Firstpage
    7854
  • Lastpage
    7858
  • Abstract
    In order to improve the intelligibility of HMM-generated Text-to-Speech (TTS) in noise, this work evaluates several speech enhancement methods, exploring combinations of noise-independent and - dependent approaches as well as algorithms previously developed for natural speech. We evaluate one noise-dependent method proposed for TTS, based on the glimpse proportion measure, and three approaches originally proposed for natural speech - one that estimates the noise and is based on the speech intelligibility index, and two noise-independent methods based on different spectral shaping techniques followed by dynamic range compression. We demonstrate how these methods influence the average spectra for different phone classes. We then present results of a listening experiment with speech-shaped noise and a competing speaker. A few methods made the TTS voice even more intelligible than the natural one. Although noise-dependent methods did not improve gains, the intelligibility differences found in distinct noises motivates such dependency.
  • Keywords
    data compression; hidden Markov models; noise; speech coding; speech enhancement; speech intelligibility; text analysis; HMM-generated speech; dynamic range compression; natural speech; noise-dependent methods; noise-independent methods; speech enhancement methods; speech intelligibility index; speech-shaped noise; text-to-speech; Gain; Natural languages; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; HMM-based speech synthesis; Speech intelligibility in noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639193
  • Filename
    6639193