• DocumentCode
    2801030
  • Title

    Statistical approach to enhancing esophageal speech based on Gaussian mixture models

  • Author

    Doi, Hironori ; Nakamura, Keigo ; Toda, Tomoki ; Saruwatari, Hiroshi ; Shikano, Kiyohiro

  • Author_Institution
    Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4250
  • Lastpage
    4253
  • Abstract
    This paper presents a novel method of enhancing esophageal speech using statistical voice conversion. Esophageal speech is one of the alternative speaking methods for laryngectomees. Although it doesn´t require any external devices, generated voices sound unnatural. To improve the intelligibility and naturalness of esophageal speech, we propose a voice conversion method from esophageal speech into normal speech. A spectral parameter and excitation parameters of target normal speech are separately estimated from a spectral parameter of the esophageal speech based on Gaussian mixture models. The experimental results demonstrate that the proposed method yields significant improvements in intelligibility and naturalness. We also apply one-to-many eigenvoice conversion to esophageal speech enhancement for flexibly controlling enhanced voice quality.
  • Keywords
    Gaussian distribution; speech; speech enhancement; Gaussian mixture models; alternative speaking methods; enhanced voice quality; esophageal speech enhancement; excitation parameters; laryngectomees; normal speech; one-to-many eigenvoice conversion; spectral parameter; statistical voice conversion; Acoustic noise; Degradation; Esophagus; Information science; Loudspeakers; Noise generators; Speech analysis; Speech enhancement; Speech processing; Virtual colonoscopy; eigenvoice conversion; esophageal speech; laryngectomees; speech enhancement; voice conversion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495676
  • Filename
    5495676