• DocumentCode
    2715420
  • Title

    Spectral Enhancement of Whispered Speech Based on Probability Mass Function

  • Author

    Sharifzadeh, Hamid Reza ; McLoughlin, Ian Vince ; Ahmadi, Farzaneh

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2010
  • fDate
    9-15 May 2010
  • Firstpage
    207
  • Lastpage
    211
  • Abstract
    Whispered speech can be effectively used for quiet and private communications over mobile phones and is also the communication means for ENT patients under a regime of voice rest. The reconstruction of natural sounding speech from such whispers can be useful for several types of application across different scientific fields ranging from communications to biomedical engineering. Despite the useful applications for a such technology, the reconstruction of natural speech from whispers has received relatively little research effort to date. This paper presents novel methods for spectral enhancement and formant smoothing with the aim of attaining more natural sounding speech within the reconstruction process. The proposed approach uses a probability mass-density function to identify a reliable formant trajectory through whispers and apply vocal modifications accordingly. Subjective evaluation experiments were performed, and are reported, to assess the performance of the techniques. A method for the near real-time conversion of whispers to normal phonated speech through a modified CELP codec has been discussed in our previously published work which, the proposed formant modification approach in this paper builds upon.
  • Keywords
    speech enhancement; ENT patients; biomedical engineering; formant smoothing; mobile phones; natural sounding speech; private communications; probability mass-density function; quiet communications; spectral enhancement; whispered speech; Frequency estimation; Mobile communication; Mobile handsets; Natural languages; Smoothing methods; Speech codecs; Speech coding; Speech enhancement; Speech processing; Working environment noise; CELP codec; formant trajectory; linear predictive coding; spectral enhancement; whispered speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Telecommunications (AICT), 2010 Sixth Advanced International Conference on
  • Conference_Location
    Barcelona
  • Print_ISBN
    978-1-4244-6748-8
  • Type

    conf

  • DOI
    10.1109/AICT.2010.47
  • Filename
    5489846