DocumentCode
2715420
Title
Spectral Enhancement of Whispered Speech Based on Probability Mass Function
Author
Sharifzadeh, Hamid Reza ; McLoughlin, Ian Vince ; Ahmadi, Farzaneh
Author_Institution
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear
2010
fDate
9-15 May 2010
Firstpage
207
Lastpage
211
Abstract
Whispered speech can be effectively used for quiet and private communications over mobile phones and is also the communication means for ENT patients under a regime of voice rest. The reconstruction of natural sounding speech from such whispers can be useful for several types of application across different scientific fields ranging from communications to biomedical engineering. Despite the useful applications for a such technology, the reconstruction of natural speech from whispers has received relatively little research effort to date. This paper presents novel methods for spectral enhancement and formant smoothing with the aim of attaining more natural sounding speech within the reconstruction process. The proposed approach uses a probability mass-density function to identify a reliable formant trajectory through whispers and apply vocal modifications accordingly. Subjective evaluation experiments were performed, and are reported, to assess the performance of the techniques. A method for the near real-time conversion of whispers to normal phonated speech through a modified CELP codec has been discussed in our previously published work which, the proposed formant modification approach in this paper builds upon.
Keywords
speech enhancement; ENT patients; biomedical engineering; formant smoothing; mobile phones; natural sounding speech; private communications; probability mass-density function; quiet communications; spectral enhancement; whispered speech; Frequency estimation; Mobile communication; Mobile handsets; Natural languages; Smoothing methods; Speech codecs; Speech coding; Speech enhancement; Speech processing; Working environment noise; CELP codec; formant trajectory; linear predictive coding; spectral enhancement; whispered speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Telecommunications (AICT), 2010 Sixth Advanced International Conference on
Conference_Location
Barcelona
Print_ISBN
978-1-4244-6748-8
Type
conf
DOI
10.1109/AICT.2010.47
Filename
5489846
Link To Document