DocumentCode :
3632073
Title :
Reconstruction of dysphonic speech for synthesizing normally phonated speech
Author :
H. Irem Turkmen;M. Elif Karsligil
Author_Institution :
Bilgisayar M?hendisli?i B?l?m?, Y?ld?z Teknik ?niversitesi, Turkey
fYear :
2009
fDate :
4/1/2009 12:00:00 AM
Firstpage :
632
Lastpage :
635
Abstract :
In this study, a novel system, delivering synthetic speech with the quality near to natural, is designed and implemented by reconstructing dysphonic speech of patients that have lost their voice totally due to apoplectic chordae vocalis, organic lesions of vocal cords or partial laryngectomy. In the proposed system, MELP (Mixed Excitation Linear Prediction) is used for synthesizing the normal speech. The unvoiced phonemes are determined in dysphonic speech and synthetic pitch is generated by using pitch-formant frequency relation and then formant distortion modification is applied and voicing is added for the phonemes other than unvoiced phonemes. MELP will be used for synthesizing enhanced speech by using modified acoustic features. Spectral distance measurement is made and subjective listening tests are applied for assessing the produced synthetic speech by our proposed system. Our tests show that, synthetic speech produced this way is preferable when compared to dysphonic speech especially in terms of timbre, recognition and naturality.
Keywords :
"Speech synthesis","Acoustic testing","Lesions","Frequency","Acoustic distortion","Speech enhancement","Distance measurement","System testing","Timbre","Speech recognition"
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications Applications Conference, 2009. SIU 2009. IEEE 17th
ISSN :
2165-0608
Print_ISBN :
978-1-4244-4435-9
Type :
conf
DOI :
10.1109/SIU.2009.5136475
Filename :
5136475
Link To Document :
بازگشت