مرکز منطقه ای اطلاع رساني علوم و فناوري - Reconstruction of dysphonic speech for synthesizing normally phonated speech

DocumentCode :

3632073

Title :

Reconstruction of dysphonic speech for synthesizing normally phonated speech

Author :

H. Irem Turkmen;M. Elif Karsligil

Author_Institution :

Bilgisayar M?hendisli?i B?l?m?, Y?ld?z Teknik ?niversitesi, Turkey

fYear :

2009

fDate :

4/1/2009 12:00:00 AM

Firstpage :

632

Lastpage :

635

Abstract :

In this study, a novel system, delivering synthetic speech with the quality near to natural, is designed and implemented by reconstructing dysphonic speech of patients that have lost their voice totally due to apoplectic chordae vocalis, organic lesions of vocal cords or partial laryngectomy. In the proposed system, MELP (Mixed Excitation Linear Prediction) is used for synthesizing the normal speech. The unvoiced phonemes are determined in dysphonic speech and synthetic pitch is generated by using pitch-formant frequency relation and then formant distortion modification is applied and voicing is added for the phonemes other than unvoiced phonemes. MELP will be used for synthesizing enhanced speech by using modified acoustic features. Spectral distance measurement is made and subjective listening tests are applied for assessing the produced synthetic speech by our proposed system. Our tests show that, synthetic speech produced this way is preferable when compared to dysphonic speech especially in terms of timbre, recognition and naturality.

Keywords :

"Speech synthesis","Acoustic testing","Lesions","Frequency","Acoustic distortion","Speech enhancement","Distance measurement","System testing","Timbre","Speech recognition"

Publisher :

ieee

Conference_Titel :

Signal Processing and Communications Applications Conference, 2009. SIU 2009. IEEE 17th

ISSN :

2165-0608

Print_ISBN :

978-1-4244-4435-9

Type :

conf

DOI :

10.1109/SIU.2009.5136475

Filename :

5136475

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3632073