Title :
Improvement of Esophageal Speech using LPC and LF Model
Author :
Sirichokswad, R. ; Charoensuk, Warakorn ; Boonpramuk, Panuthat ; Kasemkosin, N. ; Szu, Harold H.
Author_Institution :
Mahidol Univ., Bangkok
Abstract :
Esophageal speech is a restoration of speech communication in laryngectomized patient. Due to irregular pharyngoesophageal (PE) segment vibration and aerodynamic limitation, the esophageal phonation provides higher volatility of the fundamental frequency (f 0) compared with laryngeal phonation. It is difficult to determine an accurate f 0 in esophageal speech. This paper focuses on algorithms for f 0 modification. Linear predictive coding (LPC) and autocorrelation function are used to calculate the f 0 They are well performed in the normal case. However, the determination results of f 0 in esophageal speech are highly unstable without any modification of a conventional LPC technique. By proposing a smoothing technique, an accurate f 0 and pitch period in esophageal speech can be determined and used in LF model. Experimental results from 18 subjects suggest that average f 0 of esophageal phonation is lower than laryngeal phonation. The speech synthesized using this proposing technique produced better sound quality than un-processed esophageal speech.
Keywords :
linear predictive coding; medical computing; smoothing methods; speech; speech processing; LF model; LPC model; autocorrelation function; esophageal phonation; esophageal speech fundamental frequency; esophageal speech improvement; esophageal speech pitch period; fundamental frequency modification; laryngeal phonation; laryngectomized patient; linear predictive coding; pharyngoesophageal segment vibration; smoothing technique; speech communication restoration;
Conference_Titel :
Biomedical and Pharmaceutical Engineering, 2006. ICBPE 2006. International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-981-05-79
Electronic_ISBN :
81-904262-1-4