Title :
HMM-based Korean speech synthesis system for hand-held devices
Author :
Kim, Sang-Jin ; Kim, Jong-Jin ; Hahn, Minsoo
Author_Institution :
Laboratory of Speech & Audio Information, Information & Communications University, Daejeon
Abstract :
A speech interface may be the first choice of user interface for robots and for hand-held devices such as personal digital assistants (PDAs) and portable multimedia players (PMPs). However, such devices have limited memory and computing power. Hidden Markov model (HMM)-based speech synthesis is currently considered suitable for embedded systems. This paper describes our HMM-based Korean speech synthesis system. Statistical HMMs for Korean speech units are trained on a hand-labeled speech database that includes contextual information about phonemes, word phrases, and multilevel break strength. Mel-cepstrum and line spectrum pair (LSP) parameters are compared for spectrum modeling, and two-band excitation based on the harmonic plus noise speech model is used as the mixed excitation source. The resulting small-size Korean synthesis system produced considerably high-quality speech with fairly good prosody.
Keywords :
hidden Markov models; natural language processing; speech synthesis; HMM; Korean speech synthesis system; Mel-cepstrum; PDA; contextual information; embedded systems; hand-held devices; hand-labeled speech database; harmonic plus noise speech model; hidden Markov model; line spectrum pair; mixed excitation source; multilevel break strength; personal digital assistants; phoneme; portable multimedia players; spectrum modeling; speech interface; two-band excitation; word phrase; Context modeling; Databases; Embedded system; Hidden Markov models; Orbital robotics; Personal digital assistants; Portable media players; Power system modeling; Speech synthesis; User interfaces;
Journal_Title :
Consumer Electronics, IEEE Transactions on
DOI :
10.1109/TCE.2006.273160