Title :
Letter-to-sound conversion using coupled Hidden Markov Models for lexicon compression
Author :
Hao Che ; Jianhua Tao ; Shifeng Pan
Author_Institution :
Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
Abstract :
Letter-to-Sound (LTS) conversion, which is used to compress the lexicon for embedded application purpose, has become an important part in Text-to-Speech (TTS) system. In this paper, coupled Hidden Markov Models (CHMM) for LTS conversion is proposed. In the phase of preprocessing, many-to-many alignment is adopted for lexicon alignment instead of one-to-one alignment which is commonly used in previous approaches. Two Hidden Markov Models (HMM) which are respectively designed to predict the best phonemic string and corresponding graphemic substring segmentation are coupled in the phase of phonemes generation. The best phonemic string as the global optimal solution is given by maximizing the joint likelihood. Both combined and separated phone/stress prediction are concerned in stress assignment. The experimental result shows the performance of our approach is better than other previous approaches.
Keywords :
hidden Markov models; maximum likelihood estimation; speech synthesis; CHMM; LTS conversion; TTS system; coupled hidden Markov models; embedded application; global optimal solution; graphemic substring segmentation; joint likelihood maximisation; letter-to-sound conversion; lexicon alignment; lexicon compression; many-to-many alignment; one-to-one alignment; phone-stress prediction; phoneme generation; phonemic string; stress assignment; text-to-speech system; Accuracy; Context; Hidden Markov models; Joints; Performance evaluation; Predictive models; Stress; Letter-to-sound conversion; coupled Hidden Markov Models; lexicon compression; many-to-many alignment; stress assignment;
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2012 International Conference on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-2811-1
Electronic_ISBN :
978-1-4673-2812-8
DOI :
10.1109/ICSDA.2012.6422464