Title :
Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM
Author :
Rasipuram, Ramya ; Doss, Mathew Magimai
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
Abstract :
This paper proposes a novel grapheme-to-phoneme (G2P) conversion approach where first the probabilistic relation between graphemes and phonemes is captured from acoustic data using Kullback-Leibler divergence based hidden Markov model (KL-HMM) system. Then, through a simple decoding framework the information in this probabilistic relation is integrated with the sequence information in the orthographic transcription of the word to infer the phoneme sequence. One of the main application of the proposed G2P approach is in the area of low linguistic resource based automatic speech recognition or text-to-speech systems. We demonstrate this potential through a simulation study where linguistic resources from one domain is used to create linguistic resources for a different domain.
Keywords :
decoding; hidden Markov models; speech recognition; speech synthesis; KL-HMM; Kullback Leibler divergence; acoustic data driven; automatic speech recognition; decoding framework; grapheme-to-phoneme conversion; hidden Markov model; linguistic resources; probabilistic relation; sequence information; text-to-speech systems; Acoustics; Biological system modeling; Context; Context modeling; Dictionaries; Hidden Markov models; Pragmatics; Kullback-Leibler divergence based HMM; Lexicon; grapheme; grapheme-to-phoneme converter; multilayer perceptron; phoneme;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289003