DocumentCode :
3165829
Title :
Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM
Author :
Rasipuram, Ramya ; Doss, Mathew Magimai
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4841
Lastpage :
4844
Abstract :
This paper proposes a novel grapheme-to-phoneme (G2P) conversion approach where first the probabilistic relation between graphemes and phonemes is captured from acoustic data using Kullback-Leibler divergence based hidden Markov model (KL-HMM) system. Then, through a simple decoding framework the information in this probabilistic relation is integrated with the sequence information in the orthographic transcription of the word to infer the phoneme sequence. One of the main application of the proposed G2P approach is in the area of low linguistic resource based automatic speech recognition or text-to-speech systems. We demonstrate this potential through a simulation study where linguistic resources from one domain is used to create linguistic resources for a different domain.
Keywords :
decoding; hidden Markov models; speech recognition; speech synthesis; KL-HMM; Kullback Leibler divergence; acoustic data driven; automatic speech recognition; decoding framework; grapheme-to-phoneme conversion; hidden Markov model; linguistic resources; probabilistic relation; sequence information; text-to-speech systems; Acoustics; Biological system modeling; Context; Context modeling; Dictionaries; Hidden Markov models; Pragmatics; Kullback-Leibler divergence based HMM; Lexicon; grapheme; grapheme-to-phoneme converter; multilayer perceptron; phoneme;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6289003
Filename :
6289003
Link To Document :
بازگشت