DocumentCode
3165829
Title
Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM
Author
Rasipuram, Ramya ; Doss, Mathew Magimai
Author_Institution
Idiap Res. Inst., Martigny, Switzerland
fYear
2012
fDate
25-30 March 2012
Firstpage
4841
Lastpage
4844
Abstract
This paper proposes a novel grapheme-to-phoneme (G2P) conversion approach where first the probabilistic relation between graphemes and phonemes is captured from acoustic data using Kullback-Leibler divergence based hidden Markov model (KL-HMM) system. Then, through a simple decoding framework the information in this probabilistic relation is integrated with the sequence information in the orthographic transcription of the word to infer the phoneme sequence. One of the main application of the proposed G2P approach is in the area of low linguistic resource based automatic speech recognition or text-to-speech systems. We demonstrate this potential through a simulation study where linguistic resources from one domain is used to create linguistic resources for a different domain.
Keywords
decoding; hidden Markov models; speech recognition; speech synthesis; KL-HMM; Kullback Leibler divergence; acoustic data driven; automatic speech recognition; decoding framework; grapheme-to-phoneme conversion; hidden Markov model; linguistic resources; probabilistic relation; sequence information; text-to-speech systems; Acoustics; Biological system modeling; Context; Context modeling; Dictionaries; Hidden Markov models; Pragmatics; Kullback-Leibler divergence based HMM; Lexicon; grapheme; grapheme-to-phoneme converter; multilayer perceptron; phoneme;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6289003
Filename
6289003
Link To Document