مرکز منطقه ای اطلاع رساني علوم و فناوري - Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM

DocumentCode :

3165829

Title :

Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM

Author :

Rasipuram, Ramya ; Doss, Mathew Magimai

Author_Institution :

Idiap Res. Inst., Martigny, Switzerland

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4841

Lastpage :

4844

Abstract :

This paper proposes a novel grapheme-to-phoneme (G2P) conversion approach where first the probabilistic relation between graphemes and phonemes is captured from acoustic data using Kullback-Leibler divergence based hidden Markov model (KL-HMM) system. Then, through a simple decoding framework the information in this probabilistic relation is integrated with the sequence information in the orthographic transcription of the word to infer the phoneme sequence. One of the main application of the proposed G2P approach is in the area of low linguistic resource based automatic speech recognition or text-to-speech systems. We demonstrate this potential through a simulation study where linguistic resources from one domain is used to create linguistic resources for a different domain.

Keywords :

decoding; hidden Markov models; speech recognition; speech synthesis; KL-HMM; Kullback Leibler divergence; acoustic data driven; automatic speech recognition; decoding framework; grapheme-to-phoneme conversion; hidden Markov model; linguistic resources; probabilistic relation; sequence information; text-to-speech systems; Acoustics; Biological system modeling; Context; Context modeling; Dictionaries; Hidden Markov models; Pragmatics; Kullback-Leibler divergence based HMM; Lexicon; grapheme; grapheme-to-phoneme converter; multilayer perceptron; phoneme;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6289003

Filename :

6289003

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3165829