DocumentCode :
1689953
Title :
Grapheme and multilingual posterior features for under-resourced speech recognition: A study on Scottish Gaelic
Author :
Rasipuram, Ramya ; Bell, P. ; Magimai-Doss, Mathew
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
fYear :
2013
Firstpage :
7334
Lastpage :
7338
Abstract :
Standard automatic speech recognition (ASR) systems use phonemes as subword units. Thus, one of the primary resource required to build a good ASR system is a well developed phoneme pronunciation lexicon. However, under-resourced languages typically lack such lexical resources. In this paper, we investigate recently proposed grapheme-based ASR in the framework of Kullback-Leibler divergence based hidden Markov model (KL-HMM) for underresourced languages, particularly Scottish Gaelic which has no lexical resources. More specifically, we study the use of grapheme and multilingual phoneme class conditional probabilities (posterior features) as feature observations in KL-HMM. ASR studies conducted show that the proposed approach yields better system compared to the conventional HMM/GMM approach using cepstral features. Furthermore, grapheme posterior features estimated using both auxiliary data and Gaelic data yield the best system.
Keywords :
Gaussian processes; hidden Markov models; speech recognition; ASR systems; GMM approach; Gaelic data; HMM approach; KL-HMM; Kullback-Leibler divergence based hidden Markov model; Scottish Gaelic; auxiliary data; cepstral features; feature observations; grapheme posterior features; multilingual phoneme class conditional probabilities; multilingual posterior features; phoneme pronunciation lexicon; standard automatic speech recognition systems; subword units; under-resourced languages; under-resourced speech recognition; Cepstral analysis; Context; Context modeling; Hidden Markov models; Speech; Speech recognition; Automatic speech recognition; Kullback-Leibler divergence based hidden Markovmodel; Scottish Gaelic; grapheme; phoneme; posterior feature; under-resourced speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639087
Filename :
6639087
Link To Document :
بازگشت