Title :
Modified polyphone decision tree specialization for porting multilingual Grapheme based ASR systems to new languages
Author :
Stüker, Sebastian
Author_Institution :
Inst. für Theor. Inf., Univ. Karlsruhe, Karlsruhe
fDate :
March 31 2008-April 4 2008
Abstract :
Automatic speech recognition (ASR) systems have been developed for only a very small number of the estimated 7,000 languages in the world. To keep a digital divide from emerging between languages for which ASR systems exist and those without one, it must be possible to create ASR systems for new languages rapidly and cost-efficiently. Grapheme-based systems, which eliminate the need for a costly pronunciation dictionary, have been shown to work for a variety of languages. They are thus well suited for porting ASR systems to new languages. This paper studies the use of multilingual grapheme-based models for rapidly bootstrapping acoustic models in new languages. The cross-language performance of a standard multilingual (ML) acoustic model on a new language is improved by introducing a modified version of polyphone decision tree specialization, which improves the performance of the ML models by up to 15.5% relative.
Keywords :
acoustic signal processing; decision trees; speech recognition; ASR systems; automatic speech recognition; modified polyphone decision tree; multilingual acoustic model; multilingual grapheme models; porting multilingual grapheme; pronunciation dictionary; Acceleration; Automatic speech recognition; Costs; Cultural differences; Decision trees; Dictionaries; Natural languages; Speech recognition; Switches; Vocabulary; Automatic Speech Recognition; Grapheme based acoustic models; Multilingual ASR; Rapid Porting of ASR systems;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518593