DocumentCode :
1849171
Title :
ASR domain adaptation methods for low-resourced languages: Application to Romanian language
Author :
Cucu, Horia ; Besacier, Laurent ; Burileanu, Corneliu ; Buzo, Andi
Author_Institution :
Univ. Politeh. of Bucharest, Bucharest, Romania
fYear :
2012
fDate :
27-31 Aug. 2012
Firstpage :
1648
Lastpage :
1652
Abstract :
This study investigates the possibility of using statistical machine translation to create domain-specific language resources. We propose a methodology that aims to create a domain-specific automatic speech recognition system for a low-resourced language when in-domain text corpora are available only in a high-resourced language. We evaluate a new semi-supervised method and compare it with previously developed semi-supervised and unsupervised approaches. Moreover, in the effort of creating an out-of-domain language model for Romanian, we introduce and experiment an effective diacritics restoration algorithm.
Keywords :
language translation; learning (artificial intelligence); natural language processing; speech recognition; statistical analysis; ASR domain adaptation methods; Romanian language; domain specific language resource; low-resourced languages; semisupervised method; speech recognition system; statistical machine translation; text corpora; Acoustics; Adaptation models; Automatic speech recognition; Context; Hidden Markov models; Probabilistic logic; Speech; ASR domain adaptation; SMT; diacritics restoration; language modeling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location :
Bucharest
ISSN :
2219-5491
Print_ISBN :
978-1-4673-1068-0
Type :
conf
Filename :
6333947
Link To Document :
بازگشت