DocumentCode
1849171
Title
ASR domain adaptation methods for low-resourced languages: Application to Romanian language
Author
Cucu, Horia ; Besacier, Laurent ; Burileanu, Corneliu ; Buzo, Andi
Author_Institution
Univ. Politeh. of Bucharest, Bucharest, Romania
fYear
2012
fDate
27-31 Aug. 2012
Firstpage
1648
Lastpage
1652
Abstract
This study investigates the possibility of using statistical machine translation to create domain-specific language resources. We propose a methodology that aims to create a domain-specific automatic speech recognition system for a low-resourced language when in-domain text corpora are available only in a high-resourced language. We evaluate a new semi-supervised method and compare it with previously developed semi-supervised and unsupervised approaches. Moreover, in the effort of creating an out-of-domain language model for Romanian, we introduce and experiment an effective diacritics restoration algorithm.
Keywords
language translation; learning (artificial intelligence); natural language processing; speech recognition; statistical analysis; ASR domain adaptation methods; Romanian language; domain specific language resource; low-resourced languages; semisupervised method; speech recognition system; statistical machine translation; text corpora; Acoustics; Adaptation models; Automatic speech recognition; Context; Hidden Markov models; Probabilistic logic; Speech; ASR domain adaptation; SMT; diacritics restoration; language modeling;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location
Bucharest
ISSN
2219-5491
Print_ISBN
978-1-4673-1068-0
Type
conf
Filename
6333947
Link To Document