DocumentCode :
575630
Title :
A study on adapting Czech automatic speech recognition system to Croatian language
Author :
Nouza, Jan ; Cerva, Petr ; Zdansky, Jindrich ; Kucharova, Michaela
Author_Institution :
Inst. of Inf. Technol. & Electron., Tech. Univ. of Liberec, Liberec, Czech Republic
fYear :
2012
fDate :
12-14 Sept. 2012
Firstpage :
227
Lastpage :
230
Abstract :
After successful adaptation of our Czech large-vocabulary speech recognition system to Slovak, we investigate the possibility to port it to another Slavic language, Croatian in this case. We describe how we build a large lexicon (recently with 255 thousand entries) and a language model from publicly available Internet sources and how an existing Czech acoustic model (AM) can be utilized for bootstrapping and training a model applicable for Croatian. For the AM adaptation we use the Croatian part of the GlobalPhone database. An independent evaluation is done on a test set made of transcribed broadcast recordings of Radio Pula. When using the original Czech acoustic model, the word error rate is 27.6%, with the model adapted to Croatian, it is reduced to 19.4%.
Keywords :
speech recognition; AM adaptation; Croatian language; Czech acoustic model; Czech large-vocabulary speech recognition system; Radio Pula; Slavic language; Slovak; adapting Czech automatic speech recognition system; bootstrapping; broadcast recordings; lexicon; successful adaptation; Acoustics; Adaptation models; Databases; Hidden Markov models; Speech; Speech recognition; Training; cross-lingual adaptation; speech recogniton;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
ELMAR, 2012 Proceedings
Conference_Location :
Zadar
ISSN :
1334-2630
Print_ISBN :
978-1-4673-1243-1
Type :
conf
Filename :
6338512
Link To Document :
بازگشت