DocumentCode
2838178
Title
Language identification using discriminative weighted language models
Author
Wang, Shizhen ; Liu, Jia ; Liu, Runsheng
Author_Institution
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear
2004
fDate
15-18 Dec. 2004
Firstpage
53
Lastpage
56
Abstract
In this paper, discriminative weighted language models are proposed to better distinguish between similar languages. Through parallel phone recognizers followed by language modeling (PPRLM) system in the first stage, two best candidates are hypothesized and then processed using discriminative language models. Experimental results show that, compared with the traditional one-pass language identification (LID) systems, the proposed two-pass method can greatly improve the performance without considerably increasing the computational costs. Tested on the evaluation set of the CallFriend corpus, the final system achieved an error rate of 14.90% on the 30s 12-way close-set task.
Keywords
error statistics; linguistics; speech processing; speech recognition; 12-way close-set task; CallFriend corpus; PPRLM system; discriminative weighted language models; error rate; language identification; language modeling; parallel phone recognizers; performance; similar languages; two-pass method; Character recognition; Computational efficiency; Error analysis; Explosions; Hidden Markov models; Man machine systems; Natural languages; Speech recognition; System testing; Target recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN
0-7803-8678-7
Type
conf
DOI
10.1109/CHINSL.2004.1409584
Filename
1409584
Link To Document