DocumentCode
405640
Title
Automatic language identification based on GMBM-UBBM
Author
Qu, Dan ; Wang, Bingxi
Author_Institution
Inst. of Inf. Eng. of Inf. Eng. Univ., Zhengzhou, China
fYear
2003
fDate
26-29 Oct. 2003
Firstpage
722
Lastpage
727
Abstract
Gaussian mixture model is an effective method for speaker-independent language identification tasks. Gaussian mixture bigram model integrates bigram time correlation to extend the GMM. A language identification algorithm of GMBM-UBBM is proposed based on GMBM and GMM-UBM and some experiments are conducted using OGI-TS multilanguage telephone speech corpus. Simulation results demonstrate the effectiveness of GMBM-UBBM for language identification tasks and use of this model allows the proposed system to distinguish between the three languages with maximal 4.378% improvement accuracy superior to conventional GMM-UBM.
Keywords
Gaussian processes; natural languages; speaker recognition; GMBM-UBBM; Gaussian mixture bigram model; OGI-TS multilanguage telephone speech corpus; bigram time correlation; language identification algorithm; speaker-independent language; Banking; Emergency services; Information retrieval; Internet; Natural languages; Probability density function; Speech; Statistics; Telephony; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location
Beijing, China
Print_ISBN
0-7803-7902-0
Type
conf
DOI
10.1109/NLPKE.2003.1276000
Filename
1276000
Link To Document