Title : 
Automatic language identification based on GMBM-UBBM
         
        
            Author : 
Qu, Dan ; Wang, Bingxi
         
        
            Author_Institution : 
Inst. of Inf. Eng. of Inf. Eng. Univ., Zhengzhou, China
         
        
        
        
        
        
            Abstract : 
Gaussian mixture model is an effective method for speaker-independent language identification tasks. Gaussian mixture bigram model integrates bigram time correlation to extend the GMM. A language identification algorithm of GMBM-UBBM is proposed based on GMBM and GMM-UBM and some experiments are conducted using OGI-TS multilanguage telephone speech corpus. Simulation results demonstrate the effectiveness of GMBM-UBBM for language identification tasks and use of this model allows the proposed system to distinguish between the three languages with maximal 4.378% improvement accuracy superior to conventional GMM-UBM.
         
        
            Keywords : 
Gaussian processes; natural languages; speaker recognition; GMBM-UBBM; Gaussian mixture bigram model; OGI-TS multilanguage telephone speech corpus; bigram time correlation; language identification algorithm; speaker-independent language; Banking; Emergency services; Information retrieval; Internet; Natural languages; Probability density function; Speech; Statistics; Telephony; Training data;
         
        
        
        
            Conference_Titel : 
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
         
        
            Conference_Location : 
Beijing, China
         
        
            Print_ISBN : 
0-7803-7902-0
         
        
        
            DOI : 
10.1109/NLPKE.2003.1276000