• DocumentCode
    405640
  • Title

    Automatic language identification based on GMBM-UBBM

  • Author

    Qu, Dan ; Wang, Bingxi

  • Author_Institution
    Inst. of Inf. Eng. of Inf. Eng. Univ., Zhengzhou, China
  • fYear
    2003
  • fDate
    26-29 Oct. 2003
  • Firstpage
    722
  • Lastpage
    727
  • Abstract
    Gaussian mixture model is an effective method for speaker-independent language identification tasks. Gaussian mixture bigram model integrates bigram time correlation to extend the GMM. A language identification algorithm of GMBM-UBBM is proposed based on GMBM and GMM-UBM and some experiments are conducted using OGI-TS multilanguage telephone speech corpus. Simulation results demonstrate the effectiveness of GMBM-UBBM for language identification tasks and use of this model allows the proposed system to distinguish between the three languages with maximal 4.378% improvement accuracy superior to conventional GMM-UBM.
  • Keywords
    Gaussian processes; natural languages; speaker recognition; GMBM-UBBM; Gaussian mixture bigram model; OGI-TS multilanguage telephone speech corpus; bigram time correlation; language identification algorithm; speaker-independent language; Banking; Emergency services; Information retrieval; Internet; Natural languages; Probability density function; Speech; Statistics; Telephony; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    0-7803-7902-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2003.1276000
  • Filename
    1276000