• DocumentCode
    514891
  • Title

    Methods to Improve Gaussian Mixture Model for Language Identification

  • Author

    Xu, Yonghua ; Yang, Jian ; Chen, Jiang

  • Author_Institution
    Sch. of Inf. Sci. & Eng., Yun Nan Univ., Kunming, China
  • Volume
    2
  • fYear
    2010
  • fDate
    13-14 March 2010
  • Firstpage
    656
  • Lastpage
    659
  • Abstract
    Multiple-language phone recognition and n-gram language modeling produce the best performance in formal language identification (LID) evaluations, but this method needs many kinds of data that were phonetically transcribed utterances. This phone-based system isn´t easily to apply for dialects or minority languages identification, because it is difficulty to obtain kinds of utterances which were orthographically or phonetically transcribed. Gaussian mixture model with the Universal Background Model (GMM-UBM), which has been successfully employed in speaker verification, is an effective approach to solve this problem. The GMM-UBM LID system, which didn´t obtain utterances that were orthographically or phonetically transcribed, reduces the time requirement for both training and testing. In this paper we proposed a new combination method utilizing Output Scores Fusion techniques for acoustic scores and language model scores to improve the performance of the GMM-UBM based LID system. Experiment results show that the combination method as described above is another efficient method for language identification problems.
  • Keywords
    Gaussian processes; natural language processing; speaker recognition; speech processing; GMM-EJBM based LED system; GMM-UBM LED system; Gaussian mixture model; formal language identification; multiple language phone recognition; n-gram language modeling; output scores fusion techniques; phone based system; speaker verification; universal background model; Automation; Data engineering; Electronic mail; Formal languages; Information science; Linear discriminant analysis; Loudspeakers; Mechatronics; Natural languages; Speech recognition; GMM Rcognizer; GMM-UBM; LDA; Language Model; Scores Fusion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Measuring Technology and Mechatronics Automation (ICMTMA), 2010 International Conference on
  • Conference_Location
    Changsha City
  • Print_ISBN
    978-1-4244-5001-5
  • Electronic_ISBN
    978-1-4244-5739-7
  • Type

    conf

  • DOI
    10.1109/ICMTMA.2010.545
  • Filename
    5459859