• DocumentCode
    3430297
  • Title

    THUEE system for the Albayzin 2012 language recognition evaluation

  • Author

    Weiwei Liu ; Wei-Qiang Zhang ; Liang He ; Jiaming Xu ; Jia Liu

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2013
  • fDate
    6-10 July 2013
  • Firstpage
    109
  • Lastpage
    112
  • Abstract
    Albayzin 2012 language recognition evaluation (LRE) is one of the most challenging language recognition evaluation, which is mainly reflected in: (1) the target languages are more confusable with other languages, which might push down the system performance; (2) developing and test data is heterogeneous regarding duration, number of speakers, ambient noise/music, channel conditions, etc. (3) signals may contain noise, background music and any kind of nonhuman sounds. To solve these problem, in Department of Electronic Engineering, Tsinghua University (THUEE) system we develop (1) 47-phoneme English Gaussian mixture model-hidden Markov model (GMM-HMM) decoder with background noise model for voice activity detection (2) noisy and clean model separately and fusing the weighted model (3) linear discriminant analysis-minimal mutual information (LDA-MMI)+Multifocal fusion method to improve the LRE system performance and the system yielded Fact = 0.1513 in empty training closed-set tests.
  • Keywords
    Gaussian processes; decoding; hidden Markov models; natural language processing; speech coding; speech processing; Albayzin 2012 language recognition evaluation; Department of Electronic Engineering, Tsinghua University system; GMM-HMM decoder; Gaussian mixture model-hidden Markov model; LDA-MMI; LRE; THUEE system; background noise model; empty training closed-set tests; linear discriminant analysis-minimal mutual information; multifocal fusion method; nonhuman sounds; test data; voice activity detection; weighted model; Educational institutions; Hidden Markov models; Noise measurement; Speech; Speech recognition; Training; Training data; Albayzin 2012 language recognition evaluation (LRE); Department of Electronic Engineering; Tsinghua University (THUEE);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing (ChinaSIP), 2013 IEEE China Summit & International Conference on
  • Conference_Location
    Beijing
  • Type

    conf

  • DOI
    10.1109/ChinaSIP.2013.6625308
  • Filename
    6625308