• DocumentCode
    2451716
  • Title

    A dynamic gap dimension reduction approach for high order n-gram phonotactic language recognition

  • Author

    Liu, Weiwei ; Zhang, Wei-Qiang ; Liu, Jia

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2012
  • fDate
    16-18 July 2012
  • Firstpage
    971
  • Lastpage
    975
  • Abstract
    In this paper, we demonstrate the feasibility and usefulness of using high-order n-grams for n = 4, 5, 6,7 in SVM-based phonotactic language recognition by using a dynamic gap dimension reduction algorithm, which can reduce the affection of insert error as the same time. For evaluating the approaches, experiments were carried out on the NIST-LRE2009 database in which systems were built by means of open software (HTK, SRILM) and an English phone decoder. The experimental results on NIST-LRE2009 30s, 10s, 3s closed-set test by proposed method are 2.77%, 7.87%, 19.73% (meaning a relative reduction of 6.73%, 5.74%, 8.44% compared with the baseline system) in terms of equal error rate (EER) and the fusion of the systems with the proposed approaches yielded 2.26%, 7.62%, 18.06%. Details of implementation and experimental results are presented in this paper.
  • Keywords
    natural language processing; public domain software; speech recognition; EER; English phone decoder; HTK; NIST-LRE2009 database; SRILM; SVM-based phonotactic language recognition; dynamic gap dimension reduction algorithm; equal error rate; high order n-gram phonotactic language recognition; insert error affection reduction; open software; Heuristic algorithms; Kernel; Lattices; NIST; Speech recognition; Support vector machines; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio, Language and Image Processing (ICALIP), 2012 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-0173-2
  • Type

    conf

  • DOI
    10.1109/ICALIP.2012.6376755
  • Filename
    6376755