DocumentCode
2451716
Title
A dynamic gap dimension reduction approach for high order n-gram phonotactic language recognition
Author
Liu, Weiwei ; Zhang, Wei-Qiang ; Liu, Jia
Author_Institution
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear
2012
fDate
16-18 July 2012
Firstpage
971
Lastpage
975
Abstract
In this paper, we demonstrate the feasibility and usefulness of using high-order n-grams for n = 4, 5, 6,7 in SVM-based phonotactic language recognition by using a dynamic gap dimension reduction algorithm, which can reduce the affection of insert error as the same time. For evaluating the approaches, experiments were carried out on the NIST-LRE2009 database in which systems were built by means of open software (HTK, SRILM) and an English phone decoder. The experimental results on NIST-LRE2009 30s, 10s, 3s closed-set test by proposed method are 2.77%, 7.87%, 19.73% (meaning a relative reduction of 6.73%, 5.74%, 8.44% compared with the baseline system) in terms of equal error rate (EER) and the fusion of the systems with the proposed approaches yielded 2.26%, 7.62%, 18.06%. Details of implementation and experimental results are presented in this paper.
Keywords
natural language processing; public domain software; speech recognition; EER; English phone decoder; HTK; NIST-LRE2009 database; SRILM; SVM-based phonotactic language recognition; dynamic gap dimension reduction algorithm; equal error rate; high order n-gram phonotactic language recognition; insert error affection reduction; open software; Heuristic algorithms; Kernel; Lattices; NIST; Speech recognition; Support vector machines; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4673-0173-2
Type
conf
DOI
10.1109/ICALIP.2012.6376755
Filename
6376755
Link To Document