A dynamic gap dimension reduction approach for high order n-gram phonotactic language recognition

Author

Liu, Weiwei ; Zhang, Wei-Qiang ; Liu, Jia

Author_Institution

Dept. of Electron. Eng., Tsinghua Univ., Beijing, China

fYear

2012

fDate

16-18 July 2012

Firstpage

971

Lastpage

975

Abstract

In this paper, we demonstrate the feasibility and usefulness of using high-order n-grams for n = 4, 5, 6,7 in SVM-based phonotactic language recognition by using a dynamic gap dimension reduction algorithm, which can reduce the affection of insert error as the same time. For evaluating the approaches, experiments were carried out on the NIST-LRE2009 database in which systems were built by means of open software (HTK, SRILM) and an English phone decoder. The experimental results on NIST-LRE2009 30s, 10s, 3s closed-set test by proposed method are 2.77%, 7.87%, 19.73% (meaning a relative reduction of 6.73%, 5.74%, 8.44% compared with the baseline system) in terms of equal error rate (EER) and the fusion of the systems with the proposed approaches yielded 2.26%, 7.62%, 18.06%. Details of implementation and experimental results are presented in this paper.

Keywords

natural language processing; public domain software; speech recognition; EER; English phone decoder; HTK; NIST-LRE2009 database; SRILM; SVM-based phonotactic language recognition; dynamic gap dimension reduction algorithm; equal error rate; high order n-gram phonotactic language recognition; insert error affection reduction; open software; Heuristic algorithms; Kernel; Lattices; NIST; Speech recognition; Support vector machines; Vectors;

fLanguage

English

Publisher

ieee

Conference_Titel

Audio, Language and Image Processing (ICALIP), 2012 International Conference on

Conference_Location

Shanghai

Print_ISBN

978-1-4673-0173-2

Type

conf

DOI

10.1109/ICALIP.2012.6376755

Filename

6376755