DocumentCode
2173435
Title
A dynamic approach to the selection of high order n-grams in phonotactic language recognition
Author
Penagarikano, Mikel ; Varona, Amparo ; Rodriguez-Fuentes, Luis Javier ; Bordel, German
Author_Institution
Dept. of Electr. & Electron., Univ. of the Basque Country, Bilbao, Spain
fYear
2011
fDate
22-27 May 2011
Firstpage
4412
Lastpage
4415
Abstract
Due to computational bounds, most SVM-based phonotactic language recognition systems consider only low-order n-grams (up to n = 3), thus limiting the potential performance of this approach. The huge amount of n-grams for n ≥ 4 makes it computationally unfeasible even selecting the most frequent n-grams. In this paper, we demonstrate the feasibility and usefulness of using high-order n-grams for n = 4, 5, 6, 7 in SVM-based phonotactic language recognition, thanks to a dynamic n-gram selection algorithm. The most frequent n-grams are selected, but computational issues (those regarding memory requirements) are prevented, since counts are periodically updated and only those units with the highest counts are retained for subsequent processing. Systems were built by means of open software (Brno University of Technology phone decoders, HTK, LIBLINEAR and FoCal) and experiments were carried out on the NIST LRE2007 database. Applying the proposed approach, a 1.36% EER was achieved when using up to 4-grams, 1.32% EER when using up to 5-grams (11.2% improvement with regard to using up to 3-grams) and 1.34% EER when using up to 6-grams or 7-grams.
Keywords
speech recognition; EER; SVM-based phonotactic language recognition systems; high order N-grams; open software; Databases; Decoding; Heuristic algorithms; Lattices; NIST; Speech; Support vector machines; Feature Selection; Phonotactic Language Recognition; SVM; high-order n-grams;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947332
Filename
5947332
Link To Document