• DocumentCode
    2173435
  • Title

    A dynamic approach to the selection of high order n-grams in phonotactic language recognition

  • Author

    Penagarikano, Mikel ; Varona, Amparo ; Rodriguez-Fuentes, Luis Javier ; Bordel, German

  • Author_Institution
    Dept. of Electr. & Electron., Univ. of the Basque Country, Bilbao, Spain
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4412
  • Lastpage
    4415
  • Abstract
    Due to computational bounds, most SVM-based phonotactic language recognition systems consider only low-order n-grams (up to n = 3), thus limiting the potential performance of this approach. The huge amount of n-grams for n ≥ 4 makes it computationally unfeasible even selecting the most frequent n-grams. In this paper, we demonstrate the feasibility and usefulness of using high-order n-grams for n = 4, 5, 6, 7 in SVM-based phonotactic language recognition, thanks to a dynamic n-gram selection algorithm. The most frequent n-grams are selected, but computational issues (those regarding memory requirements) are prevented, since counts are periodically updated and only those units with the highest counts are retained for subsequent processing. Systems were built by means of open software (Brno University of Technology phone decoders, HTK, LIBLINEAR and FoCal) and experiments were carried out on the NIST LRE2007 database. Applying the proposed approach, a 1.36% EER was achieved when using up to 4-grams, 1.32% EER when using up to 5-grams (11.2% improvement with regard to using up to 3-grams) and 1.34% EER when using up to 6-grams or 7-grams.
  • Keywords
    speech recognition; EER; SVM-based phonotactic language recognition systems; high order N-grams; open software; Databases; Decoding; Heuristic algorithms; Lattices; NIST; Speech; Support vector machines; Feature Selection; Phonotactic Language Recognition; SVM; high-order n-grams;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947332
  • Filename
    5947332