• DocumentCode
    396227
  • Title

    Short segment automatic language identification using a multifeature-transition matrix approach

  • Author

    Grieco, John J. ; Pomales, E.O.

  • Author_Institution
    Air Force Res. Lab., Rome, NY, USA
  • Volume
    3
  • fYear
    2003
  • fDate
    25-28 May 2003
  • Abstract
    This paper focuses on a new technique for automatic language identification (ALID). The primary goal of this endeavor is to develop a technique which requires a minimal amount of training data and can operate on very short segments of speech which also has the flexibility to add new languages in an easy fashion. A secondary goal of this effort is to create an algorithm requiring low computation. A new approach for language identification, based on multi-feature (MF), multi-classifier (MC) transition matrices is presented. This approach not only models the static acoustic components of a language, but also the dynamics of sub-sound to sub-sound transitions within a language. The transition matrix concept not only is performance competitive with other techniques found in the literature, but also is particularly suited for the short segment problem. Closed set experiments on the 3 second segments of the 1996 NIST Language Identification Evaluation database show the MF/MC transition matrix technique performance to be promising.
  • Keywords
    feature extraction; matrix algebra; natural languages; speech processing; speech recognition; ALID; MF/MC transition matrix technique performance; NIST Language Identification Evaluation database; algorithm computation; language addition flexibility; language static acoustic components; multi-feature multi-classifier transition matrices; multifeature-transition matrix approach; short segment automatic language identification; speech segments; sub-sound to sub-sound transition dynamics; training data; Databases; Laboratories; NIST; Natural languages; Speech; Statistics; Stochastic processes; Target recognition; Testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
  • Print_ISBN
    0-7803-7761-3
  • Type

    conf

  • DOI
    10.1109/ISCAS.2003.1205123
  • Filename
    1205123