Title :
Short segment automatic language identification using a multifeature-transition matrix approach
Author :
Grieco, John J. ; Pomales, E.O.
Author_Institution :
Air Force Res. Lab., Rome, NY, USA
Abstract :
This paper focuses on a new technique for automatic language identification (ALID). The primary goal of this endeavor is to develop a technique which requires a minimal amount of training data and can operate on very short segments of speech which also has the flexibility to add new languages in an easy fashion. A secondary goal of this effort is to create an algorithm requiring low computation. A new approach for language identification, based on multi-feature (MF), multi-classifier (MC) transition matrices is presented. This approach not only models the static acoustic components of a language, but also the dynamics of sub-sound to sub-sound transitions within a language. The transition matrix concept not only is performance competitive with other techniques found in the literature, but also is particularly suited for the short segment problem. Closed set experiments on the 3 second segments of the 1996 NIST Language Identification Evaluation database show the MF/MC transition matrix technique performance to be promising.
Keywords :
feature extraction; matrix algebra; natural languages; speech processing; speech recognition; ALID; MF/MC transition matrix technique performance; NIST Language Identification Evaluation database; algorithm computation; language addition flexibility; language static acoustic components; multi-feature multi-classifier transition matrices; multifeature-transition matrix approach; short segment automatic language identification; speech segments; sub-sound to sub-sound transition dynamics; training data; Databases; Laboratories; NIST; Natural languages; Speech; Statistics; Stochastic processes; Target recognition; Testing; Training data;
Conference_Titel :
Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
Print_ISBN :
0-7803-7761-3
DOI :
10.1109/ISCAS.2003.1205123