DocumentCode
396227
Title
Short segment automatic language identification using a multifeature-transition matrix approach
Author
Grieco, John J. ; Pomales, E.O.
Author_Institution
Air Force Res. Lab., Rome, NY, USA
Volume
3
fYear
2003
fDate
25-28 May 2003
Abstract
This paper presents a new technique for automatic language identification (ALID). The primary goal is a technique that requires minimal training data, operates on very short segments of speech, and allows new languages to be added easily. A secondary goal is an algorithm with low computational cost. The new approach is based on multi-feature (MF), multi-classifier (MC) transition matrices. It models not only the static acoustic components of a language but also the dynamics of sub-sound to sub-sound transitions within that language. The transition matrix concept is competitive in performance with other techniques in the literature and is particularly suited to the short-segment problem. Closed-set experiments on the 3-second segments of the 1996 NIST Language Identification Evaluation database show promising performance for the MF/MC transition matrix technique.
Keywords
feature extraction; matrix algebra; natural languages; speech processing; speech recognition; ALID; MF/MC transition matrix technique performance; NIST Language Identification Evaluation database; algorithm computation; language addition flexibility; language static acoustic components; multi-feature multi-classifier transition matrices; multifeature-transition matrix approach; short segment automatic language identification; speech segments; sub-sound to sub-sound transition dynamics; training data; Databases; Laboratories; NIST; Natural languages; Speech; Statistics; Stochastic processes; Target recognition; Testing; Training data;
fLanguage
English
Publisher
IEEE
Conference_Titel
Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
Print_ISBN
0-7803-7761-3
Type
conf
DOI
10.1109/ISCAS.2003.1205123
Filename
1205123