Title :
Language Identification: A Tutorial
Author :
Ambikairajah, Eliathamby ; Li, Haizhou ; Wang, Liang ; Yin, Bo ; Sethu, Vidhyasaharan
Author_Institution :
Univ. of New South Wales, Sydney, NSW, Australia
Abstract :
This tutorial presents an overview of the progression of spoken language identification (LID) systems and current developments. The introduction provides a background on automatic language identification systems using syntactic, morphological, and in particular, acoustic, phonetic, phonotactic and prosodic level information. Different frontend features that are used in LID systems are presented. Several normalization and language modelling techniques have also been presented. We also discuss different LID system architectures that embrace a variety of front-ends and back-ends, and configurations such as hierarchical and fusion classifiers. Evaluations of the LID system are presented using NIST language recognition evaluation tasks.
Keywords :
natural language processing; speech recognition; LID system architectures; NIST language recognition; automatic language identification systems; language modelling techniques; spoken language identification; Automatic speech recognition; Identifications; Mel frequency cepstral coefficient; Speech recognition; Tutorials;
Journal_Title :
Circuits and Systems Magazine, IEEE
DOI :
10.1109/MCAS.2011.941081