Title :
Spoken Language Recognition: From Fundamentals to Practice
Author :
Haizhou Li ; Bin Ma ; Kong Aik Lee
Author_Institution :
Inst. for Infocomm Res., Agency for Sci., Technol. & Res. (A*STAR), Singapore, Singapore
fDate :
5/1/2013 12:00:00 AM
Abstract :
Spoken language recognition refers to the automatic process through which we determine or verify the identity of the language spoken in a speech sample. We study a computational framework that allows such a decision to be made in a quantitative manner. In recent decades, we have made tremendous progress in spoken language recognition, which benefited from technological breakthroughs in related areas, such as signal processing, pattern recognition, cognitive science, and machine learning. In this paper, we attempt to provide an introductory tutorial on the fundamentals of the theory and the state-of-the-art solutions, from both phonological and computational aspects. We also give a comprehensive review of current trends and future research directions using the language recognition evaluation (LRE) formulated by the National Institute of Standards and Technology (NIST) as the case studies.
Keywords :
speech recognition; cognitive science; computational aspect; computational framework; language recognition evaluation; machine learning; pattern recognition; phonological aspect; signal processing; spoken language recognition; technological breakthrough; Acoustic signal processing; Classification; Databases; Information processing; NIST; Natural language processing; Semantics; Speech processing; Speech recognition; Acoustic features; calibration; classifier; fusion; language recognition evaluation (LRE); phonotactic features; spoken language recognition; tokenization; vector space modeling;
Journal_Title :
Proceedings of the IEEE
DOI :
10.1109/JPROC.2012.2237151