DocumentCode :
865633
Title :
A Vector Space Modeling Approach to Spoken Language Identification
Author :
Li, Haizhou ; Ma, Bin ; Lee, Chin-Hui
Author_Institution :
Inst. for Infocomm Res., Singapore
Volume :
15
Issue :
1
fYear :
2007
Firstpage :
271
Lastpage :
284
Abstract :
We propose a novel approach to automatic spoken language identification (LID) based on vector space modeling (VSM). It is assumed that the overall sound characteristics of all spoken languages can be covered by a universal collection of acoustic units, which can be characterized by the acoustic segment models (ASMs). A spoken utterance is then decoded into a sequence of ASM units. The ASM framework furthers the idea of language-independent phone models for LID by introducing an unsupervised learning procedure to circumvent the need for phonetic transcription. Analogous to representing a text document as a term vector, we convert a spoken utterance into a feature vector with its attributes representing the co-occurrence statistics of the acoustic units. As such, we can build a vector space classifier for LID. The proposed VSM approach leads to a discriminative classifier backend, which is demonstrated to give superior performance over likelihood-based n-gram language modeling (LM) backend for long utterances. We evaluated the proposed VSM framework on 1996 and 2003 NIST Language Recognition Evaluation (LRE) databases, achieving an equal error rate (EER) of 2.75% and 4.02% in the 1996 and 2003 LRE 30-s tasks, respectively, which represents one of the best results reported on these popular tasks
Keywords :
decoding; natural language processing; speech coding; speech recognition; unsupervised learning; acoustic segment models; automatic spoken language identification; decoding; likelihood-based n-gram language modeling; phonetic transcription; sound characteristics; spoken utterance; unsupervised learning; vector space classifier; vector space modeling approach; Artificial neural networks; Decoding; Hidden Markov models; NIST; Natural languages; Space technology; Speech recognition; Statistics; Support vector machines; Unsupervised learning; Acoustic segment models (ASMs); artificial neural network (ANN); spoken language identification; support vector machine (SVM); text categorization; vector space model (VSM);
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2006.876860
Filename :
4032773
Link To Document :
بازگشت