Title :
Spoken Language recognition using support vector machines with generative front-end
Author :
Lee, Kong-Aik ; You, Changhuai ; Li, Haizhou
Author_Institution :
Inst. for Infocomm Res. (I2R), Agency for Sci. Technol. & Res. (A*STAR), Singapore
fDate :
March 31 2008-April 4 2008
Abstract :
This paper introduces a spoken language recognition system with a generative front-end and a discriminative backend. The generative front-end is built upon an ensemble of Gaussian densities. These Gaussian densities are trained to represent elementary speech sound units characterizing a wide variety of languages. We formulate the generative front-end in a form of sequence kernel. This sequence kernel transforms a spoken utterance into a feature vector with its attributes representing the occurrence statistics of the speech sound units. A discriminative support vector machine (SVM) then operates on the feature vectors to make classification decision. The proposed language recognition system demonstrates competitive performance on NIST 1996, 2003 and 2005 LRE corpora.
Keywords :
Gaussian processes; decision making; natural languages; speech recognition; support vector machines; 2005 LRE corpora; Gaussian density; NIST 1996; NIST 2003; classification decision making; generative front-end; occurrence statistics; sequence kernel transform; spoken language recognition; support vector machine; Decoding; Hidden Markov models; Kernel; NIST; Natural languages; Speech recognition; Statistics; Support vector machine classification; Support vector machines; Testing; Language recognition; sequence kernel; support vector machine;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518569