Title :
Soft margin estimation of Gaussian mixture model parameters for spoken language recognition
Author :
Zhu, Donglai ; Ma, Bin ; Li, Haizhou
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
Abstract :
This paper extends our previous work on large margin estimation (LME) of GMM parameters with extend Baum-Welch (EBW) for spoken language recognition. To overcome the problem in the LME that negative samples in the training set are not used in parameter estimation, we propose a soft margin estimation (SME) method in this paper. The soft margin is scaled by a loss function measuring the distance between a negative sample and the classification boundary. We formulate the constrained optimization of SME as an unconstrained optimization among both positive samples and negative samples using a penalty function, and update the GMM parameters with the EBW algorithm. Experiments on the NIST language recognition evaluation (LRE) 2007 task show that the SME method effectively improves the LME performance.
Keywords :
Gaussian processes; optimisation; parameter estimation; speech recognition; EBW algorithm; Gaussian mixture model parameters; NIST language recognition evaluation task; SME constrained optimization; classification boundary; extended Baum-Welch; large margin estimation; loss function; parameter estimation; penalty function; soft margin estimation method; spoken language recognition; Constraint optimization; Hamming distance; Hidden Markov models; Maximum likelihood estimation; NIST; Natural languages; Parameter estimation; Support vector machine classification; Support vector machines; Training data; extended Baum-Welch; soft margin estimation; spoken language recognition;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495079