Discriminative learning for optimizing detection performance in spoken language recognition

Author

Zhu, Donglai ; Li, Haizhou ; Ma, Bin ; Lee, Chin-Hui

Author_Institution

Inst. for Infocomm Res., Singapore

fYear

2008

fDate

March 31 2008-April 4 2008

Firstpage

4161

Lastpage

4164

Abstract

We propose novel approaches for optimizing the detection performance in spoken language recognition. Two objective functions are designed to directly relate model parameters to two performance metrics of interest, the detection cost function and the area under the detection-error-tradeoff curve, respectively. Both metrics are approximated with differentiable functions of model parameters by using a smoothing function based on a class misclassification measure. The model parameters are optimized by using the generalized probabilistic descent algorithm. We conduct experiments on the NIST 2003 and 2005 Language Recognition Evaluation corpora. Results show that the proposed approaches effectively improve the performance over the maximum likelihood training approach.

Keywords

Gaussian processes; learning (artificial intelligence); natural language processing; speech recognition; Gaussian mixture model; NIST Language Recognition Evaluation corpora; class misclassification measure; detection cost function; detection performance optimization; detection-error-tradeoff curve; differentiable functions; discriminative learning; generalized probabilistic descent algorithm; maximum likelihood training approach; model parameters; smoothing function; spoken language recognition; Cost function; Detectors; Error analysis; Maximum likelihood detection; NIST; Natural languages; Pattern recognition; Smoothing methods; Speech recognition; Support vector machines; Gaussian mixture model; detection cost function; detection error tradeoff; discriminative learning; spoken language recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on

Conference_Location

Las Vegas, NV

ISSN

1520-6149

Print_ISBN

978-1-4244-1483-3

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2008.4518571

Filename

4518571