Regional Information Center for Science and Technology - Feature Space Mahalanobis Sequence Kernels: Application to SVM Speaker Verification

DocumentCode :

1224395

Title :

Feature Space Mahalanobis Sequence Kernels: Application to SVM Speaker Verification

Author :

Louradour, Jérôme ; Daoudi, Khalid ; Bach, Francis

Author_Institution :

Dept. of Comput. Sci. & Oper. Res. (DIRO), Univ. of Montreal, Montreal, QC

Volume :

Issue :

fYear :

2007

Firstpage :

2465

Lastpage :

2475

Abstract :

The generalized linear discriminant sequence (GLDS) kernel has been shown to provide very good performance and efficiency at the NIST Speaker Recognition Evaluations (SRE) in the last few years. This kernel is based on an explicit map of polynomial expansions of input frames which, because of practical limitations, have to be of degree less than or equal to three. In this paper, we consider an extension of the GLDS kernel that allows not only any polynomial degree but also any embedding, including infinite-dimensional ones associated with Mercer kernels (such as Gaussian kernels). It turns out that the resulting kernels belong to the family of posterior covariance kernels. However, their exact "kernelized" form involves the computation of the Gram matrix on background data, and may be intractable when the background corpus is very large (which is the case in speaker verification). To overcome this problem, we use a low-rank approximation of the Gram matrix to provide an approximate but tractable form of these kernels. We then present comparative experiments on NIST SRE 2005. The results show that our sequence kernel outperforms the GLDS kernel and gives (individual) performance similar to that of the traditional universal background model-Gaussian mixture model (UBM-GMM) system. As expected, the fusion of both improves the scores.
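The idea in the abstract can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes a GLDS-style form K(X, Y) = mean(phi(X))^T R^{-1} mean(phi(Y)), where R is the second-moment matrix of the mapped background frames, and it uses a Nyström approximation with randomly chosen landmark frames as a stand-in for the low-rank approximation of the Gram matrix, since this record does not say which method the paper uses. The names `gaussian_kernel`, `fit_low_rank_map`, `sequence_kernel` and the parameters `gamma`, `n_landmarks`, `ridge` are hypothetical.

```python
import numpy as np


def gaussian_kernel(A, B, gamma):
    """Pairwise Gaussian kernel k(a, b) = exp(-gamma * ||a - b||^2)."""
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))


def fit_low_rank_map(background, n_landmarks, gamma, ridge=1e-6, seed=0):
    """Build an approximate finite-dimensional feature map from background frames.

    Assumption: Nystrom approximation on random landmarks (not necessarily
    the low-rank method used in the paper).
    """
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(background), size=n_landmarks, replace=False)
    landmarks = background[idx]

    # Eigendecomposition of the small landmark Gram matrix.
    K_mm = gaussian_kernel(landmarks, landmarks, gamma)
    w, V = np.linalg.eigh(K_mm)
    keep = w > 1e-10 * w.max()            # drop numerically null directions
    proj = V[:, keep] / np.sqrt(w[keep])  # so phi(x) @ phi(y) approximates k(x, y)

    def phi(X):
        return gaussian_kernel(X, landmarks, gamma) @ proj

    # Second-moment matrix of the mapped background frames, regularized
    # with a small ridge term before inversion.
    F = phi(background)
    R = F.T @ F / len(background)
    R_inv = np.linalg.inv(R + ridge * np.eye(R.shape[0]))
    return phi, R_inv


def sequence_kernel(X, Y, phi, R_inv):
    """Mahalanobis-style kernel between two frame sequences of any length."""
    return phi(X).mean(axis=0) @ R_inv @ phi(Y).mean(axis=0)
```

With frames stored as rows of NumPy arrays, `phi, R_inv = fit_low_rank_map(background, n_landmarks=200, gamma=0.5)` is fitted once on the background corpus, after which `sequence_kernel(X, Y, phi, R_inv)` can fill the Gram matrix passed to an SVM that accepts precomputed kernels. The cost of the map grows with the number of landmarks rather than with the size of the background corpus, which is the point of the low-rank step.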

Keywords :

Gaussian processes; approximation theory; covariance matrices; polynomials; sequences; speaker recognition; support vector machines; Gaussian kernel; Gram matrix; Mahalanobis sequence kernel; Mercer kernel; SVM; covariance kernel; feature space; generalized linear discriminant sequence kernel; low-rank approximation; polynomial; speaker recognition evaluation; speaker verification; support vector machine; Covariance matrix; Feature extraction; Kernel; Loudspeakers; Monitoring; NIST; Polynomials; Speaker recognition; Support vector machine classification; Support vector machines; Sequence kernel; speaker verification; support vector machines (SVMs);

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2007.905147

Filename :

4317571

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1224395