DocumentCode
33390
Title
Using Polynomial Kernel Support Vector Machines for Speaker Verification
Author
Yaman, Sibel ; Pelecanos, Jason
Author_Institution
Stat. Biometrics Group, IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
20
Issue
9
fYear
2013
fDate
Sept. 2013
Firstpage
901
Lastpage
904
Abstract
In this letter, we propose a discriminative modeling approach for the speaker verification problem that uses polynomial kernel support vector machines (PK-SVMs). The proposed approach is rooted in an equivalence relationship between the state-of-the-art probabilistic linear discriminant analysis (PLDA) and second degree polynomial kernel methods. We present two techniques for overcoming the memory and computational challenges that PK-SVMs pose. The first of these, a kernel evaluation simplification trick, eliminates the need to explicitly compute dot products for a huge number of training samples. The second technique makes use of the massively parallel processing power of modern graphical processing units. We performed experiments on the Phase I speaker verification track of the DARPA sponsored Robust Automatic Transcription of Speech (RATS) program. We found that, in the multi-session enrollment experiments, second degree PK-SVMs outperformed PLDA across all tasks in terms of the official evaluation metric, and third and fourth degree PK-SVMs provided a performance improvement over the second degree PK-SVMs. Furthermore, for the “30s-30s” task, a linear score combination between the PLDA and PK-SVM based systems provided 27% improvement relative to the PLDA baseline in terms of the official evaluation metric.
Keywords
polynomials; probability; speaker recognition; modern graphical processing units; multisession enrollment; polynomial kernel support vector machines; probabilistic linear discriminant analysis; robust automatic transcription; second degree polynomial kernel methods; speaker verification; Graphics processing units; Kernel; Mathematical model; Polynomials; Support vector machines; Training; Vectors; Probabilistic linear discriminant analysis; speaker verification; support vector machines;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2013.2273127
Filename
6557422
Link To Document