Title :
Multiple kernel learning for speaker verification
Author :
Longworth, C. ; Gales, M.J.F.
Author_Institution :
Dept. of Eng., Cambridge Univ., Cambridge
fDate :
March 31 2008-April 4 2008
Abstract :
Many speaker verification (SV) systems combine multiple classifiers using score-fusion to improve system performance. For SVM classifiers, an alternative strategy is to combine at the kernel level. This involves finding a suitable kernel weighting, known as multiple kernel learning (MKL). Recently, an efficient maximum-margin scheme for MKL has been proposed. This work examines several refinements to this scheme for SV. The standard scheme has a known tendency towards sparse weightings, which may not be optimal for SV. A regularisation term is proposed, allowing the appropriate level of sparsity to be selected. Cross-speaker tying of kernel weights is also applied to improve robustness. Various combinations of dynamic kernels were evaluated, including derivative and parametric kernels based upon different model structures. The performance achieved on the NIST 2002 SRE when combining five kernels was 4.83% EER.
Keywords :
pattern classification; speaker recognition; multiple classifiers; multiple kernel learning; score-fusion; speaker verification; Kernel; NIST; Robustness; Speaker recognition; Speech; Support vector machine classification; Support vector machines; System performance; Classifier Combination; Dynamic kernels; Speaker recognition; Support Vector Machines;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517926