Title :
Open-set semi-supervised audio-visual speaker recognition using co-training LDA and Sparse Representation Classifiers
Author :
Xuran Zhao ; Evans, Noah ; Dugelay, Jean-Luc
Author_Institution :
Dept. of Multimedia Commun., EURECOM, Biot, France
Abstract :
Semi-supervised learning is attracting growing interest within the biometrics community. Almost all prior work focuses on closed-set scenarios, in which samples labelled automatically are assumed to belong to an enrolled class. This is often not the case in realistic applications and thus open-set alternatives are needed. This paper proposes a new approach to open-set, semi-supervised learning based on co-training, Linear Discriminant Analysis (LDA) subspaces and Sparse Representation Classifiers (SRCs). Experiments on the standard MOBIO dataset show how the new approach can utilize automatically labelled data to augment a smaller, manually labelled dataset and thus improve the performance of an open-set audio-visual person recognition system.
Keywords :
audio-visual systems; biometrics (access control); learning (artificial intelligence); pattern classification; sparse matrices; speaker recognition; LDA classifiers; LDA subspaces; SRC; biometrics community; closed- set scenarios; linear discriminant analysis subspaces; open-set alternatives; open-set audio-visual person recognition system; open-set semisupervised audio-visual speaker recognition; sparse representation classifiers; standard MOBIO dataset; Biometrics (access control); Face; Feature extraction; Principal component analysis; Semisupervised learning; Standards; Training; Semi-supervised learning; co-training; multimodal biometrics; open-set identification;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6638208