Title :
Speaker authentication using video-based lip information
Author :
Goswami, B. ; Chan, C. ; Kittler, J. ; Christmas, W.
Author_Institution :
FEPS, Univ. of Surrey, Guildford, UK
Abstract :
The lip-region can be interpreted as either a genetic or behavioural biometric trait depending on whether static or dynamic information is used. In this paper, we use a texture descriptor called Local Ordinal Contrast Pattern (LOCP) in conjunction with a novel spatiotemporal sampling method called Windowed Three Orthogonal Planes (WTOP) to represent both appearance and dynamics features ob served in visual speech. This representation, with standard speaker verification engines, is shown to improve the performance of the lip biometric trait compared to the state-of-the-art. The improvement obtained suggests that there is enough discriminative information in the mouth-region to enable its use as a primary biometric as opposed to a "soft" biometric trait.
Keywords :
audio-visual systems; biometrics (access control); spatiotemporal phenomena; speaker recognition; video signal processing; behavioural biometric trait; dynamic video information; genetic biometric trait; lip; local ordinal contrast pattern; spatiotemporal sampling method; speaker authentication; static video information; texture descriptor; visual speech; windowed three orthogonal planes; Databases; Feature extraction; Histograms; Pixel; Spatiotemporal phenomena; Speech; Yttrium; Biometrics; lip; spatiotemporal;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5946880