DocumentCode :
1092895
Title :
Synergy of Lip-Motion and Acoustic Features in Biometric Speech and Speaker Recognition
Author :
Faraj, Maycel-Isaac ; Bigun, Josef
Author_Institution :
Halmstad Univ., Halmstad
Volume :
56
Issue :
9
fYear :
2007
Firstpage :
1169
Lastpage :
1175
Abstract :
This paper presents the scheme and evaluation of a robust audio-visual digit-and-speaker-recognition system using lip motion and speech biometrics. Moreover, a liveness verification barrier based on a person´s lip movement is added to the system to guard against advanced spoofing attempts such as replayed videos. The acoustic and visual features are integrated at the feature level and evaluated first by a support vector machine for digit and speaker identification and, then, by a Gaussian mixture model for speaker verification. Based on ap300 different personal identities, this paper represents, to our knowledge, the first extensive study investigating the added value of lip motion features for speaker and speech-recognition applications. Digit recognition and person-identification and verification experiments are conducted on the publicly available XM2VTS database showing favorable results (speaker verification is 98 percent, speaker identification is 100 percent, and digit identification is 83 percent to 100 percent).
Keywords :
Gaussian processes; acoustic signal processing; image motion analysis; speaker recognition; support vector machines; video signal processing; Gaussian mixture model; XM2VTS database; audio-visual digit-and-speaker-recognition system; biometric speech recognition; digit recognition; lip-motion synergy; liveness verification barrier; person-identification; speaker verification; support vector machine; Biometrics; Hidden Markov models; Loudspeakers; Mouth; Shape; Speaker recognition; Speech analysis; Speech recognition; Support vector machine classification; Support vector machines; GMM; SVM; Speech recognition; biometrics; lip motion; lip reading; motion estimation; normal image flow; normal image velocity; speaker recognition;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/TC.2007.1074
Filename :
4288084
Link To Document :
بازگشت