DocumentCode :
398693
Title :
Multimodal speaker identification with audio-video processing
Author :
Yemez, E. ; Kanak, A. ; Erzin, E. ; Tekalp, A. Murat
Author_Institution :
Coll. of Eng., Koc Univ., Istanbul, Turkey
Volume :
3
fYear :
2003
fDate :
14-17 Sept. 2003
Abstract :
In this paper we present a multimodal audio-visual speaker identification system. The objective is to improve the recognition performance over conventional unimodal schemes. The proposed system decomposes the information existing in a video stream into three components: speech, face texture and lip motion. Lip motion between successive frames is first computed in terms of optical flow vectors and then encoded as a feature vector in a magnitude direction histogram domain. The feature vectors obtained along the whole stream are then interpolated to match the rate of the speech signal and fused with mel frequency cepstral coefficients (MFCC) of the corresponding speech signal. The resulting joint feature vectors are used to train and test a Hidden Markov Model (HMM) based identification system. Face texture images are treated separately in eigenface domain and integrated to the system through decision-fusion. Experimental results are also included for demonstration of the system performance.
Keywords :
audio signal processing; hidden Markov models; image texture; speaker recognition; video signal processing; HMM; audio-video processing; face texture images; feature vector; hidden Markov models; lip motion; magnitude-direction histogram domain; mel frequency cepstral coefficients; multimodal speaker identification; optical flow vectors; speech signals; video stream; Biometrics; Graphics; Hidden Markov models; Histograms; Image databases; Laboratories; Multimedia systems; Signal processing; Speech; Streaming media;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on
ISSN :
1522-4880
Print_ISBN :
0-7803-7750-8
Type :
conf
DOI :
10.1109/ICIP.2003.1247167
Filename :
1247167
Link To Document :
بازگشت