مرکز منطقه ای اطلاع رساني علوم و فناوري - Feature Extraction From Talking Mouths for Video-Based Bi-Modal Speaker Verification

DocumentCode :

2331599

Title :

Feature Extraction From Talking Mouths for Video-Based Bi-Modal Speaker Verification

Author :

Ouyang, Hua ; Lee, Tan ; Chan, W.N.

Author_Institution :

Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Shatin

Volume :

fYear :

2006

fDate :

14-19 May 2006

Abstract :

As the low-cost video transmission becomes popular, video-based bi-modal (audio and visual) authentication has great potential in various applications that require access control over handheld terminals. In this paper, we propose to use the averaged mouth image (AMI) for speaker verification. The AMI is computed by averaging properly aligned mouth images over the whole video sequence. Despite its simplicity, the AMI not only contains appearance information but also describes stylistic articulation gestures of individual speakers. The AMI is found to be fairly invariant against the spoken content. The experimental results show that the AMI based features are very effective in discriminating speaking persons. Explicit and precise extraction of lip contours or other feature points are not required. For bi-modal verification, the proposed video features are found to be highly complementary to the audio features

Keywords :

feature extraction; image sequences; speaker recognition; video signal processing; averaged mouth image; feature extraction; lip contours; low-cost video transmission; talking mouths; video sequence; video-based bi-modal authentication; video-based bi-modal speaker verification; Access control; Ambient intelligence; Authentication; Biometrics; Data mining; Face recognition; Feature extraction; Lips; Mouth; Video sequences;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on

Conference_Location :

Toulouse

ISSN :

1520-6149

Print_ISBN :

1-4244-0469-X

Type :

conf

DOI :

10.1109/ICASSP.2006.1661325

Filename :

1661325

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2331599