DocumentCode :
3110573
Title :
A Correction of Missing Reliability for Robust Bimodal Speaker Identification
Author :
Tariquzzaman, Md ; Kim, Jin Young ; Na, Seung You
Author_Institution :
Sch. of Electron. & Comput. Eng., Chonnam Nat. Univ., Gwangju, South Korea
fYear :
2009
fDate :
16-18 Dec. 2009
Firstpage :
239
Lastpage :
243
Abstract :
Speaker identification in real environment is a key issue in biometrics technology for human computer interaction. In this paper, we propose a fuzzy membership function for adaptive threshold in different modalities reliability measure for robust bimodal speaker identification. In the bimodal speaker identification system, we will also propose an extension of a modified convection reliability function applied to both the audio and lip information to account optimal reliability simultaneously for audio and visual information integration. For creating mismatch in between train and test data, babble noises and artificial illumination have been added to test speeches and lip images, respectively. Local PCA have been applied at features level to both classifiers system for reducing the dimension of feature vector at different stage of signal distortion. We have applied particle swarm optimization (PSO) for optimizing the proposed fuzzy based adaptive threshold and modified convection function´s optimizing parameters. The entire speaker identification experiments have been performed using VidTimit database. Experimental results show that our proposed method enhanced the identification accuracy in comparison with the baseline system thus demonstrated the validation of the proposed approach and most notably maintains the consistency of the integration process.
Keywords :
audio signal processing; biometrics (access control); distortion; fuzzy set theory; human computer interaction; particle swarm optimisation; principal component analysis; speaker recognition; PCA; VidTimit database; artificial illumination; audio information integration; babble noises; biometrics technology; convection reliability function; fuzzy based adaptive threshold; fuzzy membership function; human computer interaction; identification accuracy; lip information; particle swarm optimization; robust bimodal speaker identification; signal distortion; visual information integration; Biometrics; Distortion; Human computer interaction; Lighting; Particle swarm optimization; Principal component analysis; Robustness; Spatial databases; Speech; Testing; fuzzy membership function; local PCA; optimal reliability; particle swarm optimization; speaker identification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Multimedia Technology, 2009. ICIMT '09. International Conference on
Conference_Location :
Jeju Island
Print_ISBN :
978-0-7695-3922-5
Type :
conf
DOI :
10.1109/ICIMT.2009.77
Filename :
5381210
Link To Document :
بازگشت