DocumentCode
239579
Title
Sparse coding based lip texture representation for visual speaker identification
Author
Jun-Yao Lai ; Shi-Lin Wang ; Xing-Jian Shi ; Liew, Alan Wee-Chung
Author_Institution
Sch. of EIEE, Shanghai Jiao Tong Univ., Shanghai, China
fYear
2014
fDate
20-23 Aug. 2014
Firstpage
607
Lastpage
610
Abstract
Recent research has shown that the speaker´s lip shape and movement contain rich identity-related information and can be adopted for speaker identification and authentication. Among all the static lip features, the lip texture (intensity variation inside the outer lip contour) is of high discriminative power to differentiate various speakers. However, the existing lip texture feature representations cannot describe the texture information adequately and provide unsatisfactory identification results. In this paper, a sparse representation of the lip texture is proposed and a corresponding visual speaker identification scheme is presented. In the training stage, a sparse dictionary is built based on the texture samples for each speaker. In the testing stage, for any lip image investigated, the lip texture information is extracted and the reconstruction errors using all the dictionaries for every speaker are calculated. The lip image is identified to the speaker with the minimum reconstruction error. The experimental results show that the proposed sparse coding based scheme can achieve much better identification accuracy (91.37% for isolate image and 98.21% for image sequence) compared with several state-of-the-art methods when considering the lip texture information only.
Keywords
speaker recognition; speech coding; identity-related information; lip texture feature representations; lip texture information; reconstruction error; sparse coding based lip texture representation; sparse coding based scheme; sparse dictionary; sparse representation; speaker authentication; speaker identification scheme; texture information; visual speaker identification; Accuracy; Dictionaries; Digital signal processing; Encoding; Image reconstruction; Shape; Visualization; Lip texture; lip biometrics; sparse coding; visual speaker identificaiton;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Signal Processing (DSP), 2014 19th International Conference on
Conference_Location
Hong Kong
Type
conf
DOI
10.1109/ICDSP.2014.6900736
Filename
6900736
Link To Document