• DocumentCode
    239579
  • Title

    Sparse coding based lip texture representation for visual speaker identification

  • Author

    Jun-Yao Lai ; Shi-Lin Wang ; Xing-Jian Shi ; Liew, Alan Wee-Chung

  • Author_Institution
    Sch. of EIEE, Shanghai Jiao Tong Univ., Shanghai, China
  • fYear
    2014
  • fDate
    20-23 Aug. 2014
  • Firstpage
    607
  • Lastpage
    610
  • Abstract
    Recent research has shown that a speaker's lip shape and movement contain rich identity-related information and can be used for speaker identification and authentication. Among the static lip features, the lip texture (the intensity variation inside the outer lip contour) has high discriminative power for differentiating speakers. However, existing lip texture feature representations cannot describe the texture information adequately and give unsatisfactory identification results. In this paper, a sparse representation of the lip texture is proposed and a corresponding visual speaker identification scheme is presented. In the training stage, a sparse dictionary is built for each speaker from that speaker's texture samples. In the testing stage, the lip texture information is extracted from the lip image under investigation and the reconstruction error under each speaker's dictionary is calculated. The lip image is assigned to the speaker whose dictionary gives the minimum reconstruction error. The experimental results show that the proposed sparse coding based scheme achieves much better identification accuracy (91.37% for isolated images and 98.21% for image sequences) than several state-of-the-art methods when only the lip texture information is considered.
  • Keywords
    speaker recognition; speech coding; identity-related information; lip texture feature representations; lip texture information; reconstruction error; sparse coding based lip texture representation; sparse coding based scheme; sparse dictionary; sparse representation; speaker authentication; speaker identification scheme; texture information; visual speaker identification; Accuracy; Dictionaries; Digital signal processing; Encoding; Image reconstruction; Shape; Visualization; Lip texture; lip biometrics; sparse coding; visual speaker identification
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Digital Signal Processing (DSP), 2014 19th International Conference on
  • Conference_Location
    Hong Kong
  • Type

    conf

  • DOI
    10.1109/ICDSP.2014.6900736
  • Filename
    6900736
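
The scheme summarized in the abstract (one dictionary per speaker, identification by minimum reconstruction error) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the record does not say how the dictionaries are learned or which sparse solver is used, so the sketch simply takes normalized training samples as dictionary atoms and uses a small orthogonal matching pursuit routine. All function names (`build_dictionary`, `omp`, `identify`) and the sparsity level `n_nonzero` are hypothetical, and the feature vectors stand in for whatever lip texture descriptor is actually extracted.

```python
import numpy as np


def build_dictionary(samples):
    """Assumption: use the speaker's training vectors (columns) as atoms,
    scaled to unit norm. The paper learns a sparse dictionary instead."""
    norms = np.linalg.norm(samples, axis=0, keepdims=True)
    return samples / np.maximum(norms, 1e-12)


def omp(D, x, n_nonzero):
    """Orthogonal matching pursuit: approximate x with at most
    n_nonzero atoms (columns) of D. Returns the coefficient vector."""
    coef = np.zeros(D.shape[1])
    residual = x.copy()
    support = []
    sol = None
    for _ in range(n_nonzero):
        # Pick the atom most correlated with the current residual.
        idx = int(np.argmax(np.abs(D.T @ residual)))
        if idx in support:
            break
        support.append(idx)
        # Refit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
        if np.linalg.norm(residual) < 1e-10:
            break
    if support:
        coef[support] = sol
    return coef


def reconstruction_error(D, x, n_nonzero=3):
    """Euclidean norm of x minus its sparse reconstruction from D."""
    return float(np.linalg.norm(x - D @ omp(D, x, n_nonzero)))


def identify(dictionaries, x, n_nonzero=3):
    """Return the speaker whose dictionary reconstructs x with the
    smallest error, together with all per-speaker errors."""
    errors = {
        speaker: reconstruction_error(D, x, n_nonzero)
        for speaker, D in dictionaries.items()
    }
    return min(errors, key=errors.get), errors


if __name__ == "__main__":
    # Synthetic stand-in data: each speaker's "texture vectors" lie in a
    # different random 3-dimensional subspace of a 20-dimensional space.
    rng = np.random.default_rng(0)
    basis = {s: rng.standard_normal((20, 3)) for s in ("A", "B")}
    dictionaries = {
        s: build_dictionary(B @ rng.standard_normal((3, 10)))
        for s, B in basis.items()
    }
    probe = basis["A"] @ rng.standard_normal(3)  # unseen sample of speaker A
    speaker, errors = identify(dictionaries, probe)
    print(speaker, errors)
```

How per-image decisions are combined for an image sequence is not stated in this record, so the sketch covers only the single-image case.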