• DocumentCode
    542280
  • Title

    Automatic speechreading with application to speaker verification

  • Author

    Broun, C.C. ; Zhang, X. ; Mersereau, R.M. ; Clements, M.

  • Author_Institution
    Motorola Human Interface Labs, Tempe, Arizona 85284, USA
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    Speech not only conveys the linguistic information, but also characterizes the talker´s identity and therefore can be used in personal authentication. While most of the speech information is contained in the acoustic channel, the lip movement during speech production also provides useful information. In this paper we investigate the effectiveness of visual speech features in a speaker veri£cation task. We £rst present the visual front-end of the automatic speechreading system. We then develop a recognition engine to train and recognize sequences of visual parameters. The experimental results based on the XM2VTS database [I] demonstrate that visual information is highly effective in reducing both false acceptance and false rejection rates in speaker veri£cation tasks.
  • Keywords
    Polynomials; Signal to noise ratio; Tongue;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743810
  • Filename
    5743810