• DocumentCode
    590666
  • Title

    GIF-LR:GA-based informative feature for lipreading

  • Author

    Ukai, Naoya ; Seko, T. ; Tamura, Shinji ; Hayamizu, Satoru

  • Author_Institution
    Dept. of Inf. Sci., Gifu Univ., Gifu, Japan
  • fYear
    2012
  • fDate
    3-6 Dec. 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper, we propose a general and discriminative feature “GIF” (GA-based Informative Feature), and apply the feature to lipreading (visual speech recognition). The feature extraction method consists of two transforms, that convert an input vector to GIF for recognition. The transforms can be computed using training data and Genetic Algorithm (GA). For lipreading, we extract a fundamental feature as an input vector from an image; the vector consists of intensity values at all the pixels in an input lip image, which are enumerated from left-top to right-bottom. Recognition experiments of continuous digit utterances were conducted using an audio-visual corpus including more than 268,000 lip images. The recognition results show that the GIF-based method is better than the baseline method using eigenlip features.
  • Keywords
    feature extraction; genetic algorithms; image recognition; speech recognition; GA based informative feature; GIF-LR; discriminative feature; eigenlip feature; feature extraction method; genetic algorithm; lipreading; training data; visual speech recognition; Accuracy; Feature extraction; Genetic algorithms; Hidden Markov models; Speech recognition; Vectors; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
  • Conference_Location
    Hollywood, CA
  • Print_ISBN
    978-1-4673-4863-8
  • Type

    conf

  • Filename
    6411813