• DocumentCode
    534976
  • Title
    An investigation on the usage of image quality assessment in visual speech recognition
  • Author
    Banitalebi, Amin ; Moosaei, Maryam ; Hossein-Zadeh, Gholam Ali
  • Author_Institution
    Control & Intell. Process. Center of Excellence, Univ. of Tehran, Tehran, Iran
  • Volume
    5
  • fYear
    2010
  • fDate
    16-18 Oct. 2010
  • Firstpage
    2327
  • Lastpage
    2331
  • Abstract
    A robust speech recognition scheme that can be relied upon in different environments is a strong requirement for modern systems. Previous work in the field of lipreading has mainly applied a segmentation stage at the beginning and then used the structure of the mouth, the facial muscles of the speaker, some critical points on the lips, or the motion of these points for word recognition. In this paper we present a novel way of processing the video signal for lipreading applications. We use neither a segmentation stage nor the extraction of important facial points. Instead, we use HVS (human visual system) based image quality metrics, in particular complex wavelet structural similarity (CW-SSIM) and visual information fidelity (VIF), as our similarity criteria. We use an intelligent frame-by-frame video comparison technique and apply these metrics within it. Experimental results show that, in comparison to other methods, this novel method can recognize the true letter among the letters of the utilized dictionary with acceptable accuracy.
  • Keywords
    speech recognition; video signal processing; wavelet transforms; CW-SSIM; VIF; complex wavelet structural similarity; human visual system; image quality assessment; lipreading; video signal; visual information fidelity; visual speech recognition; word recognition; Accuracy; Dictionaries; Feature extraction; Measurement; Speech; Speech recognition; Visualization; Lipreading; Visual Information Fidelity; Visual Word Recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Image and Signal Processing (CISP), 2010 3rd International Congress on
  • Conference_Location
    Yantai
  • Print_ISBN
    978-1-4244-6513-2
  • Type
    conf
  • DOI
    10.1109/CISP.2010.5646210
  • Filename
    5646210