• DocumentCode
    1586792
  • Title

    AVAS: Speech database for multimodal recognition applications

  • Author

    Antar, Samar ; Sagheer, Alaa ; Aly, Sherin ; Tolba, M.F.

  • Author_Institution
    Center for Artificial Intell. & Robot. (CAIRO), Aswan Univ., Aswan, Egypt
  • fYear
    2013
  • Firstpage
    123
  • Lastpage
    128
  • Abstract
    Audio-visual speech recognition (AVSR) systems represent an important branch in the human computer interaction (HCI) domain, since it is the simplest way to interact with computer. However, difficulties due to visual variations in video sequence can significantly degrade the recognition performance of AVSR systems. Although several corpuses have been created in this area, most of them are not include realistic visual variations in video sequence. This paper presents the first Audio-Visual Speech recognition corpus using Arabic language denoted as AVAS. All AVAS samples contain two of the most important visual variations; illumination variations and head pose variations, in the same video recording. Hence, AVAS is useful in the development of robust AVSR systems, automatic speech recognition “audio-only” systems, lip-reading “visual-only” systems and face recognition across pose and illumination variations.
  • Keywords
    audio databases; audio-visual systems; human computer interaction; image sequences; natural language processing; speech recognition; video signal processing; AVAS; AVSR systems; Arabic language; HCI domain; audio-only systems; audio-visual speech recognition; automatic speech recognition; face recognition; head pose variations; human computer interaction; illumination variations; lip-reading visual-only systems; multimodal recognition applications; recognition performance; speech database; video recording; video sequence; visual variations; Artificial intelligence; Computers; Educational institutions; Head; Image resolution; Lighting; Tracking; audio visual speech recognition; audio-visual speech database; visual variations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Hybrid Intelligent Systems (HIS), 2013 13th International Conference on
  • Conference_Location
    Gammarth
  • Print_ISBN
    978-1-4799-2438-7
  • Type

    conf

  • DOI
    10.1109/HIS.2013.6920467
  • Filename
    6920467