• DocumentCode
    311364
  • Title
    Multimodal interfaces for multimedia information agents
  • Author
    Waibel, Alex ; Suhm, Bernhard ; Vo, Minh Tue ; Yang, Jie
  • Author_Institution
    Interactive Syst. Labs., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    1
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    167
  • Abstract
    When humans communicate, they take advantage of a rich spectrum of cues. Some are verbal and acoustic. Some are non-verbal and non-acoustic. Signal processing technology has devoted much attention to the recognition of speech, as a single human communication signal. Most other complementary communication cues, however, remain unexplored and unused in human-computer interaction. In this paper we show that the addition of non-acoustic or non-verbal cues can significantly enhance robustness, flexibility, naturalness and performance of human-computer interaction. We demonstrate computer agents that use speech, gesture, handwriting, pointing, and spelling jointly for more robust, natural and flexible human-computer interaction in the various tasks of an information worker: information creation, access, manipulation or dissemination.
  • Keywords
    graphical user interfaces; image recognition; information dissemination; multimedia computing; natural language interfaces; software agents; speech recognition; complementary communication cues; computer agents; flexibility; gesture; handwriting; human communication signal; human-computer interaction; information access; information creation; information dissemination; information manipulation; information worker; multimedia information agents; multimodal interfaces; naturalness; nonacoustic cues; nonverbal cues; pointing; robustness; signal processing technology; speech; spelling; Computer interfaces; Face detection; Handwriting recognition; Humans; Interactive systems; Robustness; Shape; Speech processing; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type
    conf
  • DOI
    10.1109/ICASSP.1997.599587
  • Filename
    599587