• DocumentCode
    1349927
  • Title

    On the applications of multimedia processing to communications

  • Author

    Cox, Richard V. ; Haskell, Barry G. ; LeCun, Yann ; Shahraray, Behzad ; Rabiner, Lawrence

  • Author_Institution
    Speech & Image Process. Services Res. Lab., AT&T Bell Labs., Florham Park, NJ, USA
  • Volume
    86
  • Issue
    5
  • fYear
    1998
  • fDate
    5/1/1998
  • Firstpage
    755
  • Lastpage
    824
  • Abstract
    The challenge of multimedia processing is to provide services that seamlessly integrate text, sound, image, and video information and to do it in a way that preserves the ease of use and interactivity of conventional plain old telephone service (POTS) telephony. To achieve this goal, there are a number of technological problems that must be considered, including: compression and coding of multimedia signals, including algorithmic issues, standards issues, and transmission issues; synthesis and recognition of multimedia signals, including speech, images, handwriting, and text; organization, storage, and retrieval of multimedia signals, including the appropriate method and speed of delivery, resolution, and quality of service; access methods to the multimedia signal, including spoken natural language interfaces, agent interfaces, and media conversion tools; searching by text, speech, and image queries; browsing by accessing the text, by voice, or by indexed images. In each of these areas, a great deal of progress has been made in the past few years, driven in part by the relentless growth in multimedia personal computers and in part by the promise of broad-band access from the home and from wireless connections. Standards have also played a key role in driving new multimedia services, both on the POTS network and on the Internet. It is the purpose of this paper to review the status of the technology in each of the areas listed above and to illustrate current capabilities by describing several multimedia applications that have been implemented at AT&T Labs over the past several years.
  • Keywords
    data compression; multimedia communication; multimedia computing; query processing; user interfaces; video coding; HDTV; agents; audio coding; cable modems; coding; communications networks; compression; content-based video sampling; document compression; fax coding; handwriting; image coding; multimedia indexing; multimedia processing; multimedia signals; optical character recognition; organization; recognition; retrieval; speech coding; storage; synthesis; teleconferencing; telephone service; telephony; text; video information; video telephony; Appropriate technology; Image coding; Image storage; Multimedia communication; Multimedia systems; Speech coding; Speech synthesis; Standards organizations; Telephony; Video compression
  • fLanguage
    English
  • Journal_Title
    Proceedings of the IEEE
  • Publisher
    IEEE
  • ISSN
    0018-9219
  • Type

    jour

  • DOI
    10.1109/5.664272
  • Filename
    664272