• DocumentCode
    1958046
  • Title

    A natural acoustic front-end for Interactive TV in the EU-Project DICIT

  • Author

    Marquardt, L. ; Svaizer, P. ; Mabande, E. ; Brutti, A. ; Zieger, C. ; Omologo, M. ; Kellermann, W.

  • Author_Institution
    Multimedia Commun. & Signal Process., Univ. of Erlangen-Nuremberg, Erlangen, Germany
  • fYear
    2009
  • fDate
    23-26 Aug. 2009
  • Firstpage
    894
  • Lastpage
    899
  • Abstract
    "Distant-talking Interfaces for Control of Interactive TV" (DICIT) is a European Union-funded project whose main objective is to integrate distant-talking voice interaction as a complementary modality to the use of a remote control in interactive TV systems. Hands-free and seamless control enables a natural user-system interaction providing a suitable means to greatly ease information retrieval. In the given living room scenario the system recognizes commands spoken by multiple and possibly moving users, even in the presence of background noise and TV surround audio. This paper focuses on the multichannel acoustic frontend (MCAF) processing for acoustic scene interpretation which is based on the combination of multi-channel acoustic echo cancellation, blind source separation, beamforming, acoustic event classification, and multiple speaker localization. The fully functional DICIT prototype consists of the MCAF, automatic speech recognition, natural language understanding, mixed-initiative dialogue and satellite connection.
  • Keywords
    acoustic signal processing; array signal processing; blind source separation; echo suppression; human computer interaction; interactive television; natural language processing; signal classification; speaker recognition; telecommunication control; EU-Project DICIT; European Union-funded project; MCAF; acoustic event classification; acoustic scene interpretation; automatic speech recognition; background noise; beamforming; blind source separation; distant-talking interface; distant-talking voice interaction; information retrieval; interactive TV control system; living room scenario; mixed-initiative dialogue; multichannel acoustic echo cancellation; multichannel acoustic frontend processing; natural acoustic front-end; natural language understanding; natural user-system interaction; remote control; satellite connection; speaker localization; Array signal processing; Background noise; Blind source separation; Control systems; Echo cancellers; Information retrieval; Layout; Loudspeakers; Prototypes; TV;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communications, Computers and Signal Processing, 2009. PacRim 2009. IEEE Pacific Rim Conference on
  • Conference_Location
    Victoria, BC
  • Print_ISBN
    978-1-4244-4560-8
  • Electronic_ISBN
    978-1-4244-4561-5
  • Type

    conf

  • DOI
    10.1109/PACRIM.2009.5291251
  • Filename
    5291251