• DocumentCode
    179577
  • Title

    Robust far-field spoken command recognition for home automation combining adaptation and multichannel processing

  • Author

    Katsamanis, Athanasios ; Rodomagoulakis, I. ; Potamianos, Gerasimos ; Maragos, Petros ; Tsiami, Antigoni

  • Author_Institution
    Sch. of ECE, Nat. Tech. Univ. of Athens, Athens, Greece
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    5547
  • Lastpage
    5551
  • Abstract
    The paper presents our approach to speech-controlled home automation. We focus on the detection and recognition of spoken commands preceded by a key-phrase, as recorded by multiple microphones installed in the rooms of a voice-enabled apartment. For both problems we investigate robust modeling, environmental adaptation, and multichannel processing to cope with (a) insufficient training data and (b) far-field effects and noise in the apartment. The proposed integrated scheme is evaluated on a challenging and highly realistic corpus of simulated audio recordings and achieves an F-measure close to 0.70 for key-phrase spotting and word accuracy close to 98% for the command recognition task.
  • Keywords
    home automation; microphones; speech recognition; audio recordings; environmental adaptation; far-field effects; insufficient training data; key-phrase spotting; multichannel processing; robust far-field spoken command recognition; speech-controlled home automation; spoken command detection; voice-enabled apartment; word accuracy; Acoustics; Adaptation models; Hidden Markov models; Robustness; Speech; adaptation; distant speech recognition; keyword spotting
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854664
  • Filename
    6854664