• DocumentCode
    672385
  • Title

    NMF-based keyword learning from scarce data

  • Author

    Ons, Bart ; Gemmeke, Jort F. ; Van hamme, Hugo

  • Author_Institution
    Dept. ESAT-PSI, KULeuven, Leuven, Belgium
  • fYear
    2013
  • fDate
    8-12 Dec. 2013
  • Firstpage
    392
  • Lastpage
    397
  • Abstract
    This research is situated in a project aimed at the development of a vocal user interface (VUI) that learns to understand its users specifically persons with a speech impairment. The vocal interface adapts to the speech of the user by learning the vocabulary from interaction examples. Word learning is implemented through weakly supervised non-negative matrix factorization (NMF). The goal of this study is to investigate how we can improve word learning when the number of interaction examples is low. We demonstrate two approaches to train NMF models on scarce data: 1) training word models using smoothed training data, and 2) training word models that strictly correspond to the grounding information derived from a few interaction examples. We found that both approaches can substantially improve word learning from scarce training data.
  • Keywords
    human computer interaction; learning (artificial intelligence); matrix decomposition; natural language interfaces; speech recognition; speech-based user interfaces; NMF-based keyword learning; VUI; grounding information; interaction examples; scarce training data; smoothed training data; speech impairment; training word; user understanding; vocabulary; vocal interface; vocal user interface; weakly supervised nonnegative matrix factorization; Accuracy; Acoustics; Smoothing methods; Speech; Training; Training data; Vectors; data scarcity; vocabulary acquisition; vocal user interface; weakly supervised non-negative matrix factorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
  • Conference_Location
    Olomouc
  • Type

    conf

  • DOI
    10.1109/ASRU.2013.6707762
  • Filename
    6707762