• DocumentCode
    3153537
  • Title

    Noise robust keyword spotting for user generated video blogs

  • Author

    Barakat, M.S. ; Ritz, C.H. ; Stirling, D.A.

  • Author_Institution
    ICT Res. Inst., Univ. of Wollongong, Wollongong, NSW, Australia
  • fYear
    2013
  • fDate
    15-19 July 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper presents a template-based system for speaker independent key word spotting (KWS) in continuous speech that can help in automatic analysis, indexing, search and retrieval of user generated videos by content. Extensive experiments on clean speech confirm that the proposed approach is superior to a HMM approach when applied to noisy speech with different signal-to-noise ratio (SNR) levels. Experiments conducted to detect swear words, personal names and product names within a set of online user generated video blogs shows significantly better recall and precision results compared to a traditional ASR-based approach.
  • Keywords
    Web sites; speech processing; video retrieval; HMM approach; automatic analysis; indexing analysis; noise robust keyword spotting; noisy speech; online user generated video blogs; signal-to-noise ratio levels; speaker independent key word spotting; template-based system; user generated video retrieval; user generated video search; Blogs; Hidden Markov models; Histograms; Noise measurement; Signal to noise ratio; Speech; Keyword Spotting (KWS); Noise Robustness; Social Networks; Template Matching (TM); Users Video Blogs;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2013 IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2013.6607589
  • Filename
    6607589