• DocumentCode
    3763249
  • Title

    Analyzing Artificial Neural Networks and Dynamic Time Warping for spoken keyword recognition under transient noise conditions

  • Author

    Paulo Lopez-Meyer;Hector Cordourier-Maruri;Arturo Quinto-Martinez;Omesh Tickoo

  • Author_Institution
    Intel Corporation, Intel Labs, Jones Farms, Hillsboro, OR, USA
  • fYear
    2015
  • Firstpage
    274
  • Lastpage
    277
  • Abstract
    Spoken keyword recognition has been under the spotlight for the past several decades, but has gained significant attention in recent years due to the rapid increase in front-end technology applications for mobile and wearable computing. This work presents the trade-off in performance between Artificial Neural Networks (ANN) and Dynamic Time Warping (DTW) methodologies, implemented for this task under three different transient noise conditions (inside a car, in a pub, and outdoors), where no external noise reduction pre-processing is used. For this purpose, two types of recognition models were implemented: speaker dependent (SD) and speaker independent (SI). Experimental results show comparable high keyword recognition precision in SD models for both ANN and DTW using baseline data, i.e. no transient noise, but for the SI models, a significant drop in precision was observed for the case of DTW. Additional precision analyses present the results on how the different types of transient noise affect the recognition methodologies of interest. From the point of view of storage resources, both methodologies require comparable memory usage for the SD models, however, the SI model increases the memory needed with the DTW methodology. Lastly, time performance analysis showed a faster recognition time using the ANN methodology.
  • Keywords
    "Artificial neural networks","Mel frequency cepstral coefficient","Silicon","Transient analysis","Wearable computers","Computational modeling","Feature extraction"
  • Publisher
    ieee
  • Conference_Titel
    Sensing Technology (ICST), 2015 9th International Conference on
  • Electronic_ISBN
    2156-8073
  • Type

    conf

  • DOI
    10.1109/ICSensT.2015.7438406
  • Filename
    7438406