• DocumentCode
    1696571
  • Title
    Speed improvements to Information Retrieval-based dynamic time warping using hierarchical K-Means clustering
  • Author
    Mantena, Gautam ; Anguera, Xavier
  • Author_Institution
    Telefonica Res., Barcelona, Spain
  • fYear
    2013
  • Firstpage
    8515
  • Lastpage
    8519
  • Abstract
    With the increase in multimedia data over the Internet, query-by-example spoken term detection (QbE-STD) has become important in providing a search mechanism to find spoken queries in spoken audio. Audio search algorithms should be efficient in terms of speed and memory to handle large audio files. In general, approaches derived from the well-known dynamic time warping (DTW) algorithm suffer from scalability problems. To overcome such problems, an Information Retrieval-based DTW (IR-DTW) algorithm has been proposed recently. IR-DTW borrows techniques from the Information Retrieval community to detect regions that are more likely to contain the spoken query and then uses a standard DTW to obtain exact start and end times. One drawback of IR-DTW is the time taken to retrieve similar reference points for a given query point. In this paper, we propose a method to improve the search performance of the IR-DTW algorithm using a clustering-based technique. The proposed method has shown an estimated speedup of 2400X.
  • Keywords
    Internet; audio recording; multimedia communication; query processing; IR-DTW algorithm; QbE-STD; audio files; audio search; dynamic time warping; hierarchical k-means clustering; information retrieval; multimedia data; query by example spoken term detection; reference points; scalability problems; search mechanism; speed improvements; spoken audio; spoken queries; Clustering algorithms; Heuristic algorithms; Indexing; Standards; Vectors; Spoken term detection; query by example; retrieval
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2013.6639327
  • Filename
    6639327