• DocumentCode
    768561
  • Title

    A quick search method for audio and video signals based on histogram pruning

  • Author

    Kashino, Kunio ; Kurozumi, Takayuki ; Murase, Hiroshi

  • Author_Institution
    NTT Commun. Sci. Labs., Atsugi, Japan
  • Volume
    5
  • Issue
    3
  • fYear
    2003
  • Firstpage
    348
  • Lastpage
    357
  • Abstract
    This paper proposes a quick method of similarity-based signal searching to detect and locate a specific audio or video signal given as a query in a stored long audio or video signal. With existing techniques, similarity-based searching may become impractical in terms of computing time when searching through long-running signals (several days' worth). The proposed algorithm, which is referred to as time-series active search, offers significantly faster search with sufficient accuracy. The key to the acceleration is an effective pruning algorithm introduced in the histogram matching stage. Through the pruning, the actual number of matching calculations can be reduced by 200 to 500 times compared with exhaustive search while guaranteeing exactly the same search result. Experiments show that the proposed method can correctly detect and locate a 15-s signal in a 48-h recording of TV broadcasts within 1 s, once the feature vectors are calculated and quantized. As extensions of the basic algorithm, efficient AND/OR search methods for searching for multiple query signals and a feature dithering method for coping with signal distortion are also discussed.
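The pruning idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the similarity measure or the skip rule, so everything below is an assumption: the signals are taken to be already quantized into symbol sequences, similarity is the histogram intersection (the count of symbols shared by the query histogram and a window histogram), and a window counts as a hit when that count reaches an integer threshold. The names `matched_count`, `exhaustive_search`, `pruned_search`, and `min_matches` are hypothetical. Under these assumptions, shifting a window by w positions can raise the shared count by at most w, so a window that falls short by k matches allows the next k - 1 positions to be skipped without changing the result.

```python
from collections import Counter


def matched_count(query_hist, window):
    """Histogram intersection: number of symbols shared by query and window."""
    window_hist = Counter(window)
    return sum(min(c, window_hist[s]) for s, c in query_hist.items())


def exhaustive_search(query, stored, min_matches):
    """Baseline: evaluate every window position."""
    d = len(query)
    query_hist = Counter(query)
    return [n for n in range(len(stored) - d + 1)
            if matched_count(query_hist, stored[n:n + d]) >= min_matches]


def pruned_search(query, stored, min_matches):
    """Same hits as exhaustive_search, but skipping positions that cannot match.

    Moving the window by w swaps at most w symbols, so the shared count grows
    by at most w. If the current window is short by k matches, the nearest
    position that could still reach min_matches is k steps ahead.
    """
    d = len(query)
    query_hist = Counter(query)
    hits, evaluated, n = [], 0, 0
    while n + d <= len(stored):
        m = matched_count(query_hist, stored[n:n + d])
        evaluated += 1
        if m >= min_matches:
            hits.append(n)
            n += 1
        else:
            n += min_matches - m  # safe skip width (always >= 1 here)
    return hits, evaluated
```

Because the skip width comes from an upper bound on the similarity, the pruned search returns the same hit positions as the exhaustive one while evaluating fewer windows; the reduction factor depends on the data and the threshold, and this sketch makes no claim to reproduce the figures reported in the abstract.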
  • Keywords
    audio signal processing; pattern matching; time series; video signal processing; 15 s; 48 h; TV broadcasts; audio signals; efficient AND/OR search methods; feature dithering method; histogram pruning; multiple query signals; quick search method; signal distortion; similarity-based signal searching; time-series active search; video signals; Acceleration; Content based retrieval; Histograms; Information retrieval; Internet; Multimedia databases; Music information retrieval; Search methods; Signal detection; Speech;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2003.813281
  • Filename
    1223562