• DocumentCode
    2930651
  • Title

    A divide-and-conquer approach to Latent Perceptual Indexing of audio for large Web 2.0 applications

  • Author

    Sundaram, Shiva ; Narayanan, Shrikanth

  • Author_Institution
    Deutsche Telekom Labs., TU-Berlin, Berlin, Germany
  • fYear
    2009
  • fDate
    June 28 2009-July 3 2009
  • Firstpage
    466
  • Lastpage
    469
  • Abstract
    In the recently proposed latent perceptual indexing of audio, a collection of clips is indexed using unit-document frequency measures between a set of reference clusters as units and the clips as the documents. The reference units are derived by clustering the bag-of-feature vectors extracted from the whole audio library using an unsupervised clustering technique. Indexing is achieved through reduced-rank approximation (using singular-value decomposition) of the unit-document co-occurrence measure matrix that is obtained for the given set of reference clusters and the collection of audio clips. In our initial investigation, the k-means algorithm was used to derive the reference units. In this paper, we attempt to reduce the computation load requirements for the k-means algorithm and singular-value decomposition by randomly splitting the training data into smaller sized parts instead of working on it as a whole. We present results of classification experiments on the BBC sound effects library and our results indicate this approach can significantly reduce the computation time without significant loss in classification performance.
  • Keywords
    Internet; audio databases; content-based retrieval; divide and conquer methods; multimedia systems; pattern clustering; singular value decomposition; unsupervised learning; audio clips; audio indexing; audio library; bag-of-feature vectors; divide-and-conquer approach; k-means algorithm; latent perceptual indexing; randomly splitting; reduced-rank approximation; singular-value decomposition; unit-document cooccurrence measure matrix; unit-document frequency; unsupervised clustering technique; Clustering algorithms; Concurrent computing; Content based retrieval; Frequency measurement; Indexing; Large scale integration; Libraries; Matrix decomposition; Measurement units; Training data; Latent Perceptual Indexing; Web 2.0 applications; audio classification; clustering; content based audio retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
  • Conference_Location
    New York, NY
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4244-4290-4
  • Electronic_ISBN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2009.5202535
  • Filename
    5202535