Title :
A divide-and-conquer approach to Latent Perceptual Indexing of audio for large Web 2.0 applications
Author :
Sundaram, Shiva ; Narayanan, Shrikanth
Author_Institution :
Deutsche Telekom Labs., TU-Berlin, Berlin, Germany
fDate :
June 28 2009-July 3 2009
Abstract :
In the recently proposed latent perceptual indexing of audio, a collection of clips is indexed using unit-document frequency measures between a set of reference clusters as units and the clips as the documents. The reference units are derived by clustering the bag-of-feature vectors extracted from the whole audio library using an unsupervised clustering technique. Indexing is achieved through reduced-rank approximation (using singular-value decomposition) of the unit-document co-occurrence measure matrix that is obtained for the given set of reference clusters and the collection of audio clips. In our initial investigation, the k-means algorithm was used to derive the reference units. In this paper, we attempt to reduce the computation load requirements for the k-means algorithm and singular-value decomposition by randomly splitting the training data into smaller sized parts instead of working on it as a whole. We present results of classification experiments on the BBC sound effects library and our results indicate this approach can significantly reduce the computation time without significant loss in classification performance.
Keywords :
Internet; audio databases; content-based retrieval; divide and conquer methods; multimedia systems; pattern clustering; singular value decomposition; unsupervised learning; audio clips; audio indexing; audio library; bag-of-feature vectors; divide-and-conquer approach; k-means algorithm; latent perceptual indexing; randomly splitting; reduced-rank approximation; singular-value decomposition; unit-document cooccurrence measure matrix; unit-document frequency; unsupervised clustering technique; Clustering algorithms; Concurrent computing; Content based retrieval; Frequency measurement; Indexing; Large scale integration; Libraries; Matrix decomposition; Measurement units; Training data; Latent Perceptual Indexing; Web 2.0 applications; audio classification; clustering; content based audio retrieval;
Conference_Titel :
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
978-1-4244-4290-4
Electronic_ISBN :
1945-7871
DOI :
10.1109/ICME.2009.5202535