DocumentCode
2930651
Title
A divide-and-conquer approach to Latent Perceptual Indexing of audio for large Web 2.0 applications
Author
Sundaram, Shiva ; Narayanan, Shrikanth
Author_Institution
Deutsche Telekom Labs., TU-Berlin, Berlin, Germany
fYear
2009
fDate
June 28 2009-July 3 2009
Firstpage
466
Lastpage
469
Abstract
In the recently proposed latent perceptual indexing of audio, a collection of clips is indexed using unit-document frequency measures between a set of reference clusters as units and the clips as the documents. The reference units are derived by clustering the bag-of-feature vectors extracted from the whole audio library using an unsupervised clustering technique. Indexing is achieved through reduced-rank approximation (using singular-value decomposition) of the unit-document co-occurrence measure matrix that is obtained for the given set of reference clusters and the collection of audio clips. In our initial investigation, the k-means algorithm was used to derive the reference units. In this paper, we attempt to reduce the computation load requirements for the k-means algorithm and singular-value decomposition by randomly splitting the training data into smaller sized parts instead of working on it as a whole. We present results of classification experiments on the BBC sound effects library and our results indicate this approach can significantly reduce the computation time without significant loss in classification performance.
Keywords
Internet; audio databases; content-based retrieval; divide and conquer methods; multimedia systems; pattern clustering; singular value decomposition; unsupervised learning; audio clips; audio indexing; audio library; bag-of-feature vectors; divide-and-conquer approach; k-means algorithm; latent perceptual indexing; randomly splitting; reduced-rank approximation; singular-value decomposition; unit-document cooccurrence measure matrix; unit-document frequency; unsupervised clustering technique; Clustering algorithms; Concurrent computing; Content based retrieval; Frequency measurement; Indexing; Large scale integration; Libraries; Matrix decomposition; Measurement units; Training data; Latent Perceptual Indexing; Web 2.0 applications; audio classification; clustering; content based audio retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location
New York, NY
ISSN
1945-7871
Print_ISBN
978-1-4244-4290-4
Electronic_ISBN
1945-7871
Type
conf
DOI
10.1109/ICME.2009.5202535
Filename
5202535
Link To Document