Title :
Fast query by example of environmental sounds via robust and efficient cluster-based indexing
Author :
Xue, Jiachen ; Wichern, Gordon ; Thornburg, Harvey ; Spanias, Andreas
Author_Institution :
Arts, Media, & Eng., Arizona State Univ., Tempe, AZ
fDate :
March 31 2008-April 4 2008
Abstract :
There has been much recent progress in the technical infrastructure necessary to continuously characterize and archive all sounds, or more precisely auditory streams, that occur within a given space or human life. Efficient and intuitive access, however, remains a considerable challenge. In specifically musical domains, i.e., melody retrieval, query-by-example (QBE) has found considerable success in accessing music that matches a specific query. We propose an extension of the QBE paradigm to the broad class of natural and environmental sounds, which occur frequently in continuous recordings. We explore several cluster-based indexing approaches, namely non-negative matrix factorization (NMF) and spectral clustering to efficiently organize and quickly retrieve archived audio using the QBE paradigm. Experiments on a test database compare the performance of the different clustering algorithms in terms of recall, precision, and computational complexity. Initial results indicate significant improvements over both exhaustive search schemes and traditional K- means clustering, and excellent overall performance in the example-based retrieval of environmental sounds.
Keywords :
acoustic signal processing; audio databases; audio signal processing; database indexing; hidden Markov models; matrix decomposition; music; pattern clustering; pattern matching; query processing; spectral analysis; K-means clustering; acoustic signal processing; cluster-based indexing; computational complexity; environmental sounds; example-based retrieval; hidden Markov model; melody retrieval; music database; nonnegative matrix factorization; query by example; spectral clustering; Audio databases; Audio recording; Clustering algorithms; Hidden Markov models; Indexing; Matrix decomposition; Music information retrieval; Robustness; Spatial databases; Streaming media; Acoustic signal analysis; Clustering methods; Database query processing; Hidden Markov models;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517532