Title :
Automatic low-dimensional analysis of audio databases
Author :
Arias, Joss Anibal ; Andre-Obrecht, Regine ; Farinas, Jerome
Author_Institution :
IRIT/SAMOVA, Univ. Paul Sabatier, Toulouse
Abstract :
In this paper we present an approach designed to map variable size audio sequences into fixed-length vectors, useful to discover contents of audio databases. First, we model standard audio parameters with Gaussian mixture models (GMM). Then, symmetric Kullback-Leiber divergences between models are approximated with a Monte-Carlo method. We use these statistical dissimilarities to find a low-dimensional representation of each audio sequence through Multidimensional scaling (MDS) algorithm. Vectors in low-dimensional spaces are then easily explored with kernel and clustering methods. Experiments carried out in different kind of audio databases (music, speakers and languages) show good potential of the proposed approach and provide a framework for more challenging applications.
Keywords :
Gaussian processes; Monte Carlo methods; audio databases; pattern clustering; Gaussian mixture models; Monte-Carlo method; audio databases; automatic low-dimensional analysis; fixed-length vectors; multidimensional scaling algorithm; statistical dissimilarities; variable size audio sequences; Audio databases; Data analysis; Feature extraction; Hidden Markov models; Kernel; Machine learning; Machine learning algorithms; Multidimensional systems; Natural languages; Spatial databases; Unsupervised learning; audio databases; machine learning;
Conference_Titel :
Content-Based Multimedia Indexing, 2008. CBMI 2008. International Workshop on
Conference_Location :
London
Print_ISBN :
978-1-4244-2043-8
Electronic_ISBN :
978-1-4244-2044-5
DOI :
10.1109/CBMI.2008.4564996