Title :
Evolutionary feature synthesis for content-based audio retrieval
Author :
Kiranyaz, Serkan ; Raitoharju, Jenni ; Gabbouj, Moncef
Author_Institution :
Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
Abstract :
Although there is a wide variety of low-level audio features for content-based audio indexing and retrieval, they may lack the discrimination power needed for accurate description of the aural content, leading into a poor content-based retrieval performance. Furthermore, manual selection of features among a vast collection may easily lead into sub-optimal solutions. In this paper, we propose an evolutionary feature synthesis technique, which co-exists with a feature selection scheme. The synthesis process seeks for the optimal linear / non-linear operators and feature weights from a pre-defined search space, so as to synthesize a highly discriminative set of new (artificial) features from the set of selected features. The evolutionary search process in the multi-dimensional solution space is based on multi-dimensional particle swarm optimization (MD PSO) algorithm, along with a fractional global best formation (FGBF) technique. Unlike in many existing feature generation approaches found in the literature, the dimension of the synthesized feature vector is also optimized during the process. The synthesized features by the proposed approach are compared with original audio descriptors in an extensive set of retrieval tasks. The experimental results clearly demonstrate a crucial improvement of up to 15-25% in the retrieval performance. Moreover, the proposed synthesis technique surpasses the performance of the artificial neural networks for retrieving accurate audio content.
Keywords :
audio signal processing; content-based retrieval; evolutionary computation; feature extraction; indexing; particle swarm optimisation; search problems; FGBF technique; MD PSO algorithm; content-based audio indexing; content-based audio retrieval; content-based retrieval performance; evolutionary feature synthesis technique; evolutionary search process; feature generation; feature selection; feature vector; feature weights; fractional global best formation technique; low-level audio features; multidimensional particle swarm optimization; multidimensional solution space; optimal linear operator; optimal nonlinear operator; search space; suboptimal solutions; Audio databases; Encoding; Feature extraction; Mel frequency cepstral coefficient; Particle swarm optimization; Vectors; Content based retrieval; Evolutionary computation; Feature extraction; Particle swarm optimization;
Conference_Titel :
Communications, Signal Processing, and their Applications (ICCSPA), 2013 1st International Conference on
Conference_Location :
Sharjah
Print_ISBN :
978-1-4673-2820-3
DOI :
10.1109/ICCSPA.2013.6487265