DocumentCode :
80095
Title :
Multiobjective Time Series Matching for Audio Classification and Retrieval
Author :
Esling, Philippe ; Agon, Carlos
Author_Institution :
Inst. de Rech. et Coordination Acoust./Musique (IRCAM) Lab., Paris, France
Volume :
21
Issue :
10
fYear :
2013
fDate :
Oct. 2013
Firstpage :
2057
Lastpage :
2072
Abstract :
Seeking sound samples in a massive database can be a tedious and time consuming task. Even when metadata are available, query results may remain far from the timbre expected by users. This problem stems from the nature of query specification, which does not account for the underlying complexity of audio data. The Query By Example (QBE) paradigm tries to tackle this shortcoming by finding audio clips similar to a given sound example. However, it requires users to have a well-formed soundfile of what they seek, which is not always a valid assumption. Furthermore, most audio-retrieval systems rely on a single measure of similarity, which is unlikely to convey the perceptual similarity of audio signals. We address in this paper an innovative way of querying generic audio databases by simultaneously optimizing the temporal evolution of multiple spectral properties. We show how this problem can be cast into a new approach merging multiobjective optimization and time series matching, called MultiObjective Time Series (MOTS) matching. We formally state this problem and report an efficient implementation. This approach introduces a multidimensional assessment of similarity in audio matching. This allows to cope with the multidimensional nature of timbre perception and also to obtain a set of efficient propositions rather than a single best solution. To demonstrate the performances of our approach, we show its efficiency in audio classification tasks. By introducing a selection criterion based on the hypervolume dominated by a class, we show that our approach outstands the state-of-art methods in audio classification even with a few number of features. We demonstrate its robustness to several classes of audio distortions. Finally, we introduce two innovative applications of our method for sound querying.
Keywords :
Pareto optimisation; audio databases; audio signal processing; content-based retrieval; pattern classification; pattern matching; query processing; time series; MOTS matching; Pareto optimization; QBE paradigm; audio classification tasks; audio clips; audio data complexity; audio distortions; audio matching; audio retrieval systems; audio signals; content-based retrieval; generic audio database querying; metadata; multiobjective optimization approach; multiobjective time series matching; multiple spectral property; query by example; query specification; selection criterion; sound file; sound querying; timbre perception; Audio databases; Pareto optimization; classification algorithms; content-based retrieval; data mining; data structures; indexing; multimedia databases; music information retrieval; pattern matching; query processing; time series analysis;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2013.2265086
Filename :
6521366
Link To Document :
بازگشت