Title :
Audio indexing including frequency tracking of simultaneous multiple sources in speech and music
Author :
Le Coz, M. ; Pinquier, Julien ; Andre-Obrecht, Regine ; Mauclair, J.
Author_Institution :
IRIT, Toulouse, France
Abstract :
In this paper, we present a complete system for audio indexing. This system is based state-of-the-art methods of Speech-Music-Noise segmentation and Monophonic/Polyphonic estimation. After those methods we propose an original system of superposed sources detection. This approach is based on the analysis of the evolution of the predominant frequencies. In order to validate the whole system we used different corpora : Radio broadcasts, studio music and degraded field records. The first results are encouraging and show the potential of our approach which is generic and can be used on both music and speech contents.
Keywords :
audio signal processing; database indexing; music; audio indexing; degraded field records; monophonic-polyphonic estimation; music contents; radio broadcasts; simultaneous multiple source frequency tracking; speech contents; speech-music-noise segmentation; state-of-the-art methods; studio music; superposed source detection system; Context; Harmonic analysis; Indexing; Instruments; Noise; Speech; Time-frequency analysis;
Conference_Titel :
Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on
Conference_Location :
Veszprem
Print_ISBN :
978-1-4799-0955-1
DOI :
10.1109/CBMI.2013.6576547