Title :
An efficient method for the unsupervised discovery of signalling motifs in large audio streams
Author :
Muscariello, Armando ; Gravier, Guillaume ; Bimbot, Frédéric
Author_Institution :
CNRS, IRISA, Rennes, France
Abstract :
Providing effective tools to navigate and access through long audio archives, or monitor and classify broadcast streams, proves to be an extremely challenging task. Main issues originate from the varied nature of patterns of interest in a composite audio environment, the massive size of such databases, and the capability of performing when prior knowledge on audio content is scarce or absent. This paper proposes a computational architecture aimed at discovering occurrences of repeating patterns in audio streams by means of unsupervised learning. The targeted repetitions (or motifs) are called signalling, by analogy with a biological nomenclature, as referring to a broad class of audio patterns (as jingles, songs, advertisements, etc...) frequently occurring in broadcast audio. We adapt a system originally developed for word discovery applications, and demonstrate its effectiveness in a song discovery scenario. The adaption consists in speeding up critical parts of the computations, mostly based on audio feature coarsening, to deal with the large occurrence period of repeating songs in radio streams.
Keywords :
audio databases; audio streaming; data mining; unsupervised learning; audio archive; audio content; audio stream; biological nomenclature; broadcast stream; signalling motif; unsupervised discovery; unsupervised learning; word discovery application; Accuracy; Computer architecture; Heuristic algorithms; Libraries; Pattern matching; Search problems; Speech;
Conference_Titel :
Content-Based Multimedia Indexing (CBMI), 2011 9th International Workshop on
Conference_Location :
Madrid
Print_ISBN :
978-1-61284-432-9
Electronic_ISBN :
1949-3983
DOI :
10.1109/CBMI.2011.5972536