Title :
Visually-based audio texture segmentation for audio scene analysis
Author :
Ghozi, R. ; Fraj, O. ; Jaidane, M.
Author_Institution :
Unite Signaux et Syst. (U2S), Ecole Nat. d'Ing. de Tunis, Tunis, Tunisia
Abstract :
In an analogy with image texture segmentation in visual scene analysis, this paper describes a method for segmenting sound textures in an audio stream. In particular, we propose a visual scheme to partition an audio stream into segments of audio texture. This visual representation is based on the inter-similarity matrix of the MFCC features of the signal frames. Classical image enhancement techniques such as binarization and median filtering are applied to the inter-similarity matrix in order to partition it into homogeneous regions. A novelty test operator is then used to localize the boundaries of the image regions, which correspond to audio texture boundaries and thereby signal a change of audio scene. The perceptual and computational advantages of this visually-based audio texture segmentation are illustrated using a wide range of sound textures of varying degrees of complexity.
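The pipeline outlined in the abstract (similarity matrix, binarization, median filtering, novelty test) can be sketched roughly as follows. This is only an illustrative sketch under several assumptions that the abstract does not confirm: the similarity measure is taken to be cosine similarity, the binarization threshold is the matrix mean, the novelty operator is a checkerboard kernel slid along the main diagonal, and the MFCC frames are replaced by synthetic 13-dimensional vectors. All function names and parameter values are hypothetical.

```python
import numpy as np

def similarity_matrix(features):
    """Cosine similarity between all pairs of frame feature vectors (assumed measure)."""
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.maximum(norms, 1e-12)
    return unit @ unit.T

def binarize(matrix):
    """Threshold the matrix at its mean value (assumed threshold choice)."""
    return (matrix >= matrix.mean()).astype(float)

def median_filter(image, size=3):
    """Plain size x size median filter with edge replication at the borders."""
    pad = size // 2
    padded = np.pad(image, pad, mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, (size, size))
    return np.median(windows, axis=(-2, -1))

def checkerboard_kernel(half_width):
    """+1 on the two diagonal quadrants, -1 on the two off-diagonal quadrants."""
    signs = np.sign(np.arange(-half_width, half_width) + 0.5)
    return np.outer(signs, signs)

def novelty_curve(matrix, half_width=8):
    """Correlate the checkerboard kernel with windows along the main diagonal."""
    kernel = checkerboard_kernel(half_width)
    padded = np.pad(matrix, half_width, mode="edge")
    n = matrix.shape[0]
    novelty = np.zeros(n)
    for i in range(n):
        # Window covers original frames i - half_width .. i + half_width - 1.
        window = padded[i:i + 2 * half_width, i:i + 2 * half_width]
        novelty[i] = np.sum(window * kernel)
    return novelty

# Synthetic stand-in for MFCC frames: two "textures" of 40 and 60 frames.
rng = np.random.default_rng(0)
texture_a = np.zeros(13)
texture_a[0] = 1.0
texture_b = np.zeros(13)
texture_b[1] = 1.0
frames = np.vstack([
    texture_a + 0.01 * rng.standard_normal((40, 13)),
    texture_b + 0.01 * rng.standard_normal((60, 13)),
])

image = median_filter(binarize(similarity_matrix(frames)))
novelty = novelty_curve(image)
boundary = int(np.argmax(novelty))
print("estimated texture boundary at frame", boundary)
```

In this sketch the novelty curve stays near zero inside a homogeneous region and peaks where two regions meet on the diagonal, so the index of the largest value serves as the estimated boundary. With real audio, the synthetic `frames` array would be replaced by per-frame MFCC vectors, and several peaks would be picked rather than a single maximum.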
Keywords :
audio signal processing; audio streaming; matrix algebra; MFCC feature; audio scene analysis; audio stream; audio textures boundaries; binarization; classical image enhancement; homogenous regions; image regions; image texture segmentation; inter-similarity matrix; median filtering; signal frames; signalling; sound textures; visual representation; visual scene analysis; visual scheme; visually-based audio texture segmentation; Filtering; Image analysis; Image enhancement; Image segmentation; Kernel; Streaming media; Visualization; MFCC; Novelty test; audio texture; image enhancement; inter-similarity matrix; audio/image segmentation;
Conference_Title :
Signal Processing Conference, 2007 15th European
Conference_Location :
Poznan
Print_ISBN :
978-839-2134-04-6