Title :
Audio classification using acoustic images for retrieval from multimedia databases
Author :
Paraskevas, Ioannis ; Chilton, Edward
Author_Institution :
Surrey Univ., Guildford, UK
Abstract :
With the increasing use of audio-visual databases, the need for automatic content-based classification has grown in importance. In this paper, a novel method for the automatic recognition of acoustic utterances is presented using acoustic images as the basis for the feature extraction. This method effectively employs the spectrogram, the Wigner-Ville distribution and co-occurrence matrices. The images are then compressed, using statistical methods, before being combined into a single feature matrix to be presented to a classifier. Initial results obtained from the classification of a database of sport sounds and gunshots indicate that the method is capable of accurate discrimination for coarse and fine classification respectively.
Keywords :
Wigner distribution; audio acoustics; audio signal processing; audio-visual systems; content-based retrieval; data compression; feature extraction; image classification; image coding; image texture; matrix algebra; multimedia databases; Wigner-Ville distribution; acoustic images; acoustic utterances automatic recognition; audio classification; audio-visual databases; automatic content-based classification; co-occurrence matrices; compressed images; feature extraction; gunshots database; multimedia databases; single feature matrix; spectrogram; sport sounds database; statistical signal processing; time-frequency distributions; Audio databases; Feature extraction; Image coding; Image databases; Image recognition; Image retrieval; Information retrieval; Multimedia databases; Spatial databases; Spectrogram;
Conference_Titel :
Video/Image Processing and Multimedia Communications, 2003. 4th EURASIP Conference focused on
Print_ISBN :
953-184-054-7
DOI :
10.1109/VIPMC.2003.1220460