DocumentCode :
2546419
Title :
Audio segmentation and classification based on a selective analysis scheme
Author :
Ghaemmaghami, Shahrokh
Author_Institution :
Sharif Univ. of Technol., Tehran, Iran
fYear :
2004
fDate :
5-7 Jan. 2004
Firstpage :
42
Lastpage :
48
Abstract :
This paper addresses a new approach to segmentation and classification of audio through analysis of a smaller set of selective frames, which are identified by temporal decomposition (TD). These frames are located at the most steady instants, or event centroids, within a given block of the signal, which yield the maximal diversity over the set of selected features. Based on this selection scheme, the number of frames used in the analysis is reduced by at least 40%, while the temporal resolution is doubled as compared to that in typical audio classifiers. By constructing a classification system to segment audio into speech, music, speech-music, and others, it is shown that the proposed method outperforms the typical classifiers in most cases. In addition, by using hierarchical TD for frame selection, it is made possible to adapt the audio classifier with other segmentation schemes, e.g., visual classification based on motion picture analysis, for accurate audio-visual segmentation of multimedia data.
Keywords :
audio signal processing; signal classification; vector quantisation; audio classification; audio data annotation; audio data handling; audio data indexing; audio segmentation; audio-visual segmentation; event centroids; intelligent management; motion picture analysis; multimedia data; selective analysis scheme; temporal decomposition; temporal resolution; video data annotation; video data handling; video data indexing; visual classification; Data analysis; Energy management; Frequency; Indexing; Instruments; Motion pictures; Music; Signal processing; Signal resolution; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Modelling Conference, 2004. Proceedings. 10th International
Print_ISBN :
0-7695-2084-7
Type :
conf
DOI :
10.1109/MULMM.2004.1264965
Filename :
1264965
Link To Document :
بازگشت