Title :
A generic audio classification and segmentation approach for multimedia indexing and retrieval
Author :
Kiranyaz, Serkan ; Qureshi, Ahmad Farooq ; Gabbouj, Moncef
Author_Institution :
Inst. of Signal Process., Tampere Univ. of Technol., Finland
fDate :
5/1/2006 12:00:00 AM
Abstract :
We focus the attention on the area of generic and automatic audio classification and segmentation for audio-based multimedia indexing and retrieval applications. In particular, we present a fuzzy approach toward hierarchic audio classification and global segmentation framework based on automatic audio analysis providing robust, bi-modal, efficient and parameter invariant classification over global audio segments. The input audio is split into segments, which are classified as speech, music, fuzzy or silent. The proposed method minimizes critical errors of misclassification by fuzzy region modeling, thus increasing the efficiency of both pure and fuzzy classification. The experimental results show that the critical errors are minimized and the proposed framework significantly increases the efficiency and the accuracy of audio-based retrieval especially in large multimedia databases.
Keywords :
audio databases; database indexing; fuzzy set theory; information retrieval; multimedia databases; automatic audio segmentation; fuzzy approach; fuzzy region modeling; generic audio classification; multimedia databases; multimedia indexing; multimedia retrieval; Audio coding; Content based retrieval; Digital audio players; Indexing; Information resources; Information retrieval; Multimedia databases; Music information retrieval; Robustness; Speech enhancement; Automatic audio classification and segmentation; fuzzy modeling; multimedia indexing and retrieval; perceptual rule-based approach;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TSA.2005.857573