Title :
A fuzzy approach towards perceptual classification and segmentation of MP3/AAC audio
Author :
Kiranyaz, Serkan ; Qureshi, Ahmad Farooq ; Gabbouj, Moncef
Author_Institution :
Institute of Signal Process., Tampere Univ. of Technol., Finland
Abstract :
This paper presents a novel perceptual-based fuzzy approach to the classification and segmentation of MP3 and AAC audio in the compressed domain. The input audio is split into segments, which are classified as speech, music, fuzzy, or silent. The proposed method minimizes critical misclassification errors through fuzzy-region modeling, thus increasing the efficiency of both pure and fuzzy classification. Experimental results show that critical errors are minimized and that the method is robust to the capturing and encoding parameters of MP3 and AAC bit streams. Owing to the efficiency gained from fuzzy-region modeling and the improved accuracy of its rule-based semantic approach, the method is designed specifically for audio-based multimedia indexing and retrieval systems.
Keywords :
audio coding; data compression; fuzzy set theory; signal classification; AAC audio; advanced audio coding; audio multimedia indexing; fuzzy approach; fuzzy region modeling; retrieval systems; Audio coding; Background noise; Content based retrieval; Design methodology; Digital audio players; Indexing; Information retrieval; Multimedia systems; Speech; Streaming media;
Conference_Title :
Control, Communications and Signal Processing, 2004. First International Symposium on
Print_ISBN :
0-7803-8379-6
DOI :
10.1109/ISCCSP.2004.1296516