• DocumentCode
    2958977
  • Title

    A fuzzy approach towards perceptual classification and segmentation of MP3/AAC audio

  • Author

    Kiranyaz, Serkan ; Qureshi, Ahmad Farooq ; Gabbouj, Moncef

  • Author_Institution
    Institue of Signal Process., Tampere Univ. of Technol., Finland
  • fYear
    2004
  • fDate
    21-24 March 2004
  • Firstpage
    727
  • Lastpage
    730
  • Abstract
    The paper presents a novel perceptual based fuzzy approach towards classification and segmentation for MP3 and AAC audio in the compressed domain. The input audio is split into segments, which are classified as speech, music, fuzzy or silent. The proposed method minimizes critical errors of misclassification by fuzzy region modeling, thus increasing the efficiency of both pure and fuzzy classification. The experimental results show that the critical errors are minimized and the method is robust to capturing and encoding parameters of MP3 and AAC bit streams. Due to the efficiency obtained from fuzzy-region modeling and improved accuracy via rule-based semantic approach, the method is designed specifically for the audio-based multimedia indexing and retrieval systems.
  • Keywords
    audio coding; data compression; fuzzy set theory; signal classification; AAC audio; advanced audio coding; audio multimedia indexing; fuzzy approach; fuzzy region modeling; retrieval systems; Audio coding; Background noise; Content based retrieval; Design methodology; Digital audio players; Indexing; Information retrieval; Multimedia systems; Speech; Streaming media;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control, Communications and Signal Processing, 2004. First International Symposium on
  • Print_ISBN
    0-7803-8379-6
  • Type

    conf

  • DOI
    10.1109/ISCCSP.2004.1296516
  • Filename
    1296516