• DocumentCode
    900484
  • Title

    A generic audio classification and segmentation approach for multimedia indexing and retrieval

  • Author

    Kiranyaz, Serkan ; Qureshi, Ahmad Farooq ; Gabbouj, Moncef

  • Author_Institution
    Inst. of Signal Process., Tampere Univ. of Technol., Finland
  • Volume
    14
  • Issue
    3
  • fYear
    2006
  • fDate
    5/1/2006 12:00:00 AM
  • Firstpage
    1062
  • Lastpage
    1081
  • Abstract
    We focus the attention on the area of generic and automatic audio classification and segmentation for audio-based multimedia indexing and retrieval applications. In particular, we present a fuzzy approach toward hierarchic audio classification and global segmentation framework based on automatic audio analysis providing robust, bi-modal, efficient and parameter invariant classification over global audio segments. The input audio is split into segments, which are classified as speech, music, fuzzy or silent. The proposed method minimizes critical errors of misclassification by fuzzy region modeling, thus increasing the efficiency of both pure and fuzzy classification. The experimental results show that the critical errors are minimized and the proposed framework significantly increases the efficiency and the accuracy of audio-based retrieval especially in large multimedia databases.
  • Keywords
    audio databases; database indexing; fuzzy set theory; information retrieval; multimedia databases; automatic audio segmentation; fuzzy approach; fuzzy region modeling; generic audio classification; multimedia databases; multimedia indexing; multimedia retrieval; Audio coding; Content based retrieval; Digital audio players; Indexing; Information resources; Information retrieval; Multimedia databases; Music information retrieval; Robustness; Speech enhancement; Automatic audio classification and segmentation; fuzzy modeling; multimedia indexing and retrieval; perceptual rule-based approach;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TSA.2005.857573
  • Filename
    1621218