• DocumentCode
    3335698
  • Title

    Nonexclusive audio segmentation and indexing as a pre-processor for audio information mining

  • Author

    Li, Francis F.

  • Author_Institution
    Sch. of Comput., Sci. & Eng., Univ. of Salford, Salford, UK
  • Volume
    03
  • fYear
    2013
  • fDate
    16-18 Dec. 2013
  • Firstpage
    1593
  • Lastpage
    1597
  • Abstract
    Much content related information can be extracted from recorded soundtracks, such as those of multimedia files. The soundtracks might be heuristically classified into three categories namely speech, music and ambient or event sounds. Research in the past focused on algorithms to classify audio clips in an exclusive manner. However, soundtracks from media content are often presented as overlapped mixtures of all these three types of sounds. Nonexclusive segmentation and indexing are therefore essential pre-processors for effective audio information mining and metadata generation. This paper emphasizes the importance of nonexclusive indexing and segmentation methods, identifies the challenges and proposes a universal architecture for nonexclusive segmentation and indexing as a pre-processor for audio information mining, metadata extraction and scene analysis. Related feature selection, pattern recognition and signal processing algorithms are presented and testing results discussed.
  • Keywords
    audio signal processing; data mining; indexing; meta data; signal classification; audio clip classification; audio information mining; event sound category; feature selection; media content; metadata extraction; metadata generation; multimedia files; music category; nonexclusive audio indexing; nonexclusive audio segmentation method; pattern recognition; pre-processors; recorded sound tracks; scene analysis; signal processing algorithms; speech category; Artificial neural networks; Feature extraction; Indexing; Multiple signal classification; Music; Speech; Training; Audio segmentation; Metadata; audio information mining; classification; content descriptor; indexing; scene analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image and Signal Processing (CISP), 2013 6th International Congress on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-4799-2763-0
  • Type

    conf

  • DOI
    10.1109/CISP.2013.6743930
  • Filename
    6743930