• DocumentCode
    3183559
  • Title

    A framework for audio analysis based on classification and temporal segmentation

  • Author

    Tzanetakis, George ; Cook, Perry

  • Author_Institution
    Dept. of Comput. Sci., Princeton Univ., NJ, USA
  • Volume
    2
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    61
  • Abstract
    Existing audio tools handle the increasing amount of computer audio data inadequately. The typical tape-recorder paradigm for audio interfaces is inflexible and time-consuming, especially for large data sets. On the other hand, completely automatic audio analysis and annotation is impossible using current techniques. Alternative solutions are semi-automatic user interfaces that let users interact with sound in flexible ways based on content. This approach offers significant advantages over manual browsing, annotation, and retrieval. Furthermore, it can be implemented using existing techniques for audio content analysis in restricted domains. This paper describes a framework for experimenting with, evaluating, and integrating such techniques. As a test of the architecture, some recently proposed techniques have been implemented and tested. In addition, a new method for temporal segmentation based on audio texture is described. This method is combined with audio analysis techniques and used for hierarchical browsing, classification, and annotation of audio files.
  • Keywords
    audio signal processing; speech-based user interfaces; annotation; audio analysis; audio interfaces; audio tools; classification; completely automatic audio analysis; computer audio data; framework; manual browsing; tape-recorder paradigm; temporal segmentation; Computer interfaces; Computer science; Humans; Information analysis; Information retrieval; Internet; Optical computing; Search engines; Testing; Web search
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    EUROMICRO Conference, 1999. Proceedings. 25th
  • Conference_Location
    Milan
  • ISSN
    1089-6503
  • Print_ISBN
    0-7695-0321-7
  • Type

    conf

  • DOI
    10.1109/EURMIC.1999.794763
  • Filename
    794763