DocumentCode :
3183559
Title :
A framework for audio analysis based on classification and temporal segmentation
Author :
Tzanetakis, George ; Cook, Perry
Author_Institution :
Dept. of Comput. Sci., Princeton Univ., NJ, USA
Volume :
2
fYear :
1999
fDate :
1999
Firstpage :
61
Abstract :
Existing audio tools handle the increasing amount of computer audio data inadequately. The typical tape-recorder paradigm for audio interfaces is inflexible and time consuming, especially for large data sets. On the other hand, completely automatic audio analysis and annotation is impossible using current techniques. Alternative solutions are semi-automatic user interfaces that let users interact with sound in flexible ways based on content. This approach offers significant advantages over manual browsing, annotation and retrieval. Furthermore, it can be implemented using existing techniques for audio content analysis in restricted domains. This paper describes a framework for experimenting with, evaluating and integrating such techniques. As a test for the architecture, some recently proposed techniques have been implemented and tested. In addition, a new method for temporal segmentation based on audio texture is described. This method is combined with audio analysis techniques and used for hierarchical browsing, classification and annotation of audio files.
Keywords :
audio signal processing; speech-based user interfaces; annotation; audio analysis; audio interfaces; audio tools; classification; completely automatic audio analysis; computer audio data; framework; manual browsing; tape-recorder paradigm; temporal segmentation; Computer interfaces; Computer science; Humans; Information analysis; Information retrieval; Internet; Optical computing; Search engines; Testing; Web search;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
EUROMICRO Conference, 1999. Proceedings. 25th
Conference_Location :
Milan
ISSN :
1089-6503
Print_ISBN :
0-7695-0321-7
Type :
conf
DOI :
10.1109/EURMIC.1999.794763
Filename :
794763