DocumentCode :
2281112
Title :
Sound analysis using MPEG compressed audio
Author :
Tzanetakis, George ; Cook, Perry
Author_Institution :
Dept. of Comput. Sci., Princeton Univ., NJ, USA
Volume :
2
fYear :
2000
fDate :
2000
Abstract :
There is a huge amount of audio data available that is compressed using the MPEG audio compression standard. Sound analysis is based on the computation of short time feature vectors that describe the instantaneous spectral content of the sound. An interesting possibility is the calculation of features directly from compressed data. Since the bulk of the feature calculation is performed during the encoding stage this process has a significant performance advantage if the available data is compressed. Combining decoding and analysis in one stage is also very important for audio streaming applications. In this paper, we describe the calculation of features directly from MPEG audio compressed data. Two of the basic processes of analyzing sound are: segmentation and classification. To illustrate the effectiveness of the calculated features we have implemented two case studies: a general audio segmentation algorithm and a music/speech classifier. Experimental data is provided to show that the results obtained are comparable with sound analysis algorithms working directly with audio samples
Keywords :
audio coding; data compression; feature extraction; music; pattern classification; MPEG compressed audio; audio compression; audio data; audio segmentation; audio streaming; classification; decoding; instantaneous spectral content; music/speech classifier; short time feature vectors; sound analysis; Audio compression; Auditory system; Computer science; Humans; Multiple signal classification; Performance analysis; Speech; Streaming media; Transform coding; Video compression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.859071
Filename :
859071
Link To Document :
بازگشت