DocumentCode :
485307
Title :
Research and application of audio feature in compressed domain
Author :
Liaoyu Chang ; Xiaoqing Yu ; Haiying Tan ; Wanggen Wan
Author_Institution :
Sch. of Commun. & Inf. Eng., Shanghai Univ., Shanghai
fYear :
2007
fDate :
12-14 Dec. 2007
Firstpage :
390
Lastpage :
393
Abstract :
In this paper, by analyzing audio features in compressed domain based on audio encoding/decoding theory, we investigate the feature extraction directly from MP3 (MPEGl-layer3) compressed data stream and propose how to calculate these features such as RMS (root mean squared), SC (spectral centroid), BER (band energy ratio), BW (band width) and MFCC (Mel-frequency cepstral coefficients) from the spectral information available in the decoding stage. Also, the experiments are conducted and the results are analyzed to show the application of some aforementioned features. All the work conducted is for the purpose of laying a foundation for realizing audio information classification, retrieval and recognition in MP3 audio format.
Keywords :
audio coding; data compression; decoding; feature extraction; information retrieval; mean square error methods; MP3; MPEGl-layer3; Mel-frequency cepstral coefficients; audio encoding-decoding theory; audio feature; audio information retrieval; audio recognition; band energy ratio; domain compressibility; feature extraction; information classification; root mean square; spectral centroid; MFCC; audio feature; compressed domain; encoding/decoding;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Wireless, Mobile and Sensor Networks, 2007. (CCWMSN07). IET Conference on
Conference_Location :
Shanghai
ISSN :
0537-9989
Print_ISBN :
978-0-86341-836-5
Type :
conf
Filename :
4786220
Link To Document :
بازگشت