DocumentCode :
2151175
Title :
Soundtrack classification by transient events
Author :
Cotton, Courtenay V. ; Ellis, Daniel P W ; Loui, Alexander C.
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
473
Lastpage :
476
Abstract :
We present a method for video classification based on information in the soundtrack. Unlike previous approaches which describe the audio via statistics of mel-frequency cepstral coefficient (MFCC) features calculated on uniformly-spaced frames, we investigate an approach to focusing our representation on audio transients corresponding to sound-track events. These event-related features can reflect the "foreground" of the soundtrack and capture its short-term temporal structure better than conventional frame-based statistics. We evaluate our method on a test set of 1873 YouTube videos labeled with 25 semantic concepts. Retrieval results based on transient features alone are comparable to an MFCC-based system, and fusing the two representations achieves a relative improvement of 7.5% in mean average precision (MAP).
Keywords :
acoustic signal processing; audio signal processing; cepstral analysis; signal classification; statistical analysis; video databases; video retrieval; MAP; MFCC features; MFCC-based system; YouTube videos; audio transients; conventional frame-based statistics; event-related features; mean average precision; mel-frequency cepstral coefficient fetures; retrieval results; short-term temporal structure; sound-track events; soundtrack classification; soundtrack information; transient events; transient features; uniformly-spaced frames; video classification; Feature extraction; Gain control; Histograms; Mel frequency cepstral coefficient; Semantics; Time frequency analysis; Transient analysis; Acoustic signal processing; Multimedia databases; Video indexing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5946443
Filename :
5946443
Link To Document :
بازگشت