DocumentCode :
2609496
Title :
Sports audio classification based on MFCC and GMM
Author :
Jiqing, Liu ; Yuan, Dong ; Jun, Huang ; Xianyu, Zhao ; Haila, Wang
Author_Institution :
Beijing Univ. of Posts & Telecommun., Beijing, China
fYear :
2009
fDate :
18-20 Oct. 2009
Firstpage :
482
Lastpage :
485
Abstract :
Audio segmentation and classification can provide useful information for multimedia content analysis. In this paper, we present a approach to segment and categorize the sports audio into speech, music and other environmental sounds for sports video classification and highlight detection. We investigate the performance of mel frequency cepstral coefficients (MFCC) in a Gaussian mixture model frame work, and compare it to traditional short-time energy and zero-crossing rate feature. We achieve a correct identification close to 90% on MFCC with its first and second derivatives.
Keywords :
Gaussian distribution; audio signal processing; cepstral analysis; multimedia computing; signal classification; speech recognition; GMM; Gaussian mixture model; MFCC; audio classification; audio segmentation; mel frequency cepstral coefficients; sports; Application software; Cepstral analysis; Filters; Mel frequency cepstral coefficient; Music; Signal processing; Speech analysis; Support vector machine classification; Support vector machines; Telecommunications; GMM; MFCC; audio classification; sports audio;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Broadband Network & Multimedia Technology, 2009. IC-BNMT '09. 2nd IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4590-5
Electronic_ISBN :
978-1-4244-4591-2
Type :
conf
DOI :
10.1109/ICBNMT.2009.5348520
Filename :
5348520
Link To Document :
بازگشت