DocumentCode :
2781905
Title :
Sports audio segmentation and classification
Author :
Huang, Jun ; Dong, Yuan ; Liu, Jiqing ; Chengyu Dong ; Wang, Haila
Author_Institution :
Beijing Univ. of Posts & Telecommun., Beijing, China
fYear :
2009
fDate :
6-8 Nov. 2009
Firstpage :
379
Lastpage :
383
Abstract :
The audio stream is an important component of a sports video. In this paper, we present a system for audio segmentation and classification, which can segment and classify the sports audio stream into speech, non-speech very well. The novel point in our research is that we apply the segmentation and clustering method which is often used in speaker diarization system for broadcast news to the analysis of sports videos. After the segmentation and Bayesian Information Criterion (BIC) clustering is performed, Gaussian Mixture Model (GMM) is used in the classifier to identify the kind of sound for each segment. Experiments on a database composed of 6 hour audio stream in the Eurosport TV program show that the average accuracy can reach 87.3% on segmentation and classification. This research is very useful for analyzing the content of sports videos in detail.
Keywords :
audio streaming; speech recognition; Bayesian information criterion clustering; Gaussian mixture model; audio stream; speaker diarization system; sports audio segmentation; Bayesian methods; Entropy; Loudspeakers; Merging; Music; Robustness; Speech processing; Streaming media; Telecommunications; Videos; GMM; audio segmentation and classification; content analysis; sports audio;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Network Infrastructure and Digital Content, 2009. IC-NIDC 2009. IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4898-2
Electronic_ISBN :
978-1-4244-4900-6
Type :
conf
DOI :
10.1109/ICNIDC.2009.5360872
Filename :
5360872
Link To Document :
بازگشت