Title :
Commentator´s Speech Extraction in Audio Stream of Sports Games
Author :
Lu, Li ; Ge, Fengpei ; Zhao, Qingwei ; Yan, Yonghong
Author_Institution :
ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
Abstract :
This paper proposes a method to deal with the problem of extracting commentator´s speech in audio stream of live sports games. First, a two-pass metric-based audio segmentation module is developed to segment the audio stream into short ones with homogeneous acoustic features. Then a model-based classification module is adopted to extract the speech segments. For robust audio classification, various audio features have been used in this paper. Finally, a music scene analysis (Music-CASA) method is adopted to remove the speech in the advertisements with minimum loss of commentator´s speech. By integrating all the techniques, an average F value of 94.79% is achieved in the commentator´s speech extraction task evaluated on eleven games of six kinds of sports.
Keywords :
audio signal processing; audio streaming; music; signal classification; speech processing; sport; audio stream; commentator speech extraction; homogeneous acoustic features; live sports games; model-based classification module; music scene analysis method; robust audio classification; speech segments; two-pass metric-based audio segmentation module; Computer science; Data mining; Image analysis; Information retrieval; Robustness; Speech analysis; Speech recognition; Streaming media; Support vector machines; Technology management;
Conference_Titel :
Research Challenges in Computer Science, 2009. ICRCCS '09. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3927-0
Electronic_ISBN :
978-1-4244-5410-5
DOI :
10.1109/ICRCCS.2009.24