DocumentCode :
2695145
Title :
Parallel model combination and word recognition in soccer audio
Author :
Longton, Jack H. ; Jackson, Philip J B
Author_Institution :
Centre for Vision, Speech & Signal Process. (CVSSP), Surrey Univ., Guildford
fYear :
2008
fDate :
June 23 2008-April 26 2008
Firstpage :
1465
Lastpage :
1468
Abstract :
Audio from broadcast soccer can be used for identifying highlights from the game. Audio cues derived from these sources provide valuable information about game events, as can the detection of key words used by the commentators. In this paper we interpret the feasibility of incorporating both commentator word recognition and information about the additive background noise in an HMM structure. A limited set of audio cues, which have been extracted from data collected from the 2006 FIFA World Cup, are used to create an extension to the Aurora-2 database. The new database is then tested with various PMC models and compared to the standard baseline, clean and multi-condition training methods. It is found that incorporating SNR and noise type information into the PMC process is beneficial to recognition performance.
Keywords :
audio databases; database indexing; hidden Markov models; speech recognition; 2006 FIFA World Cup; Aurora-2 database; HMM structure; PMC models; additive background noise; audio cues; audio indexing; broadcast soccer; commentator word recognition; game events; key word detection; parallel model combination; soccer audio; Background noise; Bridges; Databases; Hidden Markov models; Layout; Microphones; Noise level; Speech enhancement; Speech recognition; Testing; Audio indexing; HMM; soccer;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2008 IEEE International Conference on
Conference_Location :
Hannover
Print_ISBN :
978-1-4244-2570-9
Electronic_ISBN :
978-1-4244-2571-6
Type :
conf
DOI :
10.1109/ICME.2008.4607722
Filename :
4607722
Link To Document :
بازگشت