Title :
3 classes segmentation for analysis of football audio sequences
Author :
Lefèvre, S. ; Maillard, B. ; Vincent, N.
Author_Institution :
Lab. d'Informatique, Univ. de Tours, France
Abstract :
We address the segmentation of audio data for the analysis of football audio/video sequences. Audio data is divided into short sequences (typically one second or half a second long), which are classified into three classes (speaker, crowd and referee whistle). Each sequence can then be further analysed depending on the class it belongs to. Several methods for segmenting audio data are presented. First, simple techniques for segmentation into two classes are reviewed. Motivated by the limitations of these approaches, a method based on cepstral analysis is detailed. Next, we present two more complex methods for three-class segmentation. The first is based on hidden Markov models, whereas the second combines a C-mean classifier with multidimensional hidden Markov models.
Keywords :
audio signal processing; cepstral analysis; hidden Markov models; sequences; signal classification; C-mean classifier; audio data segmentation; crowd sequence; football audio sequences; multidimensional HMM; referee whistle sequence; signal segmentation; speaker sequence; video sequences; Cepstral analysis; Event detection; Frequency; Hidden Markov models; Indexing; Multidimensional systems; Multimedia systems; Performance analysis; Signal analysis; Video sequences
Conference_Title :
Digital Signal Processing, 2002. DSP 2002. 2002 14th International Conference on
Print_ISBN :
0-7803-7503-3
DOI :
10.1109/ICDSP.2002.1028253