Title :
Video genre verification using both acoustic and visual modes
Author :
Roach, Matthew ; Mason, John ; Xu, Li-Qun
Author_Institution :
Univ. of Wales, Swansea, UK
Abstract :
This paper reports on the verification of the video genre: sport, cartoon, news, commercial and music. Results for the two modes, acoustic and visual, and for combined modes show an average equal error rate (ERR) of 16%, 15% and 10%, respectively. These reflect verification accuracy and as such are believed to be the first of their kind; previously published work has focused on closed set identification, assuming the video is known to belong to one of a fixed set. The results also demonstrate the influence of the genre to be classified: the best performance for the visual mode has an EER of 4% (cartoons), and the best performance for the acoustic mode has EER of 0.6% (news). Finally, the combination of the modes presents a more consistent accuracy across the five genre with an EER of 10%.
Keywords :
audio signal processing; feature extraction; multimedia systems; video signal processing; acoustic mode; cartoon; closed set identification; equal error rate; multimedia material; music; news; sport; verification accuracy; video genre verification; visual mode; Acoustic measurements; Digital multimedia broadcasting; Error analysis; Internet; Labeling; Multimedia communication; Paper technology; Signal processing; TV; Video signal processing;
Conference_Titel :
Multimedia Signal Processing, 2002 IEEE Workshop on
Print_ISBN :
0-7803-7713-3
DOI :
10.1109/MMSP.2002.1203271