Title :
Extracting Captions in Complex Background from Videos
Author :
Liu, Xiaoqian ; Wang, Weiqiang ; Zhu, Tingshao
Author_Institution :
Chinese Acad. of Sci., Grad. Univ., Beijing, China
Abstract :
Captions in videos play a significant role for automatically understanding and indexing video content, since much semantic information is associated with them. This paper presents an effective approach to extracting captions from videos, in which multiple different categories of features (edge, color, stroke etc.) are utilized, and the spatio-temporal characteristics of captions are considered. First, our method exploits the distribution of gradient directions to decompose a video into a sequence of clips temporally, so that each clip contains a caption at most, which makes the successive extraction computation more efficient and accurate. For each clip, the edge and corner information are then utilized to locate text regions. Further, text pixels are extracted based on the assumption that text pixels in text regions always have homogeneous color, and their quantity dominates the region relative to non-text pixels with different colors. Finally, the segmentation results are further refined. The encouraging experimental results on 2565 characters have preliminarily validated our approach.
Keywords :
feature extraction; gradient methods; image colour analysis; image segmentation; video signal processing; color feature; edge feature; gradient direction distribution; homogeneous color; stroke feature; text pixel extraction; video caption extraction; video content indexing; video content understanding; video segmentation; Feature extraction; Image color analysis; Image edge detection; Image segmentation; Optical character recognition software; Pixel; Videos; captions; extracting; temporal features;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.790