Title :
Automatic identification of text in digital video key frames
Author :
Li, Huiping ; Doermann, David
Author_Institution :
Inst. for Adv. Comput. Studies, Maryland Univ., College Park, MD, USA
Abstract :
Scene and graphic text can provide important supplemental index information in video sequences. In this paper we address the problem automatically identifying text regions in digital video key frames. The text contained in video frames is typically very noisy because it is aliased and/or digitized at a much lower resolution than typical document images, making identification, extraction and recognition difficult. The proposed method is based on the use of a hybrid wavelet/neural network segmenter on a series of overlapping small windows to classify regions which contain text. To detect text over a wide range of font sizes, the method is applied to a pyramid of images and the regions identified at each level are integrated
Keywords :
image segmentation; image sequences; neural nets; noise; video signal processing; wavelet transforms; aliasing; automatic text identification; digital video key frames; font sizes; graphic text; hybrid wavelet/neural network segmenter; image pyramid; noise; scene text; supplemental index information; text region identification; video sequences; Computer graphics; Data mining; Feature extraction; Image recognition; Image resolution; Image segmentation; Laboratories; Layout; Motion pictures; Neural networks;
Conference_Titel :
Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on
Conference_Location :
Brisbane, Qld.
Print_ISBN :
0-8186-8512-3
DOI :
10.1109/ICPR.1998.711097