Automatic identification of text in digital video key frames

Author

Li, Huiping ; Doermann, David

Author_Institution

Inst. for Adv. Comput. Studies, Maryland Univ., College Park, MD, USA

Volume

1

fYear

1998

fDate

16-20 Aug 1998

Firstpage

129

Abstract

Scene and graphic text can provide important supplemental index information in video sequences. In this paper we address the problem automatically identifying text regions in digital video key frames. The text contained in video frames is typically very noisy because it is aliased and/or digitized at a much lower resolution than typical document images, making identification, extraction and recognition difficult. The proposed method is based on the use of a hybrid wavelet/neural network segmenter on a series of overlapping small windows to classify regions which contain text. To detect text over a wide range of font sizes, the method is applied to a pyramid of images and the regions identified at each level are integrated

Keywords

image segmentation; image sequences; neural nets; noise; video signal processing; wavelet transforms; aliasing; automatic text identification; digital video key frames; font sizes; graphic text; hybrid wavelet/neural network segmenter; image pyramid; noise; scene text; supplemental index information; text region identification; video sequences; Computer graphics; Data mining; Feature extraction; Image recognition; Image resolution; Image segmentation; Laboratories; Layout; Motion pictures; Neural networks;

fLanguage

English

Publisher

ieee

Conference_Titel

Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on

Conference_Location

Brisbane, Qld.

ISSN

1051-4651

Print_ISBN

0-8186-8512-3

Type

conf

DOI

10.1109/ICPR.1998.711097

Filename

711097