Title :
Edge-based method for text detection from complex document images
Author :
Pietikäinen, Matti ; Okun, Oleg
Author_Institution :
Dept. of Electr. Eng., Oulu Univ., Finland
fDate :
2001
Abstract :
Detecting text in documents where it is embedded in complex colored and textured backgrounds is a very challenging problem. In this paper, we propose a simple texture-based approach that uses edge information for this task. The performance of our method is compared to that of a method based on the discrete cosine transform, which was recently proposed by Y. Zhong et al. (2000) for text localization in compressed digital video. In our experiments, both methods performed about equally well for small-sized text, but our method was better in the case of large-sized text. The principal advantage of our approach is that, in addition to the text detection problem, the same edge representation can also be used for other image interpretation tasks.
Keywords :
character recognition; feature extraction; image representation; complex document images; discrete cosine transform; edge information; edge representation; edge-based method; image interpretation; text detection; text detection problem; texture-based approach; Image coding; Image edge detection; Image segmentation; Intelligent systems; Machine intelligence; Machine vision; Periodic structures; Robustness; Video compression; Web pages;
Conference_Titel :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
DOI :
10.1109/ICDAR.2001.953800