Title :
Text segmentation for automatic document processing
Author :
Mital, Dinesh P. ; Leng, Goh Wee
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Inst., Singapore
Abstract :
There has been a considerable interest in designing automatic systems that can scan a given paper document and store it on electronic media for easier storage, manipulation and access. Most documents contain graphics and images, in addition to text. Thus, the document image has to be segmented to identify text and image regions, so that appropriate techniques may be applied to those regions. We present a new technique for image segmentation in which text and image regions, in a given document image, are automatically identified. Technique is based on a differential-processing text extraction concept. The proposed technique is capable of analysing complex document image layouts. The document image is processed by using textural feature analysis. The results of the proposed method are presented with test images which demonstrate the robustness of the technique
Keywords :
document handling; document image processing; feature extraction; image segmentation; image texture; automatic document processing; automatic systems; differential processing text extraction; document access; document image layouts; document manipulation; document storage; electronic media; graphics; image regions; image segmentation; test images; text regions; textural feature analysis; Data mining; Graphics; Image analysis; Image converters; Image segmentation; Image texture analysis; Optical character recognition software; Storage automation; Technical drawing; Text analysis;
Conference_Titel :
Consumer Electronics, 1995., Proceedings of International Conference on
Conference_Location :
Rosemont, IL
Print_ISBN :
0-7803-2140-5
DOI :
10.1109/ICCE.1995.517923