DocumentCode
3211689
Title
A rule-based system for document image segmentation
Author
Fisher, James L. ; Hinds, Stuart C. ; D´Amato, Donald P.
Author_Institution
Mitre Corp., McLean, VA, USA
Volume
i
fYear
1990
fDate
16-21 Jun 1990
Firstpage
567
Abstract
A rule-based system for automatically segmenting a document image into regions of text and nontext is presented. The initial stages of the system perform image enhancement functions such as adaptive thresholding, morphological processing, and skew detection and correction. The image segmentation process consists of smearing the original image via the run length smoothing algorithm, calculating the connected components locations and statistics, and filtering (segmenting) the image based on these statistics. The text regions can be converted (via an optical character reader) to a computer-searchable form, and the nontext regions can be extracted and preserved. The rule-based structure allows easy fine tuning of the algorithmic steps to produce robust rules, to incorporate additional tools (as they become available), and to handle special segmentation needs
Keywords
computerised pattern recognition; document image processing; knowledge based systems; statistical analysis; adaptive thresholding; computerised pattern recognition; document image segmentation; filtering; image enhancement; morphological processing; rule-based system; run length smoothing algorithm; skew detection; Filtering algorithms; Image converters; Image enhancement; Image segmentation; Knowledge based systems; Optical computing; Optical filters; Optical tuning; Smoothing methods; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 1990. Proceedings., 10th International Conference on
Conference_Location
Atlantic City, NJ
Print_ISBN
0-8186-2062-5
Type
conf
DOI
10.1109/ICPR.1990.118166
Filename
118166
Link To Document