Title :
Extracting line features from images of business forms and tables
Author_Institution :
Software Res. Center, Ricoh Corp., Santa Clara, CA, USA
fDate :
30 Aug-3 Sep 1992
Abstract :
Business forms and tables are special document classes typically used to collect or distribute data; they are characterized by the presence of horizontal and vertical lines that delimit the usable space. The paper describes an algorithm that identifies these lines in binary digital images. This algorithm can be used to separate text from graphics before applying optical character recognition, or as a feature extractor in a form classification system. The approach presented differs from exiting vectorization, line extraction, and text-graphics separation methods, in that it focuses exclusively on the recognition of horizontal and vertical lines
Keywords :
document image processing; feature extraction; optical character recognition; binary digital images; business form images; document image processing; feature extraction; form classification system; horizontal lines; line feature extraction; optical character recognition; vertical lines; Character recognition; Data mining; Digital images; Feature extraction; Graphics; Image analysis; Image coding; Image converters; Merging; Optical character recognition software;
Conference_Titel :
Pattern Recognition, 1992. Vol.III. Conference C: Image, Speech and Signal Analysis, Proceedings., 11th IAPR International Conference on
Conference_Location :
The Hague
Print_ISBN :
0-8186-2920-7
DOI :
10.1109/ICPR.1992.202008