DocumentCode :
3062336
Title :
Extracting line features from images of business forms and tables
Author :
Pizano, Arturo
Author_Institution :
Software Res. Center, Ricoh Corp., Santa Clara, CA, USA
fYear :
1992
fDate :
30 Aug-3 Sep 1992
Firstpage :
399
Lastpage :
403
Abstract :
Business forms and tables are special document classes typically used to collect or distribute data; they are characterized by the presence of horizontal and vertical lines that delimit the usable space. The paper describes an algorithm that identifies these lines in binary digital images. This algorithm can be used to separate text from graphics before applying optical character recognition, or as a feature extractor in a form classification system. The approach presented differs from exiting vectorization, line extraction, and text-graphics separation methods, in that it focuses exclusively on the recognition of horizontal and vertical lines
Keywords :
document image processing; feature extraction; optical character recognition; binary digital images; business form images; document image processing; feature extraction; form classification system; horizontal lines; line feature extraction; optical character recognition; vertical lines; Character recognition; Data mining; Digital images; Feature extraction; Graphics; Image analysis; Image coding; Image converters; Merging; Optical character recognition software;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 1992. Vol.III. Conference C: Image, Speech and Signal Analysis, Proceedings., 11th IAPR International Conference on
Conference_Location :
The Hague
Print_ISBN :
0-8186-2920-7
Type :
conf
DOI :
10.1109/ICPR.1992.202008
Filename :
202008
Link To Document :
بازگشت