DocumentCode
2471106
Title
Automatic invoice interpretation: invoice structure analysis
Author
Kosiba, David A. ; Kasturi, Rangachar
Author_Institution
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
Volume
3
fYear
1996
fDate
25-29 Aug 1996
Firstpage
721
Abstract
We propose a method of invoice document structure analysis that provides a means to extract the relevant information from an unknown invoice. Our method uses a combination of textual and graphical processing by analyzing the line and line intersection features in the document as well as searching for possible keywords such as item number, quantity, total, etc. Valid keyword search regions are determined by a specialized connected-component analysis before any OCR is performed. The results of the the keyword search and the line analysis are combined to give the search regions for extracting the relevant data contained in the invoice. This analysis will become part of a larger invoice interpretation system which is currently under development
Keywords
business forms; document image processing; edge detection; image segmentation; invoicing; automatic invoice interpretation; connected-component analysis; graphical processing; invoice document structure analysis; item number; keyword search regions; line intersection features; textual processing; Computer science; Data mining; Graphics; Keyword search; Marine vehicles; Optical character recognition software; Topology;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 1996., Proceedings of the 13th International Conference on
Conference_Location
Vienna
ISSN
1051-4651
Print_ISBN
0-8186-7282-X
Type
conf
DOI
10.1109/ICPR.1996.547263
Filename
547263
Link To Document