Title :
Identifying and understanding tabular material in compound documents
Author :
Laurentini, A. ; Viada, P.
Author_Institution :
Dipartimento di Automatica ed Inf., Politechnico di, Torino, Italy
fDate :
30 Aug-3 Sep 1992
Abstract :
Tables are important components of technical documents. This paper addresses the following problems: (i) identifying a tabular component in a scanned image of a compound document containing text, drawings, diagrams, etc.; (ii) understanding the content of the table in order to convert the table into electronic format. As far as the authors are aware, the problems addressed are new. An algorithm for performing both the above tasks has been studied and implemented. Preliminary experimental results indicate satisfactory performance for many table lay-out styles
Keywords :
document image processing; image recognition; compound documents; electronic format; scanned image; table lay-out styles; tabular material; technical documents; Circuits; Computer aided manufacturing; Computer graphics; Computer industry; Document handling; Electronic publishing; Engineering drawings; Image converters; Image segmentation; Manufacturing industries;
Conference_Titel :
Pattern Recognition, 1992. Vol.II. Conference B: Pattern Recognition Methodology and Systems, Proceedings., 11th IAPR International Conference on
Conference_Location :
The Hague
Print_ISBN :
0-8186-2915-0
DOI :
10.1109/ICPR.1992.201803