DocumentCode :
1583043
Title :
Region segmentation for table image with unknown complex structure
Author :
Tsuruoka, Shinji ; Takao, Kensuke ; Tanaka, Toru ; Yoshikawa, Tomohiro ; Shinogi, Tsuyoshi
Author_Institution :
Dept. of Electr. & Electr. Eng., Mie Univ., Japan
fYear :
2001
fDate :
6/23/1905 12:00:00 AM
Firstpage :
709
Lastpage :
713
Abstract :
In this paper, we describe a system of region segmentation and conversion into an HTML file for an unknown machine-printed table image. Ruled lines delimit some cells of the table, and omitted ruled lines also delimit other cells. We consider a table analysis system for both types of table cell. First, our system segments a table by means of the ruled lines into some regions. Secondly, these segmented regions are further segmented into cells by the omitted ruled lines that are indicators (such as numerals and characters). The cells include several character lines, and our system can convert a table of unknown complex structure into an HTML file. Also, we confirm the effectiveness of our region segmentation method for various kinds of tables with omitted ruled lines by computer experiments
Keywords :
document image processing; electronic data interchange; hypermedia markup languages; image segmentation; HTML file; character lines; complex structure; data conversion; numerals; omitted ruled lines; region segmentation; table analysis system; table cells; unknown machine-printed table image; Books; Character recognition; Data mining; Digital images; HTML; Image converters; Image segmentation; Optical character recognition software; TV; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
Type :
conf
DOI :
10.1109/ICDAR.2001.953882
Filename :
953882
Link To Document :
بازگشت