Title :
An approach toward binary quantization of color table images for document analysis
Author :
Suen, Hong-Ming ; Wang, Jhing-Fa
Author_Institution :
Inst. of Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Abstract :
Table processing is a subdomain of document analysis technology. It commonly focuses on understanding the well-organized information presented in a table and then making an entry for the desired items. Earlier research is all based on the assumption that data is presented in white-background/black-text (WB/BT) binary type. This condition, however, is not always held for color tables. Thus, a preprocessing stage is required to transform the color table into the binary format before the existing techniques can be employed to handle them. In this paper we propose a method for completing such a conversion. The underlying idea of our approach is based on location of background components (i.e., the image background and table cells) together with their colors. After these background regions are extracted, we can then convert the pixels belonging to the background regions to white and other pixels to black. Since our processing scheme needs no prior knowledge of the color style of the input tables, it has the ability to transform a wide fashion of color tables into the WB/BT binary-type, even though they are scanned in a severely skewed manner
Keywords :
document image processing; image colour analysis; quantisation (signal); background components; binary format; binary quantization; color table images; document analysis; preprocessing stage; table processing; well-organized information; Application software; Computational complexity; Document image processing; Image analysis; Image color analysis; Image converters; Image edge detection; Prototypes; Quantization; Text analysis;
Conference_Titel :
Information, Communications and Signal Processing, 1997. ICICS., Proceedings of 1997 International Conference on
Print_ISBN :
0-7803-3676-3
DOI :
10.1109/ICICS.1997.647145