Title :
Text/graphics separation using agent-based pyramid operations
Author :
Tan, Chew Lim ; Yuan, Bo ; Huang, Weihua ; Wang, Qian ; Zhang, Zheng
Author_Institution :
Dept. of Inf. Syst. & Comput. Sci., Nat. Univ. of Singapore, Singapore
Abstract :
This paper describes a document image analysis system using multiple agents working on a pyramid structure to separate text from graphics in the image. Text strings appear as different groupings of connected components at different image resolutions. As such, the pyramid structure, which is a multi-resolution image representation, provides a natural means of identifying and grouping of character strings in the document at different levels of resolution. The pyramid structure is also amenable to parallel processing, where multiple agents in the system can individually and concurrently look for groups of connected components at appropriate levels. The agent-based pyramid operations do not require expensive feature analysis among different connected components to detect text strings as found in other existing works
Keywords :
document image processing; image representation; image resolution; multi-agent systems; optical character recognition; agent-based pyramid operations; character strings; document image analysis; image resolution; multi-resolution image representation; multiple agents; parallel processing; text graphics separation; text strings; Computational efficiency; Computer graphics; Computer science; Data structures; Image analysis; Image representation; Image resolution; Labeling; Optical character recognition software; Parallel processing;
Conference_Titel :
Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
Conference_Location :
Bangalore
Print_ISBN :
0-7695-0318-7
DOI :
10.1109/ICDAR.1999.791751