Title :
Document image matching based on component blocks
Author :
Peng, Hanchuan ; Long, Fuhui ; Siu, Wan-chi ; Chi, Zheru ; Feng, David Dagan
Author_Institution :
Dept. of Electron. Eng., Hong Kong Polytech., Kowloon, China
Abstract :
Document image matching is the key technique for document registration and retrieval. A new matching algorithm based on the document component block list and component block tree is proposed. Our method makes effective use of both the local information of each page block and the global information of the page layout, and it is robust to image distortion, filled-in text, and noise. The algorithm is then refined and applied to automatic data extraction from column forms. A demonstration software package has been developed.
Keywords :
document image processing; feature extraction; image matching; image registration; image retrieval; software packages; automatic data extraction; column forms; component blocks; data structures; document component block list; document component block tree; document image matching; document registration; document retrieval; filled-in text; global information; image distortion robustness; image matching algorithm; local information; noise; page block; page layout; software package; Biomedical engineering; Biomedical signal processing; Data mining; Data structures; Image matching; Image retrieval; Laboratories; Noise robustness; Signal processing algorithms; Sorting
Conference_Title :
Image Processing, 2000. Proceedings. 2000 International Conference on
Conference_Location :
Vancouver, BC, Canada
Print_ISBN :
0-7803-6297-7
DOI :
10.1109/ICIP.2000.899505