DocumentCode :
3023060
Title :
Document image retrieval based on density distribution feature and key block feature
Author :
Liu, Hong ; Feng, Suoqian ; Zha, Hongbin ; Liu, Xueping
Author_Institution :
Nat. Lab. on Machine Perception, Peking Univ., China
fYear :
2005
fDate :
29 Aug.-1 Sept. 2005
Firstpage :
1040
Abstract :
Document image retrieval is an important part of many document image processing systems such as paperless office systems, digital libraries and so on. Its task is to help users find out the most similar document images from a document image database. For developing a system of document image retrieval among different resolutions, different formats document images with hybrid characters of multiple languages, a new retrieval method based on document image density distribution features and key block features is proposed in this paper. Firstly, the density distribution and key block features of a document image are defined and extracted based on documents´ print-core. Secondly, the candidate document images are attained based on the density distribution features. Thirdly, to improve reliability of the retrieval results, a confirmation procedure using key block features is applied to those candidates. Experimental results on a large scale document image database, which contains 10385 document images, show that the proposed method is efficient and robust to retrieve different kinds of document images in real time.
Keywords :
document image processing; image retrieval; visual databases; density distribution feature; document image database; document image processing systems; document image retrieval; key block feature; Data mining; Document image processing; Image databases; Image matching; Image resolution; Image retrieval; Information retrieval; Large-scale systems; Optical character recognition software; Software libraries;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN :
1520-5263
Print_ISBN :
0-7695-2420-6
Type :
conf
DOI :
10.1109/ICDAR.2005.91
Filename :
1575702
Link To Document :
بازگشت