DocumentCode :
2147565
Title :
Real-Time Document Image Retrieval for a 10 Million Pages Database with a Memory Efficient and Stability Improved LLAH
Author :
Takeda, Kazutaka ; Kise, Koichi ; Iwamura, Masakazu
Author_Institution :
Dept. of CSIS, Osaka Prefecture Univ., Sakai, Japan
fYear :
2011
fDate :
18-21 Sept. 2011
Firstpage :
1054
Lastpage :
1058
Abstract :
This paper presents a real-time document image retrieval method for a large-scale database with Locally Likely Arrangement Hashing (LLAH). In general, when a database is scaled up, a large amount of memory is required and retrieval accuracy drops due to insufficient discrimination power of features. To solve these problems, we propose three improvements: memory reduction by sampling feature points, improvement of discrimination power by increasing the number of feature dimensions, and stabilization of features by reducing redundancy. From the experimental results, we have confirmed that the proposed method realizes a 50% memory reduction and achieves 99.4% accuracy and 38 ms processing time for a database of 10 million pages.
Keywords :
cryptography; database management systems; document image processing; image retrieval; redundancy; storage management; 10 million pages database; feature point sampling; insufficient discrimination power; large-scale database; locally likely arrangement hashing; memory reduction; real-time document image retrieval; stability improved LLAH; stabilizing feature; Accuracy; Cameras; Feature extraction; Image retrieval; Memory management; Real time systems; Document image retrieval; LLAH; Large-scale database; Real-time 10 million pages processing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
ISSN :
1520-5363
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2011.213
Filename :
6065471