Title :
Real-Time Document Image Retrieval for a 10 Million Pages Database with a Memory Efficient and Stability Improved LLAH
Author :
Takeda, Kazutaka ; Kise, Koichi ; Iwamura, Masakazu
Author_Institution :
Dept. of CSIS, Osaka Prefecture Univ., Sakai, Japan
Abstract :
This paper presents a real-time document image retrieval method for large-scale databases based on Locally Likely Arrangement Hashing (LLAH). In general, scaling up a database increases the memory required and lowers retrieval accuracy because the features lack sufficient discrimination power. To solve these problems, we propose three improvements: reducing memory by sampling feature points, improving discrimination power by increasing the number of feature dimensions, and stabilizing features by reducing redundancy. Experimental results confirm that the proposed method achieves a 50% memory reduction, 99.4% accuracy, and a 38 ms processing time for a database of 10 million pages.
Keywords :
cryptography; database management systems; document image processing; image retrieval; redundancy; storage management; 10 million pages database; feature point sampling; insufficient discrimination power; large-scale database; locally likely arrangement hashing; memory reduction; real-time document image retrieval; stability improved LLAH; stabilizing feature; Accuracy; Cameras; Feature extraction; Image retrieval; Memory management; Real time systems; Document image retrieval; LLAH; Large-scale database; Real-time 10 million pages processing;
Conference_Title :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2011.213