Title :
On virtual partitioning of large dictionaries for contextual post-processing to improve character recognition
Author :
Hoch, Rainer ; Kieninger, Thomas
Author_Institution :
German Res. Center for Artificial Intelligence, Kaiserslautern, Germany
Abstract :
A new approach to the partitioning of large dictionaries by virtual views is presented. The basic idea is to employ additional knowledge sources from text recognition and text analysis for fast dictionary look-up, pruning the search space through static or dynamic views. The heart of the system is a redundant hashing technique, which uses a set of hash functions to deal with noisy input efficiently. The system currently comprises two main components: the dictionary generator and the dictionary controller. While the dictionary generator initially builds the system from profiles and source dictionaries, the controller allows the flexible integration of different search heuristics. Results show that the system achieves a respectable speed-up of dictionary access time.
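The redundant hashing idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the hash functions, so the three key functions below (`key_prefix`, `key_suffix`, `key_length_first`), the class `RedundantHashDictionary`, and the use of part-of-speech tags as a stand-in for a "view" are all assumptions made for illustration. Each word is stored under several keys, so a recognition error that corrupts one key can still leave another key intact.

```python
from collections import defaultdict

# Hypothetical key functions. The abstract does not specify the actual
# hash functions, so these are illustrative assumptions only.
def key_prefix(word):
    return ("prefix", word[:3])

def key_suffix(word):
    return ("suffix", word[-3:])

def key_length_first(word):
    return ("len_first", len(word), word[:1])

KEY_FUNCTIONS = (key_prefix, key_suffix, key_length_first)


class RedundantHashDictionary:
    """Stores every word under several keys (redundant hashing)."""

    def __init__(self, entries):
        # entries: iterable of (word, tag); the tag is an assumed stand-in
        # for whatever knowledge source defines a view.
        self.buckets = defaultdict(set)
        self.tags = {}
        for word, tag in entries:
            self.tags[word] = tag
            for key_fn in KEY_FUNCTIONS:
                self.buckets[key_fn(word)].add(word)

    def candidates(self, noisy_word, view=None):
        """Union of all buckets hit by the noisy word, optionally
        restricted to the set of tags given in `view`."""
        found = set()
        for key_fn in KEY_FUNCTIONS:
            found |= self.buckets.get(key_fn(noisy_word), set())
        if view is not None:
            found = {w for w in found if self.tags[w] in view}
        return found


demo = RedundantHashDictionary([
    ("recognition", "noun"),
    ("recognize", "verb"),
    ("dictionary", "noun"),
    ("partition", "noun"),
])
```

In this sketch, looking up the misrecognized string "vecognition" misses the prefix key but still retrieves "recognition" through the suffix key, and passing `view={"noun"}` drops "recognize" from the candidates for "recoqnition". A real system would rank the surviving candidates, for example by edit distance, which is omitted here.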
Keywords :
character recognition; file organisation; glossaries; natural languages; search problems; table lookup; very large databases; additional knowledge sources; contextual post-processing; dictionary access time; dictionary controller; dictionary generator; dynamic views; fast dictionary look-up; hash functions; large dictionaries; noisy input; profiles; redundant hashing technique; search heuristics; search space pruning; source dictionaries; speed-up; static views; text analysis; text recognition; virtual partitioning; virtual views; Artificial intelligence; Character recognition; Control systems; Dictionaries; Heart; Pattern matching; Prototypes; Robustness; Text analysis; Text recognition
Conference_Title :
Proceedings of the Second International Conference on Document Analysis and Recognition, 1993
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
DOI :
10.1109/ICDAR.1993.395743