DocumentCode :
1582463
Title :
A document retrieval method from handwritten characters based on OCR and character shape information
Author :
Kameshiro, Taizo ; Hirano, Takashi ; Okada, Yasuhiro ; Yoda, Fumio
Author_Institution :
Inf. Technol. R&D Center, Mitsubishi Electr. Corp., Kanagawa, Japan
fYear :
2001
fDate :
2001
Firstpage :
597
Lastpage :
601
Abstract :
Creating a large database of electronic documents from paper documents is a difficult task. To make an image document searchable, general electronic filing systems must convert it into text using OCR. However, such a system cannot retrieve documents whose recognized text does not contain the correct character codes. We previously proposed (1999) a document retrieval method that reduces false drops and false alarms by using a "shape-feature" technique that describes the outline of each character's shape. We now apply this method to handwritten Japanese documents. Experimental results show that our method achieves a high recall rate of 88.8%, compared with 69.2% for text matching and 78.3% for candidate matching, the conventional methods.
Keywords :
document image processing; feature extraction; handwritten character recognition; image retrieval; information retrieval; optical character recognition; OCR; character code collation; electronic document retrieval; handwritten Japanese character recognition; shape feature extraction; shape-feature collation; Character recognition; Image converters; Image databases; Image retrieval; Image storage; Information retrieval; Optical character recognition software; Prototypes; Shape; Spatial databases;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
Type :
conf
DOI :
10.1109/ICDAR.2001.953859
Filename :
953859