DocumentCode
1582463
Title
A document retrieval method from handwritten characters based on OCR and character shape information
Author
Kameshiro, Taizo ; Hirano, Takashi ; Okada, Yasuhiro ; Yoda, Fumio
Author_Institution
Inf. Technol. R&D Center, Mitsubishi Electr. Corp., Kanagawa, Japan
fYear
2001
fDate
2001
Firstpage
597
Lastpage
601
Abstract
Creating a large database of electronic documents from paper documents is a difficult task. To make an image document searchable in the database, general electronic filing systems must convert the document into text using OCR. However, such a system cannot retrieve documents whose OCR output does not contain the correct character codes. We previously proposed (1999) a document retrieval method that reduces false drops and false alarms by using a "shape-feature" technique that describes the outline of the character's shape. We now apply this method to handwritten Japanese documents. Experimental results show that our method achieves a recall rate of 88.8%, higher than the conventional methods (text matching: 69.2%; candidate matching: 78.3%).
Keywords
document image processing; feature extraction; handwritten character recognition; image retrieval; information retrieval; optical character recognition; OCR; character code collation; electronic document retrieval; handwritten Japanese character recognition; shape feature extraction; shape-feature collation; Character recognition; Image converters; Image databases; Image retrieval; Image storage; Information retrieval; Optical character recognition software; Prototypes; Shape; Spatial databases
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location
Seattle, WA
Print_ISBN
0-7695-1263-1
Type
conf
DOI
10.1109/ICDAR.2001.953859
Filename
953859