• DocumentCode
    1582463
  • Title

    A document retrieval method from handwritten characters based on OCR and character shape information

  • Author

    Kameshiro, Taizo ; Hirano, Takashi ; Okada, Yasuhiro ; Yoda, Fumio

  • Author_Institution
    Inf. Technol. R&D Center, Mitsubishi Electr. Corp., Kanagawa, Japan
  • fYear
    2001
  • fDate
    6/23/1905 12:00:00 AM
  • Firstpage
    597
  • Lastpage
    601
  • Abstract
    It is a difficult task to create a large database of electronic documents from paper documents. In order to search the database for an image document, it is necessary for general electronic filing systems to convert the document into texts using OCR. However, the system cannot retrieve documents that do not contain correct character codes. We (1999) had previously proposed a document retrieval method that reduces false drops and false alarms by using the "shape-feature" technique that describes the outline of the character\´s shape. We now apply this method to handwritten Japanese documents. Experimental results reveal that our method has a high recall rate of 88.8% compared to the conventional methods (69.2%: text matching, 78.3%: candidate matching)
  • Keywords
    document image processing; feature extraction; handwritten character recognition; image retrieval; information retrieval; optical character recognition; OCR; character code collation; electronic document retrieval; handwritten Japanese character recognition; shape feature extraction; shape-feature collation; Character recognition; Image converters; Image databases; Image retrieval; Image storage; Information retrieval; Optical character recognition software; Prototypes; Shape; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    0-7695-1263-1
  • Type

    conf

  • DOI
    10.1109/ICDAR.2001.953859
  • Filename
    953859