• DocumentCode
    2011678
  • Title

    Toward Part-Based Document Image Decoding

  • Author

    Song, Wang ; Uchida, Seiichi ; Liwicki, Marcus

  • Author_Institution
    Kyushu Univ., Fukuoka, Japan
  • fYear
    2012
  • fDate
    27-29 March 2012
  • Firstpage
    266
  • Lastpage
    270
  • Abstract
    Document image decoding (DID) is a trial to understand the contents of a whole document without any reference information about font, language, etc. Typically, DID approaches assume the correct segmentation of the document and some a priori knowledge about the language or the script. Unfortunately, this assumption will not hold if we deal with various documents, such as documents with various sized fonts, camera-captured documents, free-layout documents, or historical documents. In this paper, we propose a part-based character identification method where no segmentation into characters is necessary and no a priori information about the document is needed. The approach clusters similar key points and groups frequent neighboring key point clusters. Then a second iteration is performed, i.e., the groups are again clustered and optionally pairs frequent group clusters are detected. Our first experimental results on multi font-size documents look already very promising. We could find nearly perfect correspondences between characters and detected group clusters.
  • Keywords
    cameras; document image processing; image coding; image segmentation; camera-captured documents; document segmentation; free-layout documents; historical documents; multifont-size documents; neighboring keypoint clusters; part-based character identification method; part-based document image decoding; Character recognition; Decoding; Feature extraction; Image segmentation; Robustness; Text analysis; Vectors; Document image decoding; part-based;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
  • Conference_Location
    Gold Cost, QLD
  • Print_ISBN
    978-1-4673-0868-7
  • Type

    conf

  • DOI
    10.1109/DAS.2012.90
  • Filename
    6195376