• DocumentCode
    780640
  • Title

    Layout recognition of multi-kinds of table-form documents

  • Author

    Watanabe, Toyohide ; Luo, Qin ; Sugie, Noboru

  • Author_Institution
    Dept. of Inf. Eng., Nagoya Univ., Japan
  • Volume
    17
  • Issue
    4
  • fYear
    1995
  • fDate
    4/1/1995 12:00:00 AM
  • Firstpage
    432
  • Lastpage
    445
  • Abstract
    Many approaches have reported that knowledge-based layout recognition methods are very successful in classifying the meaningful data from document images automatically. However, these approaches are applicable to only the same kind of documents because they are based on the paradigm that specifies the structure definition information in advance so as to be able to analyze a particular class of documents intelligently. In this paper, the authors propose a method to recognize the layout structures of multi-kinds of table-form document images. For this purpose, the authors introduce a classification tree to manage the relationships among different classes of layout structures. The authors´ recognition system has two modes: layout knowledge acquisition and layout structure recognition. In the layout knowledge acquisition mode, table-form document images are distinguished according to this. Classification tree and then the structure description trees which specify the logical structures of table-form documents are generated automatically. While, in the layout structure recognition mode, individual item fields in the table-form document images are extracted and classified successfully by searching the classification tree and interpreting the structure description tree
  • Keywords
    document image processing; image classification; knowledge acquisition; knowledge based systems; classification tree; document images; knowledge-based layout recognition methods; layout knowledge acquisition; layout structure recognition; structure definition information; structure description tree; table-form documents; Classification tree analysis; Computer Society; Helium; Image recognition; Information analysis; Intelligent structures; Knowledge acquisition; Knowledge management; Notice of Violation;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/34.385976
  • Filename
    385976