• DocumentCode
    3021893
  • Title

    A fundamental study of output translation from layout recognition and semantic understanding system for mathematical formulae

  • Author

    Takiguchi, Yusuke ; Okada, Minoru ; Miyake, Yasuji

  • Author_Institution
    Graduate Sch. of Inf., Production & Syst., Waseda Univ., Kitakyushu, Japan
  • fYear
    2005
  • fDate
    29 Aug.-1 Sept. 2005
  • Firstpage
    745
  • Abstract
    In this paper we propose an implementation method for an off-line layout recognition and semantic understanding system for mathematical formulae. The system aims at higher-order coding of mathematical formulae in scientific articles as an application of document analysis. It has two intermediate output codes: a layout tree, which holds the geometric structure of the formula and the recognized character codes of its symbols, and a semantic tree, which holds the semantics of the symbols. From the layout tree and the semantic tree produced by layout recognition and semantic understanding, the translation stage can generate various useful outputs. This paper mainly describes implementation techniques for two of them: LaTeX source output for high-quality typesetting, and gnuplot script output for plotting a function as a visual representation.
  • Keywords
    character recognition; text analysis; trees (mathematics); LaTeX; document analysis; formula geometrical structure; layout recognition; layout tree; mathematical formula; semantic tree; semantic understanding; Books; Character recognition; Dictionaries; Educational institutions; Image processing; Image recognition; Production systems; Prototypes; Text analysis; Typesetting
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
  • ISSN
    1520-5263
  • Print_ISBN
    0-7695-2420-6
  • Type

    conf

  • DOI
    10.1109/ICDAR.2005.10
  • Filename
    1575644