Title :
A fundamental study of output translation from layout recognition and semantic understanding system for mathematical formulae
Author :
Takiguchi, Yusuke ; Okada, Minoru ; Miyake, Yasuji
Author_Institution :
Graduate Sch. of Inf., Production & Syst., Waseda Univ., Kitakyushu, Japan
fDate :
29 Aug.-1 Sept. 2005
Abstract :
In this paper we propose an implementation method for an off-line layout recognition and semantic understanding system for mathematical formulae. This off-line system aims at higher order coding of mathematical formulae in scientific articles as an application in document analysis. The system has two intermediate output codes: a layout tree, holding information of geometrical structure of the formula and character recognized code of the symbols, and a semantic tree, holding information of semantics of symbols. From the structure tree and the semantic tree after layout recognition and semantic understanding, various useful outputs can be generated at the translating part. This paper mainly describes implementation techniques for LATEX source output for high quality typesetting and gnuplot script output for drawing a function as a method for visual representation.
Keywords :
character recognition; text analysis; trees (mathematics); LATEX; character recognition; document analysis; formula geometrical structure; layout recognition; layout tree; mathematical formula; semantic tree; semantic understanding; Books; Character recognition; Dictionaries; Educational institutions; Image processing; Image recognition; Production systems; Prototypes; Text analysis; Typesetting;
Conference_Titel :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
Print_ISBN :
0-7695-2420-6
DOI :
10.1109/ICDAR.2005.10