Title :
Design of a mathematical expression recognition system
Author :
Lee, Hsi-Jian ; Wang, Jiumn-Shine
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Abstract :
We present a system to segment and recognize texts and mathematical expressions in a document. The system can be divided into six stages: page segmentation and labeling, character segmentation, feature extraction, character recognition, expression formation, and error correction and expression extraction. In expression formation, we build a symbol relation tree for each text line to represent the relationships among the symbols in the text line. Some heuristic rules based on the primitive tokens are used to correct the recognition errors in a text line. We extract all mathematical expressions according to some basic expression forms. Our database consists of 190 symbols in the current stage. The average recognition rate is about 96.16%
Keywords :
character recognition; document image processing; feature extraction; image segmentation; character segmentation; feature extraction; heuristic rules; labeling; mathematical equations understanding; mathematical expression recognition; mathematical expressions; page segmentation; scientific documents; Character recognition; Equations; Error correction; Graphics; Image edge detection; Image segmentation; Labeling; Layout; Optical character recognition software; Text recognition;
Conference_Titel :
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-8186-7128-9
DOI :
10.1109/ICDAR.1995.602097