DocumentCode :
3326944
Title :
Design of a mathematical expression recognition system
Author :
Lee, Hsi-Jian ; Wang, Jiumn-Shine
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Volume :
2
fYear :
1995
fDate :
14-16 Aug 1995
Firstpage :
1084
Abstract :
We present a system that segments and recognizes text and mathematical expressions in a document. The system is divided into six stages: page segmentation and labeling, character segmentation, feature extraction, character recognition, expression formation, and error correction and expression extraction. In expression formation, we build a symbol relation tree for each text line to represent the relationships among the symbols in that line. Heuristic rules based on the primitive tokens are used to correct recognition errors in a text line. We extract all mathematical expressions according to a set of basic expression forms. Our database currently consists of 190 symbols. The average recognition rate is about 96.16%.
Keywords :
character recognition; document image processing; feature extraction; image segmentation; character segmentation; heuristic rules; labeling; mathematical equations understanding; mathematical expression recognition; mathematical expressions; page segmentation; scientific documents; Character recognition; Equations; Error correction; Graphics; Image edge detection; Image segmentation; Labeling; Layout; Optical character recognition software; Text recognition
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Proceedings of the Third International Conference on Document Analysis and Recognition, 1995
Conference_Location :
Montreal, Que.
Print_ISBN :
0-8186-7128-9
Type :
conf
DOI :
10.1109/ICDAR.1995.602097
Filename :
602097