Title :
EXTRAFOR: automatic EXTRAction of mathematical FORmulas
Author :
Kacem, A. ; Belaïd, A. ; Ben Ahmed, M.
Abstract :
We present a method for automatic extraction of mathematical formulas from images of documents without character recognition. Formula extraction is first done by location of its most significant symbols, then extension to adjoining symbols using contextual rules until delimitation of the whole formula space. Mathematical symbol labelling is realised from models created at the learning stage using fuzzy logic. From the experiments, we found that the average rate of primary labelling of mathematical symbols is about 95.3%. The obtained results have demonstrated the applicability of our system since 90% of mathematical formulas are well extracted from documents printed with high quality
Keywords :
document image processing; feature extraction; fuzzy logic; EXTRAFOR; contextual rules; document image processing; fuzzy logic; mathematical formula extraction; mathematical symbol labelling; primary labelling; symbols; Character recognition; Error correction; Graphics; Image analysis; Labeling; Logic; Mathematical model; Mathematics; Optical character recognition software; White spaces;
Conference_Titel :
Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
Conference_Location :
Bangalore
Print_ISBN :
0-7695-0318-7
DOI :
10.1109/ICDAR.1999.791841