DocumentCode :
1639732
Title :
Syntactic Detection and Correction of Misrecognitions in Mathematical OCR
Author :
Fujiyoshi, Akio ; Suzuki, Masakazu ; Uchida, Seiichi
Author_Institution :
Dept. of Comput. & Inf. Sci., Ibaraki Univ., Hitachi, Japan
fYear :
2009
Firstpage :
1360
Lastpage :
1364
Abstract :
This paper proposes a syntactic method for detection and correction of misrecognized mathematical formulae for a practical mathematical OCR system. Linear monadic context-free tree grammar (LM-CFTG) is employed as a formal framework to define syntactically acceptable mathematical formulae.For the purpose of practical evaluation, a verification system is developed, and the effectiveness of the method is demonstrated by using the ground-truthed mathematical document database InftyCDB-1 and a misrecognition database newly constructed for this study.A satisfactory number of misrecognitions are detected and delivered to the correction process.
Keywords :
document image processing; mathematical analysis; optical character recognition; trees (mathematics); visual databases; ground-truthed mathematical document database; linear monadic context-free tree grammar; mathematical OCR system; optical character recognition; syntactic detection; Databases; Image recognition; Information analysis; Information science; Mathematics; Optical character recognition software; Pixel; Stochastic processes; Text analysis; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
ISSN :
1520-5363
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2009.150
Filename :
5277755
Link To Document :
بازگشت