• DocumentCode
    1639732
  • Title

    Syntactic Detection and Correction of Misrecognitions in Mathematical OCR

  • Author

    Fujiyoshi, Akio ; Suzuki, Masakazu ; Uchida, Seiichi

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Ibaraki Univ., Hitachi, Japan
  • fYear
    2009
  • Firstpage
    1360
  • Lastpage
    1364
  • Abstract
    This paper proposes a syntactic method for detection and correction of misrecognized mathematical formulae for a practical mathematical OCR system. Linear monadic context-free tree grammar (LM-CFTG) is employed as a formal framework to define syntactically acceptable mathematical formulae. For the purpose of practical evaluation, a verification system is developed, and the effectiveness of the method is demonstrated by using the ground-truthed mathematical document database InftyCDB-1 and a misrecognition database newly constructed for this study. A satisfactory number of misrecognitions are detected and delivered to the correction process.
  • Keywords
    document image processing; mathematical analysis; optical character recognition; trees (mathematics); visual databases; ground-truthed mathematical document database; linear monadic context-free tree grammar; mathematical OCR system; syntactic detection; Databases; Image recognition; Information analysis; Information science; Mathematics; Optical character recognition software; Pixel; Stochastic processes; Text analysis; Tree data structures
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.150
  • Filename
    5277755