• DocumentCode
    2145719
  • Title

    A Novel Italic Detection and Rectification Method for Chinese Advertising Images

  • Author

    Liu, Jie ; Li, Heping ; Zhang, Shuwu ; Liang, Wei

  • Author_Institution
    High-Tech Innovation Center, Inst. of Autom., Beijing, China
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    698
  • Lastpage
    702
  • Abstract
    The italic detection and slant rectification is a key step of optical character recognition (OCR). In this paper, a novel method is proposed to detect and rectify italic characters in Chinese advertising images. Based on observations on structures of many characters, the centroid angle is proposed and a statistical study on it is presented. According to the statistical results, the centroid angle of a Chinese character approximately obeys a Gaussian distribution with its slant angle. Moreover, a Markov Random Field (MRF) model, considering the font-face similarity of neighboring characters and the strong correlation between the centroid angle and the slant angle of a character, is then presented to estimate the slant angle of a character. The italic characters can be detected and rectified by the estimated angle. The experimental results demonstrate the proposed method is effective and applicable.
  • Keywords
    Gaussian distribution; Markov processes; natural language processing; optical character recognition; Chinese advertising images; Gaussian distribution; Markov random field model; centroid angle; font face similarity; italic detection; italic rectification method; optical character recognition; statistical study; Advertising; Correlation; Estimation; Gaussian distribution; Optical character recognition software; Reliability; Text analysis; Markov Random Field; centroid angle; italic detection; slant rectification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2011.146
  • Filename
    6065401