• DocumentCode
    1635337
  • Title

    Document Image Binarisation Using Markov Field Model

  • Author

    Lelore, Thibault ; Bouchara, Frédéric

  • Author_Institution
    Southern Univ. of Toulon-Var, La Garde, France
  • fYear
    2009
  • Firstpage
    551
  • Lastpage
    555
  • Abstract
    This paper presents a new approach for the binarization of seriously degraded manuscript. We introduce a new technique based on a Markov random field (MRF) model of the document. Depending on the available information, the model parameters (clique potentials) are learned from training data or computed using heuristics. The observation model is estimated thanks to an expectation maximization (EM) algorithm which extracts text and paperpsilas features. The performance of the proposition is evaluated on several types of degraded document images where considerable background noise or variation in contrast and illumination exist.
  • Keywords
    Markov processes; document image processing; feature extraction; text analysis; Markov field model; document image binarisation; expectation maximization algorithm; text extract; Background noise; Character recognition; Data mining; Degradation; Image analysis; Image recognition; Large scale integration; Markov random fields; Text analysis; Training data; Document image binarization; EM algorithm; Markov Random Field;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.117
  • Filename
    5277593