• DocumentCode
    1636701
  • Title

    Statistical Modeling and Learning for Recognition-Based Handwritten Numeral String Segmentation

  • Author

    Wang, Yanjie ; Liu, Xiabi ; Jia, Yunde

  • Author_Institution
    Beijing Lab. of Intell. Inf. Technol., Beijing Inst. of Technol., Beijing, China
  • fYear
    2009
  • Firstpage
    421
  • Lastpage
    425
  • Abstract
    This paper proposes a recognition based approach to handwritten numeral string segmentation. We consider two classes: numeral strings segmented correctly or not. The feature vectors containing recognition information for numeral strings segmented correctly are assumed to be of the distribution of Gaussian mixture model (GMM). Based on this modeling, the recognition based segmentation is solved under the max-min posterior pseudo-probabilities (MMP) framework of learning Bayesian classifiers. In the training phase, we use the MMP method to learn a posterior pseudo-probability measure function from positive samples and negative samples of numeral strings segmented correctly. In the process of recognition based segmentation, we generate all possible candidate segmentations of an input string through contour and profile analysis, and then compute the posterior pseudo-probabilities of being the numeral string segmented correctly for all the candidate segmentations. The candidate segmentation with the maximum posterior pseudo-probability is taken as the final result. The effectiveness of our approach is demonstrated by the experiments of numeral string segmentation and recognition on the NIST SD19 database.
  • Keywords
    Bayes methods; Gaussian processes; feature extraction; handwritten character recognition; image classification; image sampling; image segmentation; learning (artificial intelligence); probability; statistical analysis; string matching; Bayesian classifier; GMM; Gaussian mixture model; MMP framework; NIST SD19 database; contour-profile analysis; handwritten numeral string segmentation; image sample; max-min posterior pseudo-probability; recognition based segmentation; statistical learning; statistical modeling; Bayesian methods; Character recognition; Databases; Handwriting recognition; Information analysis; Information technology; Laboratories; NIST; Testing; Text analysis; Max-Min posterior Pseudo-probability; discriminative learning; numeral string recognition; numeral string segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.25
  • Filename
    5277643