• DocumentCode
    1425920
  • Title

    Off-line handwritten Chinese character recognition as a compound Bayes decision problem

  • Author

    Wong, Pak-Kwong ; Chan, Chorkin

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Syst., Hong Kong Univ., Hong Kong
  • Volume
    20
  • Issue
    9
  • fYear
    1998
  • fDate
    9/1/1998 12:00:00 AM
  • Firstpage
    1016
  • Lastpage
    1023
  • Abstract
    A handwritten Chinese character off-line recognizer based on contextual vector quantization (CVQ) of every pixel of an unknown character image has been constructed. Each template character is represented by a codebook. When an unknown image is matched against a template character, each pixel of the image is quantized according to the associated codebook by considering not just the feature vector observed at each pixel, but those observed at its neighbors and their quantization as well. Structural information such as stroke counts observed at each pixel are captured to form a cellular feature vector. Supporting a vocabulary of 4616 simplified Chinese characters and alphanumeric and punctuation symbols, the writer-independent recognizer has an average recognition rate of 77.2 percent. Three statistical language models for postprocessing have been studied for their effectiveness in upgrading the recognition rate of the system. Among them, the CVQ-based language model is the most effective one upgrading the recognition rate by 10.4 percent on the average
  • Keywords
    Bayes methods; character recognition; image coding; image matching; image segmentation; vector quantisation; Chinese language modelling; cellular feature vector; character recognition; codebook; compound Bayes decision; contextual vector quantization; handwritten Chinese characters; image matching; stroke counts; template character; word segmentation; Character recognition; Handwriting recognition; Hidden Markov models; Image recognition; Natural languages; Pattern recognition; Pixel; Quantization; Text recognition; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/34.713366
  • Filename
    713366