• DocumentCode
    2015661
  • Title

    On the Use of Lexeme Features for Writer Verification

  • Author

    Bhardwaj, Anurag ; Singh, Abhishek ; Srinivasan, Harish ; Srihari, Sargur

  • Author_Institution
    Univ. at Buffalo, Amherst
  • Volume
    2
  • fYear
    2007
  • fDate
    23-26 Sept. 2007
  • Firstpage
    1088
  • Lastpage
    1092
  • Abstract
    Document examiners use a variety of features to analyze a given handwritten document for writer verification. The challenge in the automatic classification of a pair of documents to belong to the same or different writer, are both (i)The task of proper selection and extraction of features from the handwritten document and (ii)The use of a proper model that is capable of utilizing the true discriminatory power of these features for classification. This paper describes the use of content specific skeleton based features for characters and pairs of characters (bigrams) and ascertains their discriminatory power. A triangulation skeletonisation procedure is first used to obtain the skeleton of the character(s), and features are computed from the skeleton. Experiments and results are conducted on content specific features extracted for two most frequently occurring bigrams (th, he), and characters (d and f). A neural network based on a Bayesian formulation was used to ascertain the discriminability power of these features. To combine these features with previously existing writer verification features, an alternative Naive Bayes model is also described and evaluated. From the results obtained, we conclude that bigram th has the highest discriminatory power followed by character d, f and bigram he. Also the paper highlights the significant increase in performance of writer verification(^ 15% more) with the use of Bayesian neural networks as against the Naive Bayes model.
  • Keywords
    Bayes methods; feature extraction; handwriting recognition; image classification; image thinning; Lexeme features; automatic classification; features extraction; features selection; handwritten document; naive Bayes model; triangulation skeletonisation procedure; writer verification; Bayesian methods; Design automation; Euclidean distance; Feature extraction; Handwriting recognition; Neural networks; Skeleton; Text analysis; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
  • Conference_Location
    Parana
  • ISSN
    1520-5363
  • Print_ISBN
    978-0-7695-2822-9
  • Type

    conf

  • DOI
    10.1109/ICDAR.2007.4377083
  • Filename
    4377083