DocumentCode :
2015661
Title :
On the Use of Lexeme Features for Writer Verification
Author :
Bhardwaj, Anurag ; Singh, Abhishek ; Srinivasan, Harish ; Srihari, Sargur
Author_Institution :
Univ. at Buffalo, Amherst
Volume :
2
fYear :
2007
fDate :
23-26 Sept. 2007
Firstpage :
1088
Lastpage :
1092
Abstract :
Document examiners use a variety of features to analyze a given handwritten document for writer verification. The challenge in the automatic classification of a pair of documents to belong to the same or different writer, are both (i)The task of proper selection and extraction of features from the handwritten document and (ii)The use of a proper model that is capable of utilizing the true discriminatory power of these features for classification. This paper describes the use of content specific skeleton based features for characters and pairs of characters (bigrams) and ascertains their discriminatory power. A triangulation skeletonisation procedure is first used to obtain the skeleton of the character(s), and features are computed from the skeleton. Experiments and results are conducted on content specific features extracted for two most frequently occurring bigrams (th, he), and characters (d and f). A neural network based on a Bayesian formulation was used to ascertain the discriminability power of these features. To combine these features with previously existing writer verification features, an alternative Naive Bayes model is also described and evaluated. From the results obtained, we conclude that bigram th has the highest discriminatory power followed by character d, f and bigram he. Also the paper highlights the significant increase in performance of writer verification(^ 15% more) with the use of Bayesian neural networks as against the Naive Bayes model.
Keywords :
Bayes methods; feature extraction; handwriting recognition; image classification; image thinning; Lexeme features; automatic classification; features extraction; features selection; handwritten document; naive Bayes model; triangulation skeletonisation procedure; writer verification; Bayesian methods; Design automation; Euclidean distance; Feature extraction; Handwriting recognition; Neural networks; Skeleton; Text analysis; Writing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
ISSN :
1520-5363
Print_ISBN :
978-0-7695-2822-9
Type :
conf
DOI :
10.1109/ICDAR.2007.4377083
Filename :
4377083
Link To Document :
بازگشت