DocumentCode
178461
Title
Data Sufficiency for Online Writer Identification: A Comparative Study of Writer-Style Space vs. Feature Space Models
Author
Shivram, A. ; Ramaiah, C. ; Govindaraju, V.
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. at Buffalo - SUNY, Buffalo, NY, USA
fYear
2014
fDate
24-28 Aug. 2014
Firstpage
3121
Lastpage
3125
Abstract
A key factor in building effective writer identification/verification systems is the amount of data required to build the underlying models. In this research we systematically examine data sufficiency bounds for two broad approaches to online writer identification -- feature space models vs. writer-style space models. We report results from 40 experiments conducted on two publicly available datasets and also test identification performance for the target models using two different feature functions. Our findings show that the writer-style space model gives higher identification performance for a given level of data and further, achieves high performance levels with lesser data costs. This model appears to require as less as 20 words per page to achieve identification performance close to 80% and reaches more than 90% accuracy with higher levels of data enrollment.
Keywords
data analysis; formal verification; text analysis; data sufficiency; feature space model; online writer identification; writer identification-verification system; writer-style space model; Buildings; Data models; Fasteners; Feature extraction; Standards; Text analysis; Writing;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location
Stockholm
ISSN
1051-4651
Type
conf
DOI
10.1109/ICPR.2014.538
Filename
6977250
Link To Document