Title :
Training with positive and negative data samples: effects on a classifier for hand-drawn geometric shapes
Author :
Barakat, Hanaa ; Blostein, Dorothea
Author_Institution :
Queen´´s Univ., Kingston, Ont., Canada
fDate :
6/23/1905 12:00:00 AM
Abstract :
It is quite common in document analysis and symbol recognition to rely on a priori knowledge about the nature of the document in order to locate candidate symbols. It is desirable, but less common, for a segmentation procedure to rely on "a posteriori" feedback from a non-human-guided process to adjust for segmentation errors. For this method to succeed, the feedback must come from a reliable classifier (one that is able to reject negative symbols including miss-segmented symbols). This paper examines the use of positive and negative training data on a nearest-neighbour classifier for hand-drawn geometric shapes. We explore the issues involved in the development of a reliable classifier using this method, and we discuss the trade-off between reliability and correctness
Keywords :
document image processing; image recognition; image segmentation; pattern classification; a posteriori feedback; document analysis; hand-drawn geometric shapes; nearest-neighbour classifier; negative training data; positive training data; reliable classifier; segmentation procedure; symbol recognition; Machine learning; Natural languages; Negative feedback; Shape; Spatial databases; Testing; Text analysis; Training data; Typesetting;
Conference_Titel :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
DOI :
10.1109/ICDAR.2001.953939