Title :
Separability versus Prototypicality in Handwritten Word Retrieval
Author :
Van Oosten, Jean-Paul ; Schomaker, Lambert
Author_Institution :
Dept. of Artificial Intell., Univ. of Groningen, Groningen, Netherlands
Abstract :
User appreciation of a word-image retrieval system is based on the quality of a hit list for a query. Using support vector machines for ranking in large scale, handwritten document collections, we observed that many hit lists suffered from bad instances in the top ranks. An analysis of this problem revealed that two functions needed to be optimised concerning both separability and prototypicality. By ranking images in two stages, the number of distracting images is reduced, making the method very convenient for massive scale, continuously trainable retrieval engines. Instead of cumbersome SVM training, we present a nearest-centroid method and show that precision improvements of up to 35 percentage points can be achieved, yielding up to 100% precision in data sets with a large amount of instances, while maintaining high recall performances.
Keywords :
handwritten character recognition; image retrieval; support vector machines; SVM training; handwritten document collection; handwritten word retrieval; image ranking; nearest-centroid method; support vector machine; word-image retrieval system; Accuracy; Engines; Humans; Prototypes; Shape; Support vector machines; Training; Handwriting recognition; Handwritten word retrieval; Prototypicality; Ranking; Separability;
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
Conference_Location :
Bari
Print_ISBN :
978-1-4673-2262-1
DOI :
10.1109/ICFHR.2012.269