DocumentCode :
591951
Title :
Separability versus Prototypicality in Handwritten Word Retrieval
Author :
Van Oosten, Jean-Paul ; Schomaker, Lambert
Author_Institution :
Dept. of Artificial Intell., Univ. of Groningen, Groningen, Netherlands
fYear :
2012
fDate :
18-20 Sept. 2012
Firstpage :
8
Lastpage :
13
Abstract :
User appreciation of a word-image retrieval system is based on the quality of a hit list for a query. Using support vector machines for ranking in large scale, handwritten document collections, we observed that many hit lists suffered from bad instances in the top ranks. An analysis of this problem revealed that two functions needed to be optimised concerning both separability and prototypicality. By ranking images in two stages, the number of distracting images is reduced, making the method very convenient for massive scale, continuously trainable retrieval engines. Instead of cumbersome SVM training, we present a nearest-centroid method and show that precision improvements of up to 35 percentage points can be achieved, yielding up to 100% precision in data sets with a large amount of instances, while maintaining high recall performances.
Keywords :
handwritten character recognition; image retrieval; support vector machines; SVM training; handwritten document collection; handwritten word retrieval; image ranking; nearest-centroid method; support vector machine; word-image retrieval system; Accuracy; Engines; Humans; Prototypes; Shape; Support vector machines; Training; Handwriting recognition; Handwritten word retrieval; Prototypicality; Ranking; Separability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
Conference_Location :
Bari
Print_ISBN :
978-1-4673-2262-1
Type :
conf
DOI :
10.1109/ICFHR.2012.269
Filename :
6424363
Link To Document :
بازگشت