DocumentCode :
478638
Title :
A Comparison of Clustering Methods for Word Image Indexing
Author :
Marinai, Simone ; Marino, Emanuele ; Soda, Giovanni
fYear :
2008
fDate :
16-19 Sept. 2008
Firstpage :
671
Lastpage :
676
Abstract :
In this paper we compare three clustering methods used to perform word image indexing. The three methods are: the Self-Organizing Map (SOM), the Growing Hierarchical Self-Organizing Map (GHSOM), and the Spectral Clustering. We test these methods on a real data set composed of word images extracted from an encyclopedia of the XIX-th Century. The word images are grouped on the basis of the clustering methods and subsequently retrieved identifying the closest clusters to a query word. The accuracy of the methods is compared evaluating the performance of the word retrieval algorithm. From the experimental results we conclude that methods designed to automatically determine the number and the structure of clusters, such as GHSOM, are particularly suitable in the context represented by our data set.
Keywords :
Clustering methods; Convergence; Error analysis; Feature extraction; Image analysis; Image converters; Image segmentation; Indexing; Shape; Text analysis; Document Image Retrieval; Self Organizing Map; clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis Systems, 2008. DAS '08. The Eighth IAPR International Workshop on
Conference_Location :
Nara, Japan
Print_ISBN :
978-0-7695-3337-7
Type :
conf
DOI :
10.1109/DAS.2008.85
Filename :
4670020
Link To Document :
بازگشت