Title :
A Lexicon Reduction Method Based on Clustering Word Images in Offline Farsi Handwritten Word Recognition Systems
Author :
Bayesteh, Elham ; Ahmadifard, Alireza ; Khosravi, Hossein
Author_Institution :
Dept. Electr., Electron. & Robotic Eng., Shahrood Univ. of Technol., Shahrood, Iran
Abstract :
In this paper a novel approach for lexicon reduction of Farsi words is proposed. For this purpose we extract upper and lower profiles, vertical projection profile and black/white transition from word images. Using DTW similarity between words in the database is measured. The Isoclus algorithm is used to cluster handwritten word images of training dataset. The initial center of clusters is determined from agglomerative hierarchical clustering algorithm. Experimental results on IRANSHAHR dataset show a promising result. It yields a lexicon reduction of 77% with accuracy of 94%. We also evaluate the proposed system when combination of statistical features and different type of distance measures are used.
Keywords :
handwriting recognition; pattern clustering; DTW; Isoclus algorithm; black/white transition; dynamic time warping; lexicon reduction method; lower profile extraction; offline Farsi handwritten word recognition systems; upper profile extraction; vertical projection profile; word image clustering; Accuracy; Clustering algorithms; Databases; Feature extraction; Handwriting recognition; Training; Vectors;
Conference_Titel :
Machine Vision and Image Processing (MVIP), 2011 7th Iranian
Conference_Location :
Tehran
Print_ISBN :
978-1-4577-1533-4
DOI :
10.1109/IranianMVIP.2011.6121550