• DocumentCode
    2911438
  • Title

    A Lexicon Reduction Method Based on Clustering Word Images in Offline Farsi Handwritten Word Recognition Systems

  • Author

    Bayesteh, Elham ; Ahmadifard, Alireza ; Khosravi, Hossein

  • Author_Institution
    Dept. Electr., Electron. & Robotic Eng., Shahrood Univ. of Technol., Shahrood, Iran
  • fYear
    2011
  • fDate
    16-17 Nov. 2011
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This paper proposes a novel approach to lexicon reduction for Farsi words. We extract the upper and lower profiles, the vertical projection profile, and the black/white transitions from each word image, and measure the similarity between words in the database using dynamic time warping (DTW). The Isoclus algorithm is used to cluster the handwritten word images of the training dataset, with the initial cluster centers determined by an agglomerative hierarchical clustering algorithm. Experimental results on the IRANSHAHR dataset are promising: the method yields a lexicon reduction of 77% with an accuracy of 94%. We also evaluate the proposed system with combinations of statistical features and different types of distance measures.
  • Keywords
    handwriting recognition; pattern clustering; DTW; Isoclus algorithm; black/white transition; dynamic time warping; lexicon reduction method; lower profile extraction; offline Farsi handwritten word recognition systems; upper profile extraction; vertical projection profile; word image clustering; Accuracy; Clustering algorithms; Databases; Feature extraction; Handwriting recognition; Training; Vectors;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Machine Vision and Image Processing (MVIP), 2011 7th Iranian
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4577-1533-4
  • Type

    conf

  • DOI
    10.1109/IranianMVIP.2011.6121550
  • Filename
    6121550
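
The abstract names four column-wise features (upper profile, lower profile, vertical projection profile, black/white transitions) compared with DTW. The record gives no implementation details, so the following is only a minimal sketch under stated assumptions: the word image is a binary NumPy array with nonzero pixels as ink, features are scaled by image height, and the local DTW cost is the Euclidean distance between column feature vectors. The function names `column_features` and `dtw_distance` are hypothetical, and the Isoclus clustering step is not shown.

```python
import numpy as np


def column_features(img):
    """Return a (width, 4) sequence of per-column features.

    Assumption: img is a 2-D array where nonzero pixels are ink.
    """
    ink = np.asarray(img) != 0
    h, _ = ink.shape
    has_ink = ink.any(axis=0)
    # Upper profile: rows from the top edge to the first ink pixel
    # (set to h for columns with no ink).
    upper = np.where(has_ink, ink.argmax(axis=0), h)
    # Lower profile: rows from the bottom edge to the last ink pixel.
    lower = np.where(has_ink, ink[::-1].argmax(axis=0), h)
    # Vertical projection profile: ink pixel count per column.
    projection = ink.sum(axis=0)
    # Black/white transitions: value changes along each column.
    transitions = (ink[1:] != ink[:-1]).sum(axis=0)
    feats = np.stack([upper, lower, projection, transitions], axis=1)
    # Scaling by height is an assumption, not taken from the paper.
    return feats.astype(float) / h


def dtw_distance(a, b):
    """Classic dynamic time warping cost between two feature sequences."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])  # local Euclidean cost
            cost[i, j] = d + min(cost[i - 1, j],      # step in a only
                                 cost[i, j - 1],      # step in b only
                                 cost[i - 1, j - 1])  # step in both
    return cost[n, m]
```

A pairwise matrix of such distances over the training images could then be passed to a clustering step. Because DTW allows one column to align with several, a horizontally stretched copy of a word image stays close to the original, which is the usual motivation for DTW over a fixed column-by-column comparison.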