• DocumentCode
    2170669
  • Title

    A New Approach to Feature Selection in Handwritten Farsi/Arabic Character Recognition

  • Author

    Shayegan, M.A. ; Chee Seng Chan

  • Author_Institution
    Dept. of Artificial Intell., Univ. of Malaya, Kuala Lumpur, Malaysia
  • fYear
    2012
  • fDate
    26-28 Nov. 2012
  • Firstpage
    506
  • Lastpage
    511
  • Abstract
    Feature extraction and feature selection are very important steps in pattern recognition systems. However, finding an optimal, effective, and robust feature set is usually a difficult task. In this paper, with the use of a comprehensive study on offline handwritten Farsi/Arabic digit recognition systems, a set of well-known features were extracted. Then, by employing one- and two-dimensional spectrum diagrams for standard deviation and minimum to maximum distributions, an optimal subset of initial features set was selected automatically. Experimental results, according to one of the biggest standard handwritten Farsi digit datasets, the HODA, had shown 95.70% accuracy with the proposed method. The achieved results showed a salient improvement in system precision in comparison to using other state-of-the-art approaches.
  • Keywords
    feature extraction; handwritten character recognition; natural language processing; optical character recognition; set theory; HODA; feature selection; handwritten Farsi-Arabic character recognition; offline handwritten Farsi-Arabic digit recognition systems; one-dimensional spectrum diagrams; optimal subset; pattern recognition systems; robust feature set; standard handwritten Farsi digit datasets; two-dimensional spectrum diagrams; Farsi/Arabic Handwritten OCR; Feature Extraction and Selection; Principal Component Analysis; Spectrum Diagram;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computer Science Applications and Technologies (ACSAT), 2012 International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-1-4673-5832-3
  • Type

    conf

  • DOI
    10.1109/ACSAT.2012.77
  • Filename
    6516407