• DocumentCode
    2199263
  • Title
    Word-Wise Handwritten Persian and Roman Script Identification
  • Author
    Roy, Kaushik ; Alaei, Alireza ; Pal, Umapada
  • Author_Institution
    Dept. of Comput. Sci., West Bengal State Univ., Kolkata, India
  • fYear
    2010
  • fDate
    16-18 Nov. 2010
  • Firstpage
    628
  • Lastpage
    633
  • Abstract
    Most countries use bi-script documents, because each country uses its own national language alongside English as a second or foreign language. Bilingual documents, with English as one language and the national language as the other, are therefore very common. Postal documents are a good example of such bilingual, bi-script documents. This paper deals with word-wise handwritten script identification in bi-script documents written in Persian and Roman. The proposed scheme uses a simple, fast-to-compute set of 12 features based on fractal dimension, position of small components, topology, etc., and employs a set of classifiers for the script identification experiments. We tested our scheme on a dataset of 5000 handwritten Persian and English words and obtained 99.20% correct script identification.
  • Keywords
    document image processing; handwritten character recognition; natural language processing; pattern classification; bi-lingual document; fractal dimension; national language; postal documents; word-wise handwritten Persian script identification; word-wise handwritten Roman script identification; Fractal dimension; Handwritten script identification; Persian handwritten Recognition; Word-wise script identification;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
  • Conference_Location
    Kolkata
  • Print_ISBN
    978-1-4244-8353-2
  • Type
    conf
  • DOI
    10.1109/ICFHR.2010.103
  • Filename
    5693634