• DocumentCode
    183331
  • Title

    Automatic Handwritten Indian Scripts Identification

  • Author

    Pardeshi, Rajmohan ; Chaudhuri, Bidyut B. ; Hangarge, Mallikarjun ; Santosh, K.C.

  • Author_Institution
    Dept. of Comput. Sci., Karnatak Arts, Sci. & Commerce Coll., Bidar, India
  • fYear
    2014
  • fDate
    1-4 Sept. 2014
  • Firstpage
    375
  • Lastpage
    380
  • Abstract
    Since OCR engines are usually script-dependent, automatic text recognition in multi-script documents requires a pre-processor module that identifies the scripts. Motivated by this, we present a word-level technique for identifying handwritten Indian scripts. Words are first segmented using morphological dilation followed by connected component labelling. We then employ the Radon transform, discrete wavelet transform, statistical filters and discrete cosine transform to extract directional multi-resolution spatial features. We tested the features using linear discriminant analysis, support vector machine and K-nearest neighbour classifiers on 11 major Indian scripts (including Roman) in bi-script and tri-script scenarios. In our tests, we achieved maximum accuracies of 98% and 96% for the bi-script and tri-script scenarios, respectively.
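
    The word segmentation step described in the abstract (morphological dilation, then connected component labelling) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not give the structuring element or its size, so the horizontal element, the `half_width` parameter, the 8-connectivity, and the function names `dilate`, `label_components` and `segment_words` are all assumptions made for illustration. The idea is that dilation merges the characters of one word into a single blob, so each connected component then corresponds to one word.

```python
from collections import deque


def dilate(img, half_width=1):
    """Binary dilation with a horizontal 1 x (2*half_width + 1) element (assumed shape)."""
    rows, cols = len(img), len(img[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            lo, hi = max(0, c - half_width), min(cols, c + half_width + 1)
            out[r][c] = 1 if any(img[r][lo:hi]) else 0
    return out


def label_components(img):
    """Find 8-connected components; return bounding boxes (top, left, bottom, right)."""
    rows, cols = len(img), len(img[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if not img[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and img[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def segment_words(img, half_width=1):
    """Dilate, label, and return word boxes ordered left to right."""
    return sorted(label_components(dilate(img, half_width)), key=lambda b: b[1])


# Toy binarised text line: two "words", each made of two separate strokes.
line = [
    [1, 0, 1, 0, 0, 0, 0, 1, 0, 1],
    [1, 0, 1, 0, 0, 0, 0, 1, 0, 1],
]
strokes = label_components(line)          # components before dilation
words = segment_words(line, half_width=1)  # components after dilation
```

    On this toy line, labelling the raw image finds the individual strokes, while labelling the dilated image finds one box per word. The boxes are measured on the dilated image, so they extend slightly beyond the original ink; in practice `half_width` would have to be tuned to the gap between characters versus the gap between words.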
  • Keywords
    Radon transforms; discrete cosine transforms; discrete wavelet transforms; document image processing; handwritten character recognition; image resolution; optical character recognition; pattern classification; statistical analysis; support vector machines; K-nearest neighbour classifiers; OCR engines; Radon transform; automatic handwritten Indian scripts identification; biscript scenario; connected component labelling; directional multiresolution spatial features; discrete cosine transform; discrete wavelet transform; linear discriminant analysis; morphological dilation; multiscript document; preprocessor module; script-dependent automatic text recognition; statistical filters; support vector machine; triscript scenario; word level handwritten Indian script identification technique; Accuracy; Discrete cosine transforms; Discrete wavelet transforms; Feature extraction; Kernel; Support vector machines; Indian script identification; The Radon transform; discrete cosine transform; statistical filters; wavelet transform;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
  • Conference_Location
    Heraklion
  • ISSN
    2167-6445
  • Print_ISBN
    978-1-4799-4335-7
  • Type

    conf

  • DOI
    10.1109/ICFHR.2014.69
  • Filename
    6981048