• DocumentCode
    187693
  • Title

    Recognition of open vocabulary, online handwritten pages in Tamil script

  • Author

    Urala, K. Bhargava ; Ramakrishnan, A.G. ; Mohamed, Salina

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
  • fYear
    2014
  • fDate
    22-25 July 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In this work, we describe a system, which recognises open vocabulary, isolated, online handwritten Tamil words and extend it to recognize a paragraph of writing. We explain in detail each step involved in the process: segmentation, preprocessing, feature extraction, classification and bigram-based post-processing. On our database of 45,000 handwritten words obtained through tablet PC, we have obtained symbol level accuracy of 78.5% and 85.3% without and with the usage of post-processing using symbol level language models, respectively. Word level accuracies for the same are 40.1% and 59.6%. A line and word level segmentation strategy is proposed, which gives promising results of 100% line segmentation and 98.1% word segmentation accuracies on our initial trials of 40 handwritten paragraphs. The two modules have been combined to obtain a full-fledged page recognition system for online handwritten Tamil data. To the knowledge of the authors, this is the first ever attempt on recognition of open vocabulary, online handwritten paragraphs in any Indian language.
  • Keywords
    document image processing; handwritten character recognition; image segmentation; natural language processing; notebook computers; Indian language; Tamil script; bigram-based post-processing; feature extraction; full-fledged page recognition system; handwritten paragraphs; online handwritten Tamil words; online handwritten pages; open vocabulary recognition; symbol level accuracy; tablet PC; word level segmentation strategy; Accuracy; Databases; Feature extraction; Handwriting recognition; Hidden Markov models; Support vector machines; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications (SPCOM), 2014 International Conference on
  • Conference_Location
    Bangalore
  • Print_ISBN
    978-1-4799-4666-2
  • Type

    conf

  • DOI
    10.1109/SPCOM.2014.6984002
  • Filename
    6984002