• DocumentCode
    2199424
  • Title

    Annotation Tool and XML Representation for Online Indic Data

  • Author

    Belhe, Swapnil ; Paulzagade, Chetan ; Surve, Sanket ; Jawanjal, Nitesh ; Mehrotra, Kapil ; Motwani, Anil

  • Author_Institution
    GIST Group, Center for Dev. of Adv. Comput. (CDAC), Pune, India
  • fYear
    2010
  • fDate
    16-18 Nov. 2010
  • Firstpage
    664
  • Lastpage
    669
  • Abstract
    In this paper we describe the semi-automatic annotation tool for annotating online handwritten data of Indic scripts. The annotation of handwriting data is essential to train and test the recognizers. In this paper we briefly describe the XML representation for storing online handwritten data in Indian languages. We then describe the annotation tool which essentially annotates at stroke, character and word level and exploits the uniqueness of XML standard to provide quality labels at different levels of annotation. The tool also facilitates classification of data based on quality of handwriting, age & region of writers. The annotator can verify the outputs suggested by the tool. The tool is supplemented by a utility for data segregation and accuracy calculator which aids quick performance analysis of recognizer. This tool is extensively used for annotating large amount of Hindi data and promising time saving is obtained in otherwise tedious annotation activity.
  • Keywords
    XML; handwriting recognition; natural language processing; set theory; word processing; Hindi data; Indian languages; XML representation; accuracy calculator; data classification; data segregation; online Indic Data; online handwritten data storage; quick performance analysis; semiautomatic annotation tool;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
  • Conference_Location
    Kolkata
  • Print_ISBN
    978-1-4244-8353-2
  • Type

    conf

  • DOI
    10.1109/ICFHR.2010.109
  • Filename
    5693640