DocumentCode
2199424
Title
Annotation Tool and XML Representation for Online Indic Data
Author
Belhe, Swapnil ; Paulzagade, Chetan ; Surve, Sanket ; Jawanjal, Nitesh ; Mehrotra, Kapil ; Motwani, Anil
Author_Institution
GIST Group, Center for Dev. of Adv. Comput. (CDAC), Pune, India
fYear
2010
fDate
16-18 Nov. 2010
Firstpage
664
Lastpage
669
Abstract
In this paper we describe the semi-automatic annotation tool for annotating online handwritten data of Indic scripts. The annotation of handwriting data is essential to train and test the recognizers. In this paper we briefly describe the XML representation for storing online handwritten data in Indian languages. We then describe the annotation tool which essentially annotates at stroke, character and word level and exploits the uniqueness of XML standard to provide quality labels at different levels of annotation. The tool also facilitates classification of data based on quality of handwriting, age & region of writers. The annotator can verify the outputs suggested by the tool. The tool is supplemented by a utility for data segregation and accuracy calculator which aids quick performance analysis of recognizer. This tool is extensively used for annotating large amount of Hindi data and promising time saving is obtained in otherwise tedious annotation activity.
Keywords
XML; handwriting recognition; natural language processing; set theory; word processing; Hindi data; Indian languages; XML representation; accuracy calculator; data classification; data segregation; online Indic Data; online handwritten data storage; quick performance analysis; semiautomatic annotation tool;
fLanguage
English
Publisher
ieee
Conference_Titel
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location
Kolkata
Print_ISBN
978-1-4244-8353-2
Type
conf
DOI
10.1109/ICFHR.2010.109
Filename
5693640
Link To Document