Title :
A Framework Based on Semi-Supervised Clustering for Discovering Unique Writing Styles
Author :
Bharath, A. ; Madhvanath, Sriganesh
Author_Institution :
Hewlett-Packard Labs., Bangalore, India
Abstract :
An online multi-stroke character is often written in many ways. While some vary in the number of strokes they contain, others differ in the ordering of strokes. It is important for a writer-independent recognition system to learn these different styles of writing the character during the training phase in order to better model the training data. Typically, the samples of a character are clustered in an unsupervised manner and each cluster is modeled individually. In this paper, we describe an approach based on dasiasemi-supervised clusteringpsila where basic domain knowledge can be incorporated for better clustering of strokes present across all the characters.Experimental results show improved recognition accuracy when compared to the baseline system.
Keywords :
handwritten character recognition; pattern clustering; domain knowledge; online multistroke character; semisupervised clustering; stroke clustering; unique writing style discovery; writer-independent recognition system; Character recognition; Clustering algorithms; Feature extraction; Hidden Markov models; Ink; Laboratories; Nearest neighbor searches; Text analysis; Training data; Writing; online Devanagari character recognition; online handwriting recognition; semi-supervised stroke clustering; writing style identification;
Conference_Titel :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2009.148