DocumentCode
183362
Title
A Machine Learning Approach to Detection of Core Region of Online Handwritten Bangla Word Samples
Author
Baral, S. ; Bhattacharya, Surya ; Chakraborty, Arpan ; Bhattacharya, Ujjwal ; Parui, Swapan K.
Author_Institution
Comput. Vision & Pattern Recognition Unit, Indian Stat. Inst., Kolkata, India
fYear
2014
fDate
1-4 Sept. 2014
Firstpage
458
Lastpage
463
Abstract
Core region detection of handwritten cursive words is an important step towards their automatic recognition. Several preprocessing operations such as height normalization, slant estimation etc. Are often based on this core region. This is particularly useful for word recognition of major Indian scripts, which have large character sets. The main parts of majority of these characters belong to the core region that is bounded above by a headline and bounded below by an imaginary base line. Only a few such characters or their parts appear either above or below the core region. A few approaches are available in the literature for detection of such a core region of offline handwritten word samples of Latin script. Also, a similar region is often determined for recognition of images of printed Indian scripts. However, none of these approaches have studied detection of core region of an unconstrained online handwritten word. In this article, we propose a novel method for detection of the core region of online handwritten word samples of Bangla, a major Indian script. For this we first perform smoothing on the samples and then segment a stroke into sub strokes. We compute certain novel positional features from each such sub stroke. Using these features, a multilayer perceptron (MLP) is trained by back propagation (BP) algorithm. On the basis of the output of the MLP, we determine the position of both the headline and the baseline. We have tested this approach on a recently developed large database of online unconstrained handwriting Bangla word samples. The proposed approach would also work on similar samples of Devanagari, another major Indian script. Experimental results are encouraging.
Keywords
handwritten character recognition; learning (artificial intelligence); multilayer perceptrons; natural language processing; object recognition; BP; Devanagari; Indian script; Latin script; MLP; backpropagation algorithm; core region detection; handwritten cursive words; height normalization; imaginary base line; machine learning approach; multilayer perceptron; online handwritten Bangla word samples; printed Indian scripts; slant estimation; unconstrained online handwritten word; word recognition; Compounds; Databases; Feature extraction; Histograms; Shape; Training; Vectors; Detection of core region of handwritten words; Machine learning based appoach; Online handwriting recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location
Heraklion
ISSN
2167-6445
Print_ISBN
978-1-4799-4335-7
Type
conf
DOI
10.1109/ICFHR.2014.83
Filename
6981062
Link To Document