DocumentCode :
183362
Title :
A Machine Learning Approach to Detection of Core Region of Online Handwritten Bangla Word Samples
Author :
Baral, S. ; Bhattacharya, Surya ; Chakraborty, Arpan ; Bhattacharya, Ujjwal ; Parui, Swapan K.
Author_Institution :
Comput. Vision & Pattern Recognition Unit, Indian Stat. Inst., Kolkata, India
fYear :
2014
fDate :
1-4 Sept. 2014
Firstpage :
458
Lastpage :
463
Abstract :
Core region detection of handwritten cursive words is an important step towards their automatic recognition. Several preprocessing operations such as height normalization, slant estimation etc. Are often based on this core region. This is particularly useful for word recognition of major Indian scripts, which have large character sets. The main parts of majority of these characters belong to the core region that is bounded above by a headline and bounded below by an imaginary base line. Only a few such characters or their parts appear either above or below the core region. A few approaches are available in the literature for detection of such a core region of offline handwritten word samples of Latin script. Also, a similar region is often determined for recognition of images of printed Indian scripts. However, none of these approaches have studied detection of core region of an unconstrained online handwritten word. In this article, we propose a novel method for detection of the core region of online handwritten word samples of Bangla, a major Indian script. For this we first perform smoothing on the samples and then segment a stroke into sub strokes. We compute certain novel positional features from each such sub stroke. Using these features, a multilayer perceptron (MLP) is trained by back propagation (BP) algorithm. On the basis of the output of the MLP, we determine the position of both the headline and the baseline. We have tested this approach on a recently developed large database of online unconstrained handwriting Bangla word samples. The proposed approach would also work on similar samples of Devanagari, another major Indian script. Experimental results are encouraging.
Keywords :
handwritten character recognition; learning (artificial intelligence); multilayer perceptrons; natural language processing; object recognition; BP; Devanagari; Indian script; Latin script; MLP; backpropagation algorithm; core region detection; handwritten cursive words; height normalization; imaginary base line; machine learning approach; multilayer perceptron; online handwritten Bangla word samples; printed Indian scripts; slant estimation; unconstrained online handwritten word; word recognition; Compounds; Databases; Feature extraction; Histograms; Shape; Training; Vectors; Detection of core region of handwritten words; Machine learning based appoach; Online handwriting recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location :
Heraklion
ISSN :
2167-6445
Print_ISBN :
978-1-4799-4335-7
Type :
conf
DOI :
10.1109/ICFHR.2014.83
Filename :
6981062
Link To Document :
بازگشت