Title :
Online handwritten script recognition
Author :
Namboodiri, Anoop M. ; Jain, Anil K.
Author_Institution :
Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Abstract :
Automatic identification of handwritten script facilitates many important applications such as automatic transcription of multilingual documents and search for documents on the Web containing a particular script. The increase in usage of handheld devices which accept handwritten input has created a growing demand for algorithms that can efficiently analyze and retrieve handwritten data. This paper proposes a method to classify words and lines in an online handwritten document into one of the six major scripts: Arabic, Cyrillic, Devnagari, Han, Hebrew, or Roman. The classification is based on 11 different spatial and temporal features extracted from the strokes of the words. The proposed system attains an overall classification accuracy of 87.1 percent at the word level with 5-fold cross validation on a data set containing 13,379 words. The classification accuracy improves to 95 percent as the number of words in the test sample is increased to five, and to 95.5 percent for complete text lines consisting of an average of seven words.
Keywords :
document image processing; feature extraction; handwriting recognition; online operation; Arabic script; Cyrillic script; Devnagari script; Han script; Hebrew script; Roman script; automatic identification; automatic transcription; feature extraction; handheld devices; handwritten data analysis; handwritten data retrieval; multilingual documents; online handwritten script recognition; Algorithm design and analysis; Data mining; Feature extraction; Handheld computers; Handwriting recognition; Information retrieval; Natural languages; Personal digital assistants; Text recognition; Writing; Algorithms; Artificial Intelligence; Automatic Data Processing; Computer Graphics; Computer Simulation; Handwriting; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; Subtraction Technique; User-Computer Interface;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2004.1261096