Title :
Text/Non-text Classification in Online Handwritten Documents with Recurrent Neural Networks
Author :
Truyen Van Phan ; Nakagawa, Masaki
Author_Institution :
Dept. of Electron. & Inf. Eng., Tokyo Univ. of Agric. & Technol., Koganei, Japan
Abstract :
In this paper, we propose a novel method for text/non-text classification in online handwritten document based on Recurrent Neural Network (RNN) and its improved version, Long Short-Term Memory (LSTM) network. The task of classifying strokes in a digital ink document into two classes (text and non-text) can be seen as a sequence labelling task. The bidirectional architecture is used in these networks to access to the complete global context of the sequence being classified. Moreover, a simple but effective model is adopted for the temporal local context of adjacent strokes. By integrating local context and global context, the classification accuracy is improved. In our experiments on the Japanese ink documents (Kondate database), the proposed method achieves a classification rate of 98.75%, which is significantly higher than the 96.61% in the previous work. Similarly, on the English ink documents (IAMonDo database), it produces a classification rate of 97.68%, which is also higher than other results reported in the literature.
Keywords :
handwritten character recognition; recurrent neural nets; text analysis; English ink documents; IAMonDo database; Japanese ink documents; Kondate database; LSTM network; bidirectional architecture; digital ink document; long short-term memory; nontext classification; online handwritten documents; recurrent neural networks; stroke classification; Accuracy; Context; Context modeling; Databases; Feature extraction; Recurrent neural networks; Training; LSTM; Long Short-Term Memory; RNN; Recurrent Neural Networks; Text/Non-text classification; Text/Non-text separation; ink stroke classification;
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location :
Heraklion
Print_ISBN :
978-1-4799-4335-7
DOI :
10.1109/ICFHR.2014.12