DocumentCode :
183213
Title :
Text/Non-text Classification in Online Handwritten Documents with Recurrent Neural Networks
Author :
Truyen Van Phan ; Nakagawa, Masaki
Author_Institution :
Dept. of Electron. & Inf. Eng., Tokyo Univ. of Agric. & Technol., Koganei, Japan
fYear :
2014
fDate :
1-4 Sept. 2014
Firstpage :
23
Lastpage :
28
Abstract :
In this paper, we propose a novel method for text/non-text classification in online handwritten document based on Recurrent Neural Network (RNN) and its improved version, Long Short-Term Memory (LSTM) network. The task of classifying strokes in a digital ink document into two classes (text and non-text) can be seen as a sequence labelling task. The bidirectional architecture is used in these networks to access to the complete global context of the sequence being classified. Moreover, a simple but effective model is adopted for the temporal local context of adjacent strokes. By integrating local context and global context, the classification accuracy is improved. In our experiments on the Japanese ink documents (Kondate database), the proposed method achieves a classification rate of 98.75%, which is significantly higher than the 96.61% in the previous work. Similarly, on the English ink documents (IAMonDo database), it produces a classification rate of 97.68%, which is also higher than other results reported in the literature.
Keywords :
handwritten character recognition; recurrent neural nets; text analysis; English ink documents; IAMonDo database; Japanese ink documents; Kondate database; LSTM network; bidirectional architecture; digital ink document; long short-term memory; nontext classification; online handwritten documents; recurrent neural networks; stroke classification; Accuracy; Context; Context modeling; Databases; Feature extraction; Recurrent neural networks; Training; LSTM; Long Short-Term Memory; RNN; Recurrent Neural Networks; Text/Non-text classification; Text/Non-text separation; ink stroke classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location :
Heraklion
ISSN :
2167-6445
Print_ISBN :
978-1-4799-4335-7
Type :
conf
DOI :
10.1109/ICFHR.2014.12
Filename :
6980991
Link To Document :
بازگشت