DocumentCode
3695239
Title
Adapting off-the-shelf CNNs for word spotting & recognition
Author
Arjun Sharma; Pramod Sankar K.
Author_Institution
Xerox Research Centre India, Bengaluru, India
fYear
2015
Firstpage
986
Lastpage
990
Abstract
The word spotting approach is extremely useful for searching and annotating documents for which robust recognizers are unavailable. Traditionally, hand-designed features were used to represent the word images for spotting. In this paper, we learn a data-driven representation for word-images from Convolutional Neural Networks (CNNs). Previous approaches that learn deep neural networks for a particular task/dataset are difficult to design and train for generic word spotting. Instead, by “adapting” a CNN trained for a different problem, we show tremendous speedup in the training phase. Our experiments show that features extracted from an adapted-CNN handsomely outperform hand-designed features on both spotting and recognition tasks for printed (English and Telugu) and handwritten (IAM) document collections.
Keywords
"Bridges","Indexes"
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
Type
conf
DOI
10.1109/ICDAR.2015.7333909
Filename
7333909
Link To Document