• DocumentCode
    3695239
  • Title

    Adapting off-the-shelf CNNs for word spotting & recognition

  • Author

    Arjun Sharma; Pramod Sankar K.

  • Author_Institution
    Xerox Research Centre India, Bengaluru, India
  • fYear
    2015
  • Firstpage
    986
  • Lastpage
    990
  • Abstract
    The word spotting approach is extremely useful for searching and annotating documents for which robust recognizers are unavailable. Traditionally, hand-designed features were used to represent the word images for spotting. In this paper, we learn a data-driven representation for word-images from Convolutional Neural Networks (CNNs). Previous approaches that learn deep neural networks for a particular task/dataset are difficult to design and train for generic word spotting. Instead, by “adapting” a CNN trained for a different problem, we show tremendous speedup in the training phase. Our experiments show that features extracted from an adapted-CNN handsomely outperform hand-designed features on both spotting and recognition tasks for printed (English and Telugu) and handwritten (IAM) document collections.
  • Keywords
    "Bridges","Indexes"
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICDAR.2015.7333909
  • Filename
    7333909