• DocumentCode
    2142416
  • Title

    BLSTM Neural Network Based Word Retrieval for Hindi Documents

  • Author

    Jain, Raman ; Frinken, Volkmar ; Jawahar, C.V. ; Manmatha, R.

  • Author_Institution
    Center for Visual Inf. Technol., Int. Inst. of Inf. Technol. Hyderabad, Hyderabad, India
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    83
  • Lastpage
    87
  • Abstract
    Retrieval from Hindi document image collections is a challenging task. This is partly due to the complexity of the script, which has more than 800 unique ligatures. In addition, segmentation and recognition of individual characters often becomes difficult due to the writing style as well as degradations in the print. For these reasons, robust OCRs are non existent for Hindi. Therefore, Hindi document repositories are not amenable to indexing and retrieval. In this paper, we propose a scheme for retrieving relevant Hindi documents in response to a query word. This approach uses BLSTM neural networks. Designed to take contextual information into account, these networks can handle word images that can not be robustly segmented into individual characters. By zoning the Hindi words, we simplify the problem and obtain high retrieval rates. Our simplification suits the retrieval problem, while it does not apply to recognition. Our scalable retrieval scheme avoids explicit recognition of characters. An experimental evaluation on a dataset of word images gathered from two complete books demonstrates good accuracy even in the presence of printing variations and degradations. The performance is compared with baseline methods.
  • Keywords
    document image processing; image retrieval; image segmentation; natural language processing; neural nets; optical character recognition; BLSTM neural network; Hindi document image collections; OCR; individual character recognition; individual character segmentation; optical character recognition; printing degradations; printing variations; word retrieval; Character recognition; Image segmentation; Neural networks; Optical character recognition software; Robustness; Training; Vectors; BLSTM neural network; Hindi; Indian languages; Word Image Retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2011.26
  • Filename
    6065281