• DocumentCode
    3134519
  • Title

    Arabic handwritten word spotting using language models

  • Author

    Khayyat, Muna ; Lam, Linh ; Suen, Ching

  • Author_Institution
    Centre for Pattern Recognition & Machine Intell., Concordia Univ., Montreal, QC, Canada
  • fYear
    2012
  • fDate
    18-20 Sept. 2012
  • Firstpage
    43
  • Lastpage
    48
  • Abstract
    With the ever-increasing amounts of published materials being made available, developing efficient means of locating target items has become a subject of significant interest. Among the approaches adopted for this purpose is word spotting, which enables the identification of documents through the use of pertinent keywords. This paper reports on an effective method of word spotting for Arabic handwritten documents that takes into consideration the nature of Arabic handwriting. Parts of Arabic Words (PAWs) form the basic components of this search process, and a hierarchical classifier (consisting of a set of classifiers each trained on a different part of the input pattern) is implemented. For the first time in Arabic word spotting, language models are incorporated into the process of reconstructing words from PAWs. Details of the method and promising experimental results are also presented.
  • Keywords
    classification; handwritten character recognition; natural language processing; text analysis; Arabic handwritten document; Arabic handwritten word spotting; PAW; document identification; hierarchical classifier; language model; parts of Arabic words; published material; search process; target item location; word reconstruction; Databases; Feature extraction; Hidden Markov models; Image segmentation; Mathematical model; Testing; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
  • Conference_Location
    Bari
  • Print_ISBN
    978-1-4673-2262-1
  • Type

    conf

  • DOI
    10.1109/ICFHR.2012.183
  • Filename
    6424368