Title :
Arabic handwritten word spotting using language models
Author :
Khayyat, Muna ; Lam, Linh ; Suen, Ching
Author_Institution :
Centre for Pattern Recognition & Machine Intell., Concordia Univ., Montreal, QC, Canada
Abstract :
With the ever-increasing amounts of published materials being made available, developing efficient means of locating target items has become a subject of significant interest. Among the approaches adopted for this purpose is word spotting, which enables the identification of documents through the use of pertinent keywords. This paper reports on an effective method of word spotting for Arabic handwritten documents that takes into consideration the nature of Arabic handwriting. Parts of Arabic Words (PAWs) form the basic components of this search process, and a hierarchical classifier (consisting of a set of classifiers each trained on a different part of the input pattern) is implemented. For the first time in Arabic word spotting, language models are incorporated into the process of reconstructing words from PAWs. Details of the method and promising experimental results are also presented.
Keywords :
classification; handwritten character recognition; natural language processing; text analysis; Arabic handwritten document; Arabic handwritten word spotting; PAW; document identification; hierarchical classifier; language model; parts of Arabic words; published material; search process; target item location; word reconstruction; Databases; Feature extraction; Hidden Markov models; Image segmentation; Mathematical model; Testing; Writing;
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
Conference_Location :
Bari
Print_ISBN :
978-1-4673-2262-1
DOI :
10.1109/ICFHR.2012.183