DocumentCode
3145562
Title
A discriminative learning approach for orientation detection of Urdu document images
Author
Rashid, Sheikh Faisal ; Bukhari, Syed Saqib ; Shafait, Faisal ; Breuel, Thomas M.
Author_Institution
Image Understanding & Pattern Recognition (IUPR), Tech. Univ. of Kaiserslautern, Kaiserslautern, Germany
fYear
2009
fDate
14-15 Dec. 2009
Firstpage
1
Lastpage
5
Abstract
Orientation detection is an important preprocessing step for accurate recognition of text from document images. Many existing orientation detection techniques are based on the fact that in Roman script text ascenders occur more likely than descenders, but this approach is not applicable to document of other scripts like Urdu, Arabic, etc. In this paper, we propose a discriminative learning approach for orientation detection of Urdu documents with varying layouts and fonts. The main advantage of our approach is that it can be applied to documents of other scripts easily and accurately. Our approach is based on classification of individual connected component orientation in the document image, and then the orientation of the page image is determined via majority count. A convolutional neural network is trained as discriminative learning model for the labeled Urdu books dataset with four target orientations: 0, 90, 180 and 270 degrees. We demonstrate the effectiveness of our method on dataset of Urdu documents categorized into the layouts of book, novel and poetry. We achieved 100% orientation detection accuracy on a test set of 328 document images.
Keywords
classification; document image processing; learning (artificial intelligence); natural language processing; neural nets; text analysis; Roman script; Urdu books dataset; Urdu document images; classification; convolutional neural network; discriminative learning approach; orientation detection; text ascenders; text recognition; Artificial intelligence; Books; Cellular neural networks; Image recognition; Learning; Neural networks; Optical character recognition software; Pattern recognition; Shape; Text recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Multitopic Conference, 2009. INMIC 2009. IEEE 13th International
Conference_Location
Islamabad
Print_ISBN
978-1-4244-4872-2
Electronic_ISBN
978-1-4244-4873-9
Type
conf
DOI
10.1109/INMIC.2009.5383110
Filename
5383110
Link To Document