DocumentCode :
3429066
Title :
A prototype system for handwritten sub-word recognition: Toward Arabic-manuscript transliteration
Author :
Moghaddam, Reza Farrahi ; Cheriet, Mohamed ; Milo, Thomas ; Wisnovsky, Robert
Author_Institution :
Synchromedia Lab., Ecole de Technol. Super., Montreal, QC, Canada
fYear :
2012
fDate :
2-5 July 2012
Firstpage :
1198
Lastpage :
1204
Abstract :
A prototype system for the transliteration of diacriticsless Arabic manuscripts at the sub-word or part of Arabic word (PAW) level is developed. The system is able to read sub-words of the input manuscript using a set of skeleton-based features. A variation of the system is also developed which reads archigraphemic Arabic manuscripts, which are dot-less, into archigraphemes transliteration. In order to reduce the complexity of the original highly multiclass problem of sub-word recognition, it is redefined into a set of binary descriptor classifiers. The outputs of trained binary classifiers are combined to generate the sequence of sub-word letters. SVMs are used to learn the binary classifiers. Two specific Arabic databases have been developed to train and test the system. One of them is a database of the Naskh style. The initial results are promising. The systems could be trained on other scripts found in Arabic manuscripts.
Keywords :
handwritten character recognition; image recognition; language translation; pattern classification; support vector machines; Arabic databases; Arabic-manuscript transliteration; Naskh style; PAW; SVM; archigraphemes transliteration; archigraphemic Arabic manuscripts; binary descriptor classifiers; diacriticsless Arabic manuscripts; handwritten subword recognition; part of Arabic word; prototype system; skeleton-based features; Complexity theory; Databases; Encoding; Feature extraction; Shape; Skeleton; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4673-0381-1
Electronic_ISBN :
978-1-4673-0380-4
Type :
conf
DOI :
10.1109/ISSPA.2012.6310473
Filename :
6310473
Link To Document :
بازگشت