Title :
HMM-based recognition engine using a novel approach for statistical feature extraction
Author :
Khorsheed, M.S. ; Ouis, Samir
Author_Institution :
Nat. Center for Robot. & Intell. Syst., King Abdul-Aziz City for Sci. & Technol., Riyadh, Saudi Arabia
Abstract :
This paper extracts statistical features using a novel approach. The feature set locally measure the characteristics of the image. The proposed approach encodes the extracted features, from a one-pixel width window that slides horizontally the word image. We then inject the feature vector set into a recognition engine. The recognition engine is built using Hidden Markov Models Tool Kit (HTK). The system is trained and tested on the Arabic Printed Text Image (APTI) database. In order to select the optimal parameters for the HMM classifier, the APTI training dataset is further divided into a smaller training subset and a verification set. The estimated parameters are, then, used in the testing phase. The presented technique provides state-of-the-art recognition results on the APTI database using HMMs. The overall system achieved a recognition rate more than 97%.
Keywords :
feature extraction; hidden Markov models; image recognition; pattern classification; text analysis; visual databases; APTI database; APTI training dataset; Arabic printed text image; HMM classifier; HMM-based recognition engine; HTK; feature vector set; hidden Markov models tool kit; statistical feature extraction; word image; Databases; Engines; Feature extraction; Hidden Markov models; Text recognition; Training; Vectors; Arabic printed text recognition; hidden Markov modesl; run-length encoding;
Conference_Titel :
Systems and Informatics (ICSAI), 2014 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4799-5457-5
DOI :
10.1109/ICSAI.2014.7009290