Title :
Off-line Farsi / arabic handwritten word recognition using vector quantization and hidden Markov model
Author :
Vaseghi, Behroz ; Alirezaee, Shahpour ; Ahmadi, Majid ; Amirfattahi, Rasoul
Author_Institution :
Eng. Dept., Islamic Azad Univ., Najafabad
Abstract :
In this paper a Farsi handwritten word recognition system for reading city names in postal addresses is presented. The method is based on vector quantization (VQ) and hidden Markov model (HMM). The sliding right to left window is used to extract the proper features(we have proposed four features). After feature extraction, K-means clustering is used for generation a codebook and VQ generates a codeword for each word image. In the next stage, HMM is trained by Baum Welch algorithm for each city name. A test image is recognized by finding the best match (likelihood) between the image and all of the HMM words models using forward algorithm. Experimental results show the advantages of using VQ/HMM recognizer engine instead of conventional discrete HMM.
Keywords :
document image processing; feature extraction; handwritten character recognition; hidden Markov models; natural language processing; pattern clustering; vector quantisation; Baum Welch algorithm; K-means clustering; VQ-HMM recognizer engine; codebook generation; feature extraction; forward algorithm; hidden Markov model; offline Farsi-Arabic handwritten word recognition; vector quantization; Character recognition; Cities and towns; Clustering algorithms; Engines; Feature extraction; Handwriting recognition; Hidden Markov models; Image recognition; Pattern recognition; Vector quantization; HMM; Off-line character recogntion; VQ;
Conference_Titel :
Multitopic Conference, 2008. INMIC 2008. IEEE International
Conference_Location :
Karachi
Print_ISBN :
978-1-4244-2823-6
Electronic_ISBN :
978-1-4244-2824-3
DOI :
10.1109/INMIC.2008.4777804