Title :
Off-line unconstrained Farsi handwritten word recognition using fuzzy vector quantization and hidden Markov word models
Author :
Dehghan, M. ; Faez, Karim ; Ahmadi, M. ; Shridhar, M.
Author_Institution :
Electr. Eng. Dept., Amirkabir Univ. of Technol., Tehran, Iran
Abstract :
An unconstrained Farsi handwritten word recognition system based on fuzzy vector quantization (FVQ) and a hidden Markov model (HMM) for reading city names in postal addresses is presented. Preprocessing techniques including binarization, noise removal, slope correction and baseline estimation are described. Each word image is represented by its contour information. The histogram of chain code slopes of the image strips (frames), scanned from right to left by a sliding window, is used as feature vectors. Fuzzy c-means (FCM) clustering is used for generating a fuzzy code book. A separate HMM is trained by a modified Baum-Welch algorithm for each city name. A test image is recognized by finding the best match (likelihood) between the image and all of the HMM work models using a forward algorithm. Experimental results show the advantages of using an FVQ/HMM recognizer engine instead of conventional discrete HMMs
Keywords :
feature extraction; fuzzy set theory; handwritten character recognition; hidden Markov models; pattern clustering; vector quantisation; baseline estimation; binarization; chain code slopes; city names; contour information; fuzzy c-means clustering; fuzzy code book; fuzzy vector quantization; hidden Markov word models; modified Baum-Welch algorithm; noise removal; off-line unconstrained Farsi handwritten word recognition; postal addresses; preprocessing techniques; slope correction; Books; Cities and towns; Clustering algorithms; Fuzzy systems; Handwriting recognition; Hidden Markov models; Histograms; Strips; Testing; Vector quantization;
Conference_Titel :
Pattern Recognition, 2000. Proceedings. 15th International Conference on
Conference_Location :
Barcelona
Print_ISBN :
0-7695-0750-6
DOI :
10.1109/ICPR.2000.906085