Title :
Segmentation of connected Arabic characters using hidden Markov models
Author :
Gouda, Alaa M. ; Rashwan, M.A.
Author_Institution :
Dept. of Electr. & Comput. Eng., King Abdulaziz Univ., Jeddah, Saudi Arabia
Abstract :
Because the Arabic text is connected by nature, segmentation of Arabic text into characters is a very important task for building an Arabic OCR. Although a lot of work has been done in this area, there is no perfect technique for segmentation has been used until now. In this paper, discrete hidden Markov models are used for segmentation of Arabic words into letters. The results are very encouraging. A system has been built and used for testing the proposed algorithm and the segmentation results achieved 99%.
Keywords :
finite state machines; hidden Markov models; natural languages; optical character recognition; pattern classification; word processing; Arabic OCR; Arabic text; Arabic words; HMM; connected Arabic character segmentation; cursive script; discrete hidden Markov models; Character recognition; Hidden Markov models; Histograms; Neural networks; Optical character recognition software; Reconstruction algorithms; Text recognition; Writing;
Conference_Titel :
Computational Intelligence for Measurement Systems and Applications, 2004. CIMSA. 2004 IEEE International Conference on
Print_ISBN :
0-7803-8341-9
DOI :
10.1109/CIMSA.2004.1397244