Title :
Analyzing the information entropy of states to optimize the number of states in an HMM-based off-line handwritten Arabic word recognizer
Author :
Zhiwei Jiang ; Xiaoqing Ding ; Liangrui Peng ; Changsong Liu
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Abstract :
HMM is one of the most popular methods to model sequential signals and plays a significant role in the field of off-line handwritten Arabic word recognition research. However, the structure of an HMM including the number of states has to be determined initially and can hardly be updated during the training process. A novel analytic algorithm based on the information entropy of states in an HMM to optimize the number of states will be proposed in this paper. Information entropy is defined as an evaluation criterion of the activity of a state. According to principle of maximum entropy, states with minor information entropy do not possess so enough capability to represent actual observations that they should be deleted. Experiments on IFN/ENIT database show that the algorithm in this paper can bring approximately 3%-6% increase to correct recognition rate from the best performance of system with constant states.
Keywords :
entropy; handwritten character recognition; hidden Markov models; natural language processing; HMM-based offline handwritten Arabic word recognizer; IFN-ENIT database; analytic algorithm; evaluation criterion; maximum entropy principle; offline handwritten Arabic word recognition research; sequential signal modelling; state activity; state information entropy; states optimization; training process; Algorithm design and analysis; Handwriting recognition; Hidden Markov models; Information entropy; Optimization; Signal processing algorithms; Training;
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4