Title :
Text Information Extraction Based on Genetic Algorithm and Hidden Markov Model
Author :
Li, Rong ; Zheng, Jia-heng ; Pei, Chun-qin
Author_Institution :
Dept. of Comput., Xinzhou Teachers´´ Coll., Xinzhou
Abstract :
Since the traditional training method of HMM for text information extraction is sensitive to the initial model parameters and easy to converge to a local optimal model in practice ,a novel hybrid model of genetic algorithm (GA) and hidden Markov model (HMM) for text information extraction is presented. During the parameter training phase, the hybrid method combines GA and Baum-Welch algorithm to optimize HMM parameters globally. In the selection process of the HMM initial parameters, the hybrid method adopts GA which uses real number matrix encoding as the representation of the chromosomes and the likelihood values as the fitness values, and then utilizes a modified Baum-Welch algorithm to reevaluate parameters and construct HMM. And during the information extraction phase, an improved Viterbi algorithm is presented to obtain the optimal state sequence of test sample for text information extraction. Experimental results show that the new algorithm improves the performance in precision and recall.
Keywords :
genetic algorithms; hidden Markov models; information retrieval; text analysis; Baum-Welch algorithm; genetic algorithm; hidden Markov model; optimal state sequence; text information extraction; Computer science; Computer science education; Data mining; Educational institutions; Educational technology; Genetic algorithms; Hidden Markov models; Optimization methods; Speech recognition; Viterbi algorithm; Baum-welch algorithm; Viterbi algorithm; genetic algorithm; hidden markov model; text information extraction;
Conference_Titel :
Education Technology and Computer Science, 2009. ETCS '09. First International Workshop on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-1-4244-3581-4
DOI :
10.1109/ETCS.2009.83