DocumentCode
3520948
Title
An Efficient Corpus Based Part-of-Speech Tagging with GEP
Author
Lv, Chengyao ; Liu, Huihua ; Dong, Yuanxing
Author_Institution
Sch. of Foreign Language, China Univ. of Geosci., Wuhan, China
fYear
2010
fDate
1-3 Nov. 2010
Firstpage
289
Lastpage
292
Abstract
Text corpora tagged with part-of-speech (POS) information are useful in many areas of linguistic research. This paper proposes a Genetic Expression Programming (GEP) model for POS tagging. GEP is used to search for appropriate structures in function space. After evolving sequences of tags, GEP selects the best individual as the solution. Before simulation, a set of appropriate algorithm parameters is fitted. Experiments on the Brown Corpus show that the proposed model achieves a higher accuracy rate than the Genetic Algorithm model and the HMM model.
Keywords
identification technology; natural language processing; optimisation; search problems; text analysis; Brown corpus; GEP; corpus based part-of-speech tagging; genetic expression programming; part-of-speech information; pos tagging; text corpora
fLanguage
English
Publisher
ieee
Conference_Titel
Semantics Knowledge and Grid (SKG), 2010 Sixth International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-8125-5
Electronic_ISBN
978-0-7695-4189-1
Type
conf
DOI
10.1109/SKG.2010.42
Filename
5663526