• DocumentCode
    3520948
  • Title

    An Efficient Corpus Based Part-of-Speech Tagging with GEP

  • Author

    Lv, Chengyao ; Liu, Huihua ; Dong, Yuanxing

  • Author_Institution
    Sch. of Foreign Language, China Univ. of Geosci., Wuhan, China
  • fYear
    2010
  • fDate
    1-3 Nov. 2010
  • Firstpage
    289
  • Lastpage
    292
  • Abstract
    Text corpora which are tagged with part-of-speech (pos) information are useful in many areas of linguistic research. This paper proposes a model of Genetic Expression Programming (GEP) for pos tagging. GEP is used to search for appropriate structures in function space. After the evolution of sequence of tags, GEP can find the best individual as solution. Before simulation, a set of appropriate parameters of algorithm is fitted. Experiments on Brown Corpus show that the proposed model can achieve higher accuracy rate than Genetic Algorithm model and HMM model.
  • Keywords
    identification technology; natural language processing; optimisation; search problems; text analysis; Brown corpus; GEP; corpus based part-of-speech tagging; genetic expression programming; part-of-speech information; pos tagging; text corpora;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantics Knowledge and Grid (SKG), 2010 Sixth International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-8125-5
  • Electronic_ISBN
    978-0-7695-4189-1
  • Type

    conf

  • DOI
    10.1109/SKG.2010.42
  • Filename
    5663526