• DocumentCode
    2815134
  • Title

    Research and implementation English Morphological Analysis and Part-of-Speech tagging

  • Author

    Juan, Cheng

  • Author_Institution
    Normal Educ. Dept., Bohai Shipbuilding Vocational Coll., Huludao, China
  • Volume
    2
  • fYear
    2010
  • fDate
    17-18 April 2010
  • Firstpage
    496
  • Lastpage
    499
  • Abstract
    English Morphological Analysis (MA) and Part-of-Speech (POS) tagging are key task in natural language processing (NLP) and computational linguistics. This research and application are of great theoretical and practical significance. English Morphological Analysis (MA), Part-of-Speech (POS) tagging and Phrase Dictionary Retrieval (PDR) are essential steps in the course of NLP. And they are difficult in NLP. Their results are decisive to the accuracy of next processing, such as information searching, information filtration. As separate problems of English, MA, POS, PDR can be considered independent with each other. In a practical research system, however, they are dependent: solution of the prior one forms the base for processing the next one. Considering different features of these problems in this thesis, after a comprehensive study, a divide-and-conqueror strategy is proposed and resolves them separately. First, a knowledge-based method is put forward for the solution of MA. The whole MA processing is completed by many subordinate functions dealing with different particular marks of English words. A strategy of combining the word length with statistic enumeration is developed to distinguish between the periods and abbreviations.
  • Keywords
    computational linguistics; information retrieval; natural language processing; MA; NLP; PDR; POS; computational linguistics; english morphological analysis; information filtration; information searching; knowledge based method; natural language processing; part-of-speech tagging; phrase dictionary retrieval; Dictionaries; Hidden Markov models; Information analysis; Natural languages; Random processes; Signal processing; Speech analysis; Speech processing; Speech recognition; Tagging; English morphological analysis; natural language processing; part-of-speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    E-Health Networking, Digital Ecosystems and Technologies (EDT), 2010 International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4244-5514-0
  • Type

    conf

  • DOI
    10.1109/EDT.2010.5496438
  • Filename
    5496438