DocumentCode
2815134
Title
Research and implementation English Morphological Analysis and Part-of-Speech tagging
Author
Juan, Cheng
Author_Institution
Normal Educ. Dept., Bohai Shipbuilding Vocational Coll., Huludao, China
Volume
2
fYear
2010
fDate
17-18 April 2010
Firstpage
496
Lastpage
499
Abstract
English Morphological Analysis (MA) and Part-of-Speech (POS) tagging are key task in natural language processing (NLP) and computational linguistics. This research and application are of great theoretical and practical significance. English Morphological Analysis (MA), Part-of-Speech (POS) tagging and Phrase Dictionary Retrieval (PDR) are essential steps in the course of NLP. And they are difficult in NLP. Their results are decisive to the accuracy of next processing, such as information searching, information filtration. As separate problems of English, MA, POS, PDR can be considered independent with each other. In a practical research system, however, they are dependent: solution of the prior one forms the base for processing the next one. Considering different features of these problems in this thesis, after a comprehensive study, a divide-and-conqueror strategy is proposed and resolves them separately. First, a knowledge-based method is put forward for the solution of MA. The whole MA processing is completed by many subordinate functions dealing with different particular marks of English words. A strategy of combining the word length with statistic enumeration is developed to distinguish between the periods and abbreviations.
Keywords
computational linguistics; information retrieval; natural language processing; MA; NLP; PDR; POS; computational linguistics; english morphological analysis; information filtration; information searching; knowledge based method; natural language processing; part-of-speech tagging; phrase dictionary retrieval; Dictionaries; Hidden Markov models; Information analysis; Natural languages; Random processes; Signal processing; Speech analysis; Speech processing; Speech recognition; Tagging; English morphological analysis; natural language processing; part-of-speech;
fLanguage
English
Publisher
ieee
Conference_Titel
E-Health Networking, Digital Ecosystems and Technologies (EDT), 2010 International Conference on
Conference_Location
Shenzhen
Print_ISBN
978-1-4244-5514-0
Type
conf
DOI
10.1109/EDT.2010.5496438
Filename
5496438
Link To Document