DocumentCode :
2627898
Title :
Korean Part-of-Speech Tagging Using Disambiguation Rules for Ambiguous Word and Statistical information
Author :
Ahn, Young-Min ; Seo, Young-Hoon
Author_Institution :
Chungbuk Nat. Univ., Cheongju
fYear :
2007
fDate :
21-23 Nov. 2007
Firstpage :
1598
Lastpage :
1601
Abstract :
In this paper we describe a Korean part-of-speech tagging approach using disambiguation rules for ambiguous word and statistical information. Our tagging approach resolves lexical ambiguities by common rules, rules for individual ambiguous word, and statistical approach. Common rules are ones for idioms and phrases of common use including phrases composed of main and auxiliary verbs. We built disambiguation rules for each word which has several distinct morphological analysis results to enhance tagging accuracy. Each rule may have morphemes, morphological tags, and/or word senses of not only an ambiguous word itself but also words around it. Statistical approach based on HMM is then applied for ambiguous words which are not resolved by rules. Experiment shows that the part-of-speech tagging approach has high accuracy and broad coverage.
Keywords :
grammars; hidden Markov models; natural language processing; statistical analysis; HMM; Korean part-of-speech tagging; ambiguous word; disambiguation rule; morphological analysis; statistical information; Data mining; Error correction; Hidden Markov models; Information analysis; Information technology; Natural language processing; Natural languages; Probability; Solid modeling; Tagging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Convergence Information Technology, 2007. International Conference on
Conference_Location :
Gyeongju
Print_ISBN :
0-7695-3038-9
Type :
conf
DOI :
10.1109/ICCIT.2007.380
Filename :
4420481
Link To Document :
بازگشت