Title :
Korean Part-of-Speech Tagging Using Disambiguation Rules for Ambiguous Word and Statistical information
Author :
Ahn, Young-Min ; Seo, Young-Hoon
Author_Institution :
Chungbuk Nat. Univ., Cheongju
Abstract :
In this paper we describe a Korean part-of-speech tagging approach using disambiguation rules for ambiguous word and statistical information. Our tagging approach resolves lexical ambiguities by common rules, rules for individual ambiguous word, and statistical approach. Common rules are ones for idioms and phrases of common use including phrases composed of main and auxiliary verbs. We built disambiguation rules for each word which has several distinct morphological analysis results to enhance tagging accuracy. Each rule may have morphemes, morphological tags, and/or word senses of not only an ambiguous word itself but also words around it. Statistical approach based on HMM is then applied for ambiguous words which are not resolved by rules. Experiment shows that the part-of-speech tagging approach has high accuracy and broad coverage.
Keywords :
grammars; hidden Markov models; natural language processing; statistical analysis; HMM; Korean part-of-speech tagging; ambiguous word; disambiguation rule; morphological analysis; statistical information; Data mining; Error correction; Hidden Markov models; Information analysis; Information technology; Natural language processing; Natural languages; Probability; Solid modeling; Tagging;
Conference_Titel :
Convergence Information Technology, 2007. International Conference on
Conference_Location :
Gyeongju
Print_ISBN :
0-7695-3038-9
DOI :
10.1109/ICCIT.2007.380