DocumentCode :
3423439
Title :
Automatic Annotation for Korean--Approach Based on the Contextual Exploration Method
Author :
Chai, Hyunzoo
Author_Institution :
Univ. of Paris-Sorbonne, Paris
fYear :
2007
fDate :
3-7 Sept. 2007
Firstpage :
278
Lastpage :
282
Abstract :
We present an automatic semantic annotation system for Korean based on the contextual exploration method. Creating a morphological analyzer and part-of-speech tagger for the Korean language is difficult as it is a highly agglutinative language. Accordingly, processing Korean in the same order as inflectional languages - morphological analysis, then syntactical and then semantic - has not yielded satisfactory results. Our new method identifies semantic information in Korean text without going through the morphological and syntactical analysis steps. Our initial system properly annotates approximately 88% of standard Korean sentences, and this annotation rate holds across text domains. Previously, the contextual exploration method has been applied successfully to languages as diverse as French and Arabic. Given our success with Korean, we believe that this method can be applied to other agglutinative languages such as Japanese, Turkish and Finnish.
Keywords :
natural language processing; Korean automatic annotation; contextual exploration method; inflectional languages; morphological analysis; morphological analyzer; part-of-speech tagger; Data mining; Databases; Decoding; Expert systems; Humans; Information analysis; Natural language processing; Natural languages; Robustness; Tagging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications, 2007. DEXA '07. 18th International Workshop on
Conference_Location :
Regensburg
ISSN :
1529-4188
Print_ISBN :
978-0-7695-2932-5
Type :
conf
DOI :
10.1109/DEXA.2007.62
Filename :
4312901
Link To Document :
بازگشت