DocumentCode :
3474882
Title :
Korean Text Chunk Identification Using Support Vector Machines
Author :
Kim, Sang-Soo ; Son, Jeong Woo ; Kong, Mi-hwa ; Park, Seong-Bae ; Lee, Sanj-Jo
Author_Institution :
Dept. of Comput. Eng., Kyungpook Nat. Univ., Daegu
fYear :
2006
fDate :
10-12 April 2006
Firstpage :
674
Lastpage :
679
Abstract :
In this paper, we propose a method of Korean text chunk identification based on support vector machines (SVMs). Text chunking is a task that divides text into syntactically related non-overlapping groups of word. It is a useful preprocessing step for the reduced time and computational resource of sentence parsing. Especially, we select features for SVM by considering the linguistic typological characteristics of Korean, and convert the text chunk identification, a multi-class problem into binary problems for the use of SVM frameworks. According to the experimental results, the proposed method showed 99.04 of F-score
Keywords :
natural languages; support vector machines; text analysis; word processing; Korean text chunk identification; SVM framework; computational resource; linguistic typological characteristic; sentence parsing; support vector machine; text chunking; Information technology; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology: New Generations, 2006. ITNG 2006. Third International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
0-7695-2497-4
Type :
conf
DOI :
10.1109/ITNG.2006.87
Filename :
1611682
Link To Document :
بازگشت