Title :
Korean Text Chunk Identification Using Support Vector Machines
Author :
Kim, Sang-Soo ; Son, Jeong Woo ; Kong, Mi-hwa ; Park, Seong-Bae ; Lee, Sanj-Jo
Author_Institution :
Dept. of Comput. Eng., Kyungpook Nat. Univ., Daegu
Abstract :
In this paper, we propose a method of Korean text chunk identification based on support vector machines (SVMs). Text chunking is a task that divides text into syntactically related non-overlapping groups of word. It is a useful preprocessing step for the reduced time and computational resource of sentence parsing. Especially, we select features for SVM by considering the linguistic typological characteristics of Korean, and convert the text chunk identification, a multi-class problem into binary problems for the use of SVM frameworks. According to the experimental results, the proposed method showed 99.04 of F-score
Keywords :
natural languages; support vector machines; text analysis; word processing; Korean text chunk identification; SVM framework; computational resource; linguistic typological characteristic; sentence parsing; support vector machine; text chunking; Information technology; Support vector machines;
Conference_Titel :
Information Technology: New Generations, 2006. ITNG 2006. Third International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
0-7695-2497-4
DOI :
10.1109/ITNG.2006.87