مرکز منطقه ای اطلاع رساني علوم و فناوري - Korean Text Chunk Identification Using Support Vector Machines

DocumentCode :

3474882

Title :

Korean Text Chunk Identification Using Support Vector Machines

Author :

Kim, Sang-Soo ; Son, Jeong Woo ; Kong, Mi-hwa ; Park, Seong-Bae ; Lee, Sanj-Jo

Author_Institution :

Dept. of Comput. Eng., Kyungpook Nat. Univ., Daegu

fYear :

2006

fDate :

10-12 April 2006

Firstpage :

674

Lastpage :

679

Abstract :

In this paper, we propose a method of Korean text chunk identification based on support vector machines (SVMs). Text chunking is a task that divides text into syntactically related non-overlapping groups of word. It is a useful preprocessing step for the reduced time and computational resource of sentence parsing. Especially, we select features for SVM by considering the linguistic typological characteristics of Korean, and convert the text chunk identification, a multi-class problem into binary problems for the use of SVM frameworks. According to the experimental results, the proposed method showed 99.04 of F-score

Keywords :

natural languages; support vector machines; text analysis; word processing; Korean text chunk identification; SVM framework; computational resource; linguistic typological characteristic; sentence parsing; support vector machine; text chunking; Information technology; Support vector machines;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Information Technology: New Generations, 2006. ITNG 2006. Third International Conference on

Conference_Location :

Las Vegas, NV

Print_ISBN :

0-7695-2497-4

Type :

conf

DOI :

10.1109/ITNG.2006.87

Filename :

1611682

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3474882