Title :
Analysis of nursing-care freestyle japanese text classification using ga-based term selection
Author :
Nii, Manabu ; Yamaguchi, Takafumi ; Takahashi, Yutaka ; Sakashita, Reiko ; Uchinuno, Atsuko
Author_Institution :
Grad. Sch. of Eng., Univ. of Hyogo, Himeji, Japan
Abstract :
In this paper, classification performance of a term selection based on GA is analyzed. In the term selection based on GA, two objectives which are maximizing correctly classified texts and minimizing selected terms are optimized. An objective function based on the classification per-formance of the SVM with 10-fold cross validation is used for evaluating each individual in GA. Therefore, GA-based term selection is performed aiming at the improvement in classification per-formance on testing text sets. This causes the performance deterioration over unseen texts in actual use by GA-based term selection because terms are deleted excessively even when such terms have important role for the classification. In this paper, relation between the terms deleted by the term se-lection based on GA and the terms which appears in unseen texts is clarified by numerical simulation results.
Keywords :
genetic algorithms; medical administrative data processing; numerical analysis; patient care; support vector machines; text analysis; 10-fold cross validation; GA-based term selection; SVM; genetic algorithm; numerical simulation; nursing-care freestyle Japanese text classification; support vector machine; Classification algorithms; Gallium; Pain; Software; Support vector machines; Text categorization; Nursing-care texts; genetic algorithm; support vector machine; term selection;
Conference_Titel :
World Automation Congress (WAC), 2010
Conference_Location :
Kobe
Print_ISBN :
978-1-4244-9673-0
Electronic_ISBN :
2154-4824