Title :
Chinese readability assessment using TF-IDF and SVM
Author :
Chen, Yaw-Huei ; Tsai, Yi-Han ; Chen, Yu-Ta
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Chiayi Univ., Chiayi, Taiwan
Abstract :
This paper proposes a simple yet effective method to automatically determine the readability of Chinese articles. We use mutual information to select the most important terms from the training data, calculate TF-IDF values based on those terms, and use those values as features for SVM to build classification models that identify articles suitable for lower grade students and middle grade students in elementary school. The experiments on elementary school textbooks produce satisfactory results.
Keywords :
classification; educational technology; student experiments; support vector machines; Chinese readability assessment; SVM; TF-IDF; classification models; elementary school textbooks; lower grade students; middle grade students; training data; Educational institutions; Machine learning; Mutual information; Support vector machines; Testing; Training; Training data; Classification; Mutual Information; Readability; SVM; TF-IDF;
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
Conference_Location :
Guilin
Print_ISBN :
978-1-4577-0305-8
DOI :
10.1109/ICMLC.2011.6016783