• DocumentCode
    2961561
  • Title

    Constructing a Novel Chinese Readability Classification Model Using Principal Component Analysis and Genetic Programming

  • Author

    Lee, Yi-Shian ; Tseng, Hou-Chiang ; Chen, Ju-Ling ; Peng, Chun-Yi ; Chang, Tao-Hsing ; Sung, Yao-Ting

  • Author_Institution
    Res. Ctr for Psychological & Educ. Testing, Nat. Taiwan Normal Univ., Taipei, Taiwan
  • fYear
    2012
  • fDate
    4-6 July 2012
  • Firstpage
    164
  • Lastpage
    166
  • Abstract
    The studies of readability aim to measure the level of text difficulty. Although traditional formulae such as the Flesch-Kincaid formula can properly predict text readability, they are only effective for English text. Other formulae with very few features may result in inaccurate text classification. The study takes into account multiple linguistic features, and attempts to increase the level of accuracy in text classification by adopting a new model which integrates Principal Component Analysis (PCA) with Genetic Programming (GP). Empirical data are utilized to demonstrate the performance of the proposed model.
  • Keywords
    genetic algorithms; natural language processing; pattern classification; principal component analysis; text analysis; English text; Flesch-Kincaid formula; GP; PCA; genetic programming; multiple linguistic features; novel Chinese readability classification model; principal component analysis; text classification; text readability; Educational institutions; Genetic programming; Mathematical model; Predictive models; Principal component analysis; Psychology; Support vector machines; Genetic programming; Principal component analysis; Readability; Text analysis component;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Learning Technologies (ICALT), 2012 IEEE 12th International Conference on
  • Conference_Location
    Rome
  • Print_ISBN
    978-1-4673-1642-2
  • Type

    conf

  • DOI
    10.1109/ICALT.2012.134
  • Filename
    6268065