Title :
Automatic Quality Assessment of Documents with Application to Essay Grading
Author :
Kumar, Narendra ; Dey, Lipika
Author_Institution :
TCS Innovation Lab., Tata Consultancy Services, New Delhi, India
Abstract :
In this paper, we focus on automatic quality assessment for intelligent essay grading. Our devised system grades essays without depending upon completely overlapping essays in training data. This increases the scope of devised system due to list dependency on highly topic focused labeled data for automatic essay grading. Instead of depending upon direct topic specific matching w.r.t., training data, the devised system judge the quality of essay by exploiting knowledgebase documents and SentiWordNet, etc. To achieve this goal, we concentrate on five different features: (1) relevance of information, (2) presence of sparsely connected words, (3) statistical and semantic role of words, (4) presence of talkative terms and (5) length of essay. We extract all these features by using word graph of text, populated with statistical, semantic and topical relation between words. Next, we use graph theoretical techniques, like: weighted all pair shortest paths, Ego-Networks, entropy based measures for effectiveness of nodes in weighted graph and statistical and probabilistic techniques like: total correlation score and Point wise Mutual Information (PMI) etc. Our experimental result on standard dataset shows that our devised system performs better than state-of-the-Art systems of this area.
Keywords :
computer aided instruction; document handling; entropy; network theory (graphs); statistical analysis; PMI; SentiWordNet; automatic quality assessment; document assessment; ego-networks; entropy based measures; essay length; features extraction; focused labeled data; graph theoretical techniques; information relevance; intelligent essay grading; knowledge-base documents; pointwise mutual information; probabilistic techniques; semantic relation; sparsely connected words; statistical relation; statistical techniques; talkative terms; topical relation; total correlation score; training data; weighted all pair shortest paths; weighted graph; word graph; words semantic role; words statistical role; Abstracts; Correlation; Entropy; Feature extraction; Semantics; Training; Training data; Automatic essay grading; Ego-Network; Entropy; Point Wise Mutual Information; Word graph of text;
Conference_Titel :
Artificial Intelligence (MICAI), 2013 12th Mexican International Conference on
Conference_Location :
Mexico City
Print_ISBN :
978-1-4799-2604-6
DOI :
10.1109/MICAI.2013.34