DocumentCode
675006
Title
Automatic Quality Assessment of Documents with Application to Essay Grading
Author
Kumar, Narendra ; Dey, Lipika
Author_Institution
TCS Innovation Lab., Tata Consultancy Services, New Delhi, India
fYear
2013
fDate
24-30 Nov. 2013
Firstpage
216
Lastpage
222
Abstract
In this paper, we focus on automatic quality assessment for intelligent essay grading. Our devised system grades essays without depending upon completely overlapping essays in training data. This increases the scope of devised system due to list dependency on highly topic focused labeled data for automatic essay grading. Instead of depending upon direct topic specific matching w.r.t., training data, the devised system judge the quality of essay by exploiting knowledgebase documents and SentiWordNet, etc. To achieve this goal, we concentrate on five different features: (1) relevance of information, (2) presence of sparsely connected words, (3) statistical and semantic role of words, (4) presence of talkative terms and (5) length of essay. We extract all these features by using word graph of text, populated with statistical, semantic and topical relation between words. Next, we use graph theoretical techniques, like: weighted all pair shortest paths, Ego-Networks, entropy based measures for effectiveness of nodes in weighted graph and statistical and probabilistic techniques like: total correlation score and Point wise Mutual Information (PMI) etc. Our experimental result on standard dataset shows that our devised system performs better than state-of-the-Art systems of this area.
Keywords
computer aided instruction; document handling; entropy; network theory (graphs); statistical analysis; PMI; SentiWordNet; automatic quality assessment; document assessment; ego-networks; entropy based measures; essay length; features extraction; focused labeled data; graph theoretical techniques; information relevance; intelligent essay grading; knowledge-base documents; pointwise mutual information; probabilistic techniques; semantic relation; sparsely connected words; statistical relation; statistical techniques; talkative terms; topical relation; total correlation score; training data; weighted all pair shortest paths; weighted graph; word graph; words semantic role; words statistical role; Abstracts; Correlation; Entropy; Feature extraction; Semantics; Training; Training data; Automatic essay grading; Ego-Network; Entropy; Point Wise Mutual Information; Word graph of text;
fLanguage
English
Publisher
ieee
Conference_Titel
Artificial Intelligence (MICAI), 2013 12th Mexican International Conference on
Conference_Location
Mexico City
Print_ISBN
978-1-4799-2604-6
Type
conf
DOI
10.1109/MICAI.2013.34
Filename
6714671
Link To Document