• DocumentCode
    675006
  • Title

    Automatic Quality Assessment of Documents with Application to Essay Grading

  • Author

    Kumar, Narendra ; Dey, Lipika

  • Author_Institution
    TCS Innovation Lab., Tata Consultancy Services, New Delhi, India
  • fYear
    2013
  • fDate
    24-30 Nov. 2013
  • Firstpage
    216
  • Lastpage
    222
  • Abstract
    In this paper, we focus on automatic quality assessment for intelligent essay grading. Our devised system grades essays without depending upon completely overlapping essays in training data. This increases the scope of devised system due to list dependency on highly topic focused labeled data for automatic essay grading. Instead of depending upon direct topic specific matching w.r.t., training data, the devised system judge the quality of essay by exploiting knowledgebase documents and SentiWordNet, etc. To achieve this goal, we concentrate on five different features: (1) relevance of information, (2) presence of sparsely connected words, (3) statistical and semantic role of words, (4) presence of talkative terms and (5) length of essay. We extract all these features by using word graph of text, populated with statistical, semantic and topical relation between words. Next, we use graph theoretical techniques, like: weighted all pair shortest paths, Ego-Networks, entropy based measures for effectiveness of nodes in weighted graph and statistical and probabilistic techniques like: total correlation score and Point wise Mutual Information (PMI) etc. Our experimental result on standard dataset shows that our devised system performs better than state-of-the-Art systems of this area.
  • Keywords
    computer aided instruction; document handling; entropy; network theory (graphs); statistical analysis; PMI; SentiWordNet; automatic quality assessment; document assessment; ego-networks; entropy based measures; essay length; features extraction; focused labeled data; graph theoretical techniques; information relevance; intelligent essay grading; knowledge-base documents; pointwise mutual information; probabilistic techniques; semantic relation; sparsely connected words; statistical relation; statistical techniques; talkative terms; topical relation; total correlation score; training data; weighted all pair shortest paths; weighted graph; word graph; words semantic role; words statistical role; Abstracts; Correlation; Entropy; Feature extraction; Semantics; Training; Training data; Automatic essay grading; Ego-Network; Entropy; Point Wise Mutual Information; Word graph of text;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence (MICAI), 2013 12th Mexican International Conference on
  • Conference_Location
    Mexico City
  • Print_ISBN
    978-1-4799-2604-6
  • Type

    conf

  • DOI
    10.1109/MICAI.2013.34
  • Filename
    6714671