• DocumentCode
    1466865
  • Title

    A Generative Student Model for Scoring Word Reading Skills

  • Author

    Tepperman, J. ; Sungbok Lee ; Narayanan, S. ; Alwan, A.

  • Author_Institution
    Rosetta Stone Labs., Boulder, CO, USA
  • Volume
    19
  • Issue
    2
  • fYear
    2011
  • Firstpage
    348
  • Lastpage
    360
  • Abstract
    This paper presents a novel student model intended to automate word-list-based reading assessments in a classroom setting, specifically for a student population that includes both native and nonnative speakers of English. As a Bayesian Network, the model is meant to conceive of student reading skills as a conscientious teacher would, incorporating cues based on expert knowledge of pronunciation variants and their cognitive or phonological sources, as well as prior knowledge of the student and the test itself. Alongside a hypothesized structure of conditional dependencies, we also propose an automatic method for refining the Bayes Net to eliminate unnecessary arcs. Reading assessment baselines that use strict pronunciation scoring alone (without other prior knowledge) achieve 0.7 correlation of their automatic scores with human assessments on the TBALL dataset. Our proposed structure significantly outperforms this baseline, and a simpler data-driven structure achieves 0.87 correlation through the use of novel features, surpassing the lower range of inter-annotator agreement. Scores estimated by this new model are also shown to exhibit the same biases along demographic lines as human listeners. Though used here for reading assessment, this model paradigm could be used in other pedagogical applications like foreign language instruction, or for inferring abstract cognitive states like categorical emotions.
  • Keywords
    belief networks; computer aided instruction; natural language processing; speech processing; Bayesian network; categorical emotions; cognitive sources; demographic lines; expert knowledge; foreign language instruction; generative student model; pedagogical applications; phonological sources; pronunciation; word reading skills scoring; Bayesian methods; Decoding; Demography; Education; Humans; Natural languages; Permission; Speech analysis; Testing; Viterbi algorithm; Bayesian networks; children's speech; pronunciation evaluation; reading assessment; student modeling
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2047812
  • Filename
    5445019