• DocumentCode
    118022
  • Title

    Automatic scoring method for open answer task in the SJ-CAT speaking test considering utterance difficulty level

  • Author

    Lu, Hao ; Yamada, Takeshi ; Imai, Shingo ; Shinozaki, Takahiro ; Nisimura, Ryuichi ; Ishizuka, Kenkichi ; Makino, Shoji ; Kitawaki, Nobuhiko

  • Author_Institution
    Univ. of Tsukuba, Tsukuba, Japan
  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    In this paper, we propose an automatic scoring method for the open answer task of the Japanese speaking test SJ-CAT. The proposed method first extracts a set of features from an input answer utterance and then uses SVR (support vector regression) to estimate the vocabulary richness score assigned by human raters, which ranges from 0 to 4. We devised a novel set of features: text statistics weighted by word reliability, which assess the abundance of vocabulary and expression, and the degree of word relevance based on hierarchical distance in a thesaurus, which evaluates the suitability of the vocabulary. We confirmed experimentally that the proposed method provides good estimates of the human richness score, with a correlation coefficient of 0.92 and an RMSE (root mean square error) of 0.56. We also showed that the proposed method is relatively robust to differences in examinees and in questions between training and testing.
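    The two evaluation measures named in the abstract, the correlation coefficient and the RMSE between human and estimated scores, can be illustrated with a minimal sketch. It assumes the correlation coefficient is Pearson's r, which the abstract does not state. The function names and the score lists are hypothetical and are not taken from the paper.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def rmse(xs, ys):
    """Root mean square error between two equal-length score lists."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(xs, ys)) / len(xs))

# Hypothetical data: human richness scores on the 0-4 scale and
# made-up estimates standing in for a regressor's output.
human = [0.0, 1.0, 2.0, 3.0, 4.0]
predicted = [0.5, 1.0, 2.5, 2.5, 3.5]

print(f"correlation = {pearson(human, predicted):.2f}")
print(f"RMSE        = {rmse(human, predicted):.2f}")
```

    A high correlation with a non-trivial RMSE, as reported in the abstract, would mean the estimates rank answers much as the human raters do while still deviating from the human scores in absolute terms.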
  • Keywords
    computer based training; correlation methods; feature extraction; linguistics; mean square error methods; natural language processing; regression analysis; support vector machines; thesauri; RMSE; SJ-CAT speaking test; SVR; automatic scoring method; correlation coefficient; feature extraction; human richness score estimation; open answer task; root mean square error; speaking Japanese computerized adaptive test; support vector regression; text statistics; vocabulary abundance assess; vocabulary estimation; word relevance degree; word reliability; Correlation; Feature extraction; Reliability; Speech recognition; Thesauri; Training; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
  • Conference_Location
    Siem Reap
  • Type

    conf

  • DOI
    10.1109/APSIPA.2014.7041583
  • Filename
    7041583