Title :
Automatic scoring method for open answer task in the SJ-CAT speaking test considering utterance difficulty level
Author :
Hao Lu ; Yamada, Takeshi ; Imai, Shingo ; Shinozaki, Takahiro ; Nisimura, Ryuichi ; Ishizuka, Kenkichi ; Makino, Shoji ; Kitawaki, Nobuhiko
Author_Institution :
Univ. of Tsukuba, Tsukuba, Japan
Abstract :
In this paper, we propose an automatic scoring method for the open answer task of the Japanese speaking test SJ-CAT. The proposed method first extracts a set of features from an input answer utterance and then uses SVR (support vector regression) to estimate the vocabulary richness score assigned by human raters, which ranges from 0 to 4. We devised a novel set of features: text statistics weighted by word reliability, which assess the abundance of vocabulary and expression, and the degree of word relevance, based on hierarchical distance in a thesaurus, which evaluates the suitability of the vocabulary. We confirmed experimentally that the proposed method provides good estimates of the human richness score, with a correlation coefficient of 0.92 and an RMSE (root mean square error) of 0.56. We also showed that the proposed method is relatively robust to differences among examinees and among questions used for training and testing.
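The two agreement measures reported in the abstract, the correlation coefficient and the RMSE, can be illustrated with a minimal sketch. It assumes the correlation coefficient is the Pearson coefficient, which the abstract does not specify. The function names `pearson_r` and `rmse` and the sample scores are hypothetical and are not data from the paper.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / math.sqrt(var_x * var_y)

def rmse(xs, ys):
    """Root mean square error between two equal-length score lists."""
    if len(xs) != len(ys) or not xs:
        raise ValueError("need two non-empty lists of equal length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(xs, ys)) / len(xs))

# Hypothetical scores on the 0-4 richness scale (illustration only).
human = [0.0, 1.0, 2.0, 3.0, 4.0]
predicted = [0.5, 1.0, 2.5, 2.5, 3.5]

print("correlation:", round(pearson_r(human, predicted), 3))
print("RMSE:", round(rmse(human, predicted), 3))
```

A correlation close to 1 indicates that the estimated scores rank answers much as the human raters do, while the RMSE expresses the typical size of the estimation error in score points on the same 0 to 4 scale.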
Keywords :
computer based training; correlation methods; feature extraction; linguistics; mean square error methods; natural language processing; regression analysis; support vector machines; thesauri; RMSE; SJ-CAT speaking test; SVR; automatic scoring method; correlation coefficient; feature extraction; human richness score estimation; open answer task; root mean square error; speaking Japanese computerized adaptive test; support vector regression; text statistics; vocabulary abundance assess; vocabulary estimation; word relevance degree; word reliability; Correlation; Feature extraction; Reliability; Speech recognition; Thesauri; Training; Vocabulary;
Conference_Title :
Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
Conference_Location :
Siem Reap
DOI :
10.1109/APSIPA.2014.7041583