Title :
Children´s speech recognition with application to interactive books and tutors
Author :
Hagen, Andreas ; Pellom, B. ; Cole, Ronald
Author_Institution :
Center for Spoken Language Res., Colorado Univ., Boulder, CO, USA
fDate :
30 Nov.-3 Dec. 2003
Abstract :
We present initial work towards development of a children´s speech recognition system for use within an interactive reading and comprehension training system. We first describe the Colorado Literacy Tutor project and two corpora collected for children´s speech recognition research. Next, baseline speech recognition experiments are performed to illustrate the degree of acoustic mismatch for children in grades K through 5. It is shown that an 11.2% relative reduction in word error rate can be achieved through vocal tract normalization applied to children´s speech. Finally, we describe our baseline system for automatic recognition of spontaneously spoken story summaries. It is shown that a word error rate of 42.6% is achieved on the presented children´s story summarization task after using unsupervised MAPLR (maximum a posteriori linear regression) adaptation and VTLN (vocal tract length normalization) to compensate for inter-speaker acoustic variability. Based on this result, we point to promising directions for further study.
Keywords :
computer based training; error statistics; interactive systems; speech recognition; unsupervised learning; Colorado Literacy Tutor; VTLN; acoustic mismatch; child speech recognition; comprehension training; inter-speaker acoustic variability; interactive books; interactive reading training; interactive training; interactive tutors; maximum a posteriori linear regression; spontaneously spoken story summaries; vocal tract length normalization; vocal tract normalization; word error rate; Automatic speech recognition; Books; Error analysis; Face recognition; Frequency; Natural languages; Parameter estimation; Speech analysis; Speech recognition; Vocabulary;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
DOI :
10.1109/ASRU.2003.1318426