Title :
Japanese speech databases for robust speech recognition
Author :
Nakamura, Atsushi ; Matsunaga, Shoichi ; Shimizu, Tohru ; Tonomura, Masahiro ; Sagisaka, Yoshinori
Author_Institution :
ATR Interpreting Telephony Res. Labs., Kyoto, Japan
Abstract :
At ATR, a next-generation speech translation system is under development rewards natural trans-language communication. To cope with the various requirements to speech recognition technology for the new system, further research efforts should emphasize the robustness for large vocabulary, speaking variations often found in fast spontaneous speech and speaker variances. These are key problems to be solved nor only for speech translation but also for the general use of speech recognition in real environments. Three large speech databases are designed to cope with these problems in speech recognition and the current status of data collection is reported
Keywords :
data acquisition; language translation; speech recognition; ATR; Japanese speech databases; data collection; fast spontaneous speech; large vocabulary robustness; natural trans-language communication; next-generation speech translation system; robust speech recognition; speaker variances; speaking variations; speech recognition; Buildings; Humans; Natural languages; Robustness; Spatial databases; Speech recognition; Standards development; Target recognition; Vocabulary;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607241