Title :
Non-native English speech recognition using bilingual English lexicon and acoustic models
Author :
Matsunaga, S. ; Ogawa, A. ; Yamaguchi, Y. ; Imamura, A.
Author_Institution :
NTT Cyber Space Labs., Kanagawa, Japan
Abstract :
This paper proposes an English speech recognition system which can recognize both non-native (i.e. Japanese) and native English speaker´s pronunciation of English speech. The system uses a bilingual pronunciation lexicon in which each word has both English and Japanese phoneme transcriptions. The Japanese transcription is constructed considering typical Japanese pronunciation of English. Japanese and English acoustic models are used in recognizing both transcriptions, and the highest-likelihood word sequence obtained in combining with native English- and Japanese-pronounced words is the recognition results. Continuous speech recognition experiments show that the proposed system greatly improves Japanese-English speech recognition performance while maintaining the same performance levels as that of a purely native English recognition system.
Keywords :
audio databases; natural languages; speech recognition; English pronounced word; English transcription; Japanese phoneme transcriptions; Japanese pronounced word; acoustic models; bilingual English lexicon; bilingual pronunciation lexicon; highest-likelihood word sequence; nonnative English speech recognition; Acoustic signal detection; Adaptation model; Databases; Information retrieval; Loudspeakers; Man machine systems; Natural languages; Portals; Speech analysis; Speech recognition;
Conference_Titel :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
DOI :
10.1109/ICME.2003.1221389