DocumentCode :
1874242
Title :
Non-native English speech recognition using bilingual English lexicon and acoustic models
Author :
Matsunaga, S. ; Ogawa, A. ; Yamaguchi, Y. ; Imamura, A.
Author_Institution :
NTT Cyber Space Labs., Kanagawa, Japan
Volume :
3
fYear :
2003
fDate :
6-9 July 2003
Abstract :
This paper proposes an English speech recognition system which can recognize both non-native (i.e. Japanese) and native English speaker´s pronunciation of English speech. The system uses a bilingual pronunciation lexicon in which each word has both English and Japanese phoneme transcriptions. The Japanese transcription is constructed considering typical Japanese pronunciation of English. Japanese and English acoustic models are used in recognizing both transcriptions, and the highest-likelihood word sequence obtained in combining with native English- and Japanese-pronounced words is the recognition results. Continuous speech recognition experiments show that the proposed system greatly improves Japanese-English speech recognition performance while maintaining the same performance levels as that of a purely native English recognition system.
Keywords :
audio databases; natural languages; speech recognition; English pronounced word; English transcription; Japanese phoneme transcriptions; Japanese pronounced word; acoustic models; bilingual English lexicon; bilingual pronunciation lexicon; highest-likelihood word sequence; nonnative English speech recognition; Acoustic signal detection; Adaptation model; Databases; Information retrieval; Loudspeakers; Man machine systems; Natural languages; Portals; Speech analysis; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
Type :
conf
DOI :
10.1109/ICME.2003.1221389
Filename :
1221389
Link To Document :
بازگشت