DocumentCode :
3278904
Title :
Multilingual connected digits and natural numbers recognition in the telephone speech dialog systems
Author :
Imperl, Bojan
Author_Institution :
Fac. of Electr. Eng. & Comput. Sci., Maribor Univ., Slovenia
Volume :
1
fYear :
1999
fDate :
1999
Firstpage :
188
Abstract :
This paper presents the system for multilingual connected digits recognition and natural numbers recognition over the telephone. System bases on language dependent HMM recognisers operating in parallel and is currently implemented for the Slovene and German language. It has been developed with the HTK V2.1.1 with improved frontend processing module. The language dependent recognisers were trained using the Slovene and German SpeechDat(II) databases. The system was tested in both a monolingual and in a multilingual environment in the off-line and on-line mode of operation. For off-line testing the speech files from the Slovene and German SpeechDat(II) databases were used, while for on-line testing the system was integrated into a Reverse Directory System O-TEL (a real world environment). During the testing of the system, various HMM architectures were tested (whole-word models, phone models, triphone models) in order to find the recogniser configuration that is the most efficient in a multilingual environment. Experiments have shown the high recognition accuracy in a single-language environment as well as in a multilingual environment. The multilingual tests have also pointed out the high language identification rate of the multilingual system. A number of advantages of multilingual speech recognition in speech dialog systems were found while experimenting with the O-TEL system
Keywords :
hidden Markov models; natural languages; speech recognition; telephony; voice communication; German; HMM architectures; HTK V2.1.1; Reverse Directory System O-TEL; Slovene; SpeechDat(II) database; frontend processing module; high recognition accuracy; language dependent HMM recognisers; language dependent recognisers; language identification rate; monolingual environment; multilingual connected digits recognition; multilingual environment; multilingual natural numbers recognition; multilingual speech recognition; multilingual tests; off-line operation; on-line operation; phone models; real world environment; recogniser configuration; speech dialog systems; speech files testing; telephone speech dialog systems; triphone models; whole-word models; Computational complexity; Computer science; Databases; Hidden Markov models; Natural languages; Rail transportation; Speech recognition; System testing; Telephony; Voice mail;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Industrial Electronics, 1999. ISIE '99. Proceedings of the IEEE International Symposium on
Conference_Location :
Bled
Print_ISBN :
0-7803-5662-4
Type :
conf
DOI :
10.1109/ISIE.1999.801782
Filename :
801782
Link To Document :
بازگشت