Title :
A multilingual phoneme and model set: toward a universal base for automatic speech recognition
Author :
Gokcen, S. ; Gokcen, J.M.
Author_Institution :
Lucent Technol., Columbus, OH, USA
Abstract :
The amount of time, effort and expense that is required to incorporate a new language into an ASR system is extensive. It also is usually not possible to provide more than one language for speech recognition per system. The Core Technology Group of Conversant Voice Information Systems, Bell Laboratories has developed a multilingual phoneme and model set (MPMS) that is being used as a base for all telephone-based ASR continuous speech systems being developed in new languages. With a unified set such as the MPMS, it is possible that multiple languages could be available on one system. While the idea of a multilingual phoneme model set is not new, there has been no work that has used a large, telephone-based database consisting of continuous speech samples in more than two languages, obtains commercially acceptable word recognition rates, and that is ready to be marketed. Our system´s phoneme set represents six different languages; we have built models based on three languages and tested them using two other languages (for which there were no models); and we have achieved very acceptable word recognition rates of better than 92% (field accuracy). These languages can be incorporated into an existing speech recognition system, available for customers
Keywords :
natural language interfaces; performance evaluation; speech recognition; telephony; very large databases; Bell Laboratories; Conversant Voice Information Systems; MPMS; large telephone-based database; multilingual phoneme model set; multiple languages; speech recognition; telephone-based continuous speech systems; word recognition rate; Automatic speech recognition; Databases; Dictionaries; Engines; Hidden Markov models; Information systems; Natural languages; Speech recognition; Statistical analysis; System testing;
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
DOI :
10.1109/ASRU.1997.659141