DocumentCode :
2307125
Title :
Area code, country code, and time difference information system and its field trial
Author :
Kato, Tsuneo ; Kuroiwa, Shingo ; Higuchi, Norio
Author_Institution :
Speech Technol. & Appl. Lab., KDD R&D Labs. Inc., Saitama, Japan
fYear :
1998
fDate :
29-30 Sep 1998
Firstpage :
5
Lastpage :
10
Abstract :
This paper describes an ASR system which responds to customer inquiries over a telephone network. Customer inquiries on area codes, country codes and time differences are one of the most popular items in information service of international telecommunication. This system called ACTIS (Area code, Country code and Time difference Information System) responds to these inquiries. ACTIS recognizes Japanese continuous speech and vocabulary including the names of 299 countries and 721 major cities throughout the world. We report on several technical features of the system including (1) new acoustic models using additional acoustic parameters, that is, acceleration of MFCC parameters and a log energy for increasing the recognition rate, (2) CMS (cepstrum mean subtraction) with compensation by recognition results for normalizing various channel characteristics and real-time operation, and (3) robust speech detection and out-of-vocabulary word detection for improving robustness to ambient noise, irrelevant sound and out-of-vocabulary words, along with their effects on computer simulation. We also report on results of field trial at KDD and a subjective assessment by users
Keywords :
automatic telephone systems; speech recognition; telephone networks; ACTIS; ASR system; Japanese; KDD; MFCC parameters; acoustic models; acoustic parameters; ambient noise robustness; area code; cepstrum mean subtraction; channel characteristics normalisation; compensation; computer simulation; continuous speech recognition; countries; country code; customer inquiries; field trial; international telecommunication; irrelevant sound; log energy; major cities; out-of-vocabulary word detection; out-of-vocabulary words; real-time operation; recognition rate; robust speech detection; subjective assessment; telephone network; time difference information system; vocabulary; Acceleration; Acoustic signal detection; Automatic speech recognition; Character recognition; Cities and towns; Information systems; Noise robustness; Speech recognition; Telephony; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Interactive Voice Technology for Telecommunications Applications, 1998. IVTTA '98. Proceedings. 1998 IEEE 4th Workshop
Conference_Location :
Torino
Print_ISBN :
0-7803-5028-6
Type :
conf
DOI :
10.1109/IVTTA.1998.727684
Filename :
727684
Link To Document :
بازگشت