DocumentCode
3246417
Title
Approaches to gathering realistic training data for speech translation systems
Author
Bretan, Ivan ; Eklund, Robert ; MacDermid, Catriona
Author_Institution
Telia Res. AB, Haninge, Sweden
fYear
1996
fDate
30 Sep-1 Oct 1996
Firstpage
97
Lastpage
100
Abstract
The Spoken Language Translator (SLT) is a multi-lingual speech-to-speech translation prototype supporting English, Swedish and French within the air traffic information system (ATIS) domain. The design of SLT is characterized by a strongly corpus-driven approach, which accentuates the need for cost-efficient collection procedures to obtain training data. This paper discusses various approaches to the data collection issue pursued within a speech translation framework. Original American English speech and language data have been collected using traditional Wizard-of-Oz (WOZ) techniques, a relatively costly procedure yielding high-quality results. The resulting corpus has been translated textually into Swedish by a large number of native speakers (427) and used as prompts for training the target language speech model. This “budget” collection method is compared to the accepted method, i.e., gathering data by means of a full-blown WOZ simulation. The results indicate that although translation in this case proved economical and produced considerable data, the method is not sensitive to certain features typical of spoken language, for which WOZ is superior
Keywords
air traffic control; language translation; natural language interfaces; speech recognition; English; French; Spoken Language Translator; Swedish; Wizard-of-Oz techniques; air traffic information system; corpus-driven approach; cost-efficient collection procedures; data collection; multi-lingual speech-to-speech translation prototype; realistic training data gathering; speech translation systems; Costs; Frequency; Humans; Information systems; Large-scale systems; Natural languages; Prototypes; Speech recognition; Training data; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Interactive Voice Technology for Telecommunications Applications, 1996. Proceedings., Third IEEE Workshop on
Conference_Location
Basking Ridge, NJ
Print_ISBN
0-7803-3238-5
Type
conf
DOI
10.1109/IVTTA.1996.552770
Filename
552770
Link To Document