Title :
Phone-to-word decoding through statistical machine translation and complementary system combination
Author :
Falavigna, D. ; Gerosa, M. ; Gretter, R. ; Giuliani, D.
Author_Institution :
Human Language Technol. Res. Unit, FBK - Fondazione Bruno Kessler, Povo, Italy
fDate :
Nov. 13 2009-Dec. 17 2009
Abstract :
In this paper, phone-to-word transduction is first investigated by coupling a speech recognizer, generating for each speech segment a phone sequence or a phone confusion network, with the efficient decoder of confusion networks adopted by MOSES, a popular statistical machine translation toolkit. Then, system combination is investigated by combining the outputs of several conventional ASR systems with the output of a system embedding phone-to-word decoding through statistical machine translation. Experiments are carried out in the context of a large vocabulary speech recognition task consisting of transcription of speeches delivered in English during the European Parliament Plenary Sessions (EPPS). While only a marginal performance improvements is achieved in system combination experiments when the output of the phone-to-word transducer is included in the combination, partial results show a great potential for improvements.
Keywords :
language translation; speech coding; speech recognition equipment; ASR systems; European parliament plenary sessions; MOSES; complementary system combination; confusion networks decoder; large vocabulary speech recognition task; phone confusion network; phone-to-word decoding; speech recognizer; statistical machine translation; Adaptation model; Audio recording; Automatic speech recognition; Decision trees; Decoding; Humans; Mel frequency cepstral coefficient; Speech recognition; Transducers; Vocabulary; ASR system combination; phone-to-word transducer; word graph rescoring;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
DOI :
10.1109/ASRU.2009.5373281