Title : 
Improving Speech Transcription for Mandarin-English Translation
         
        
            Author : 
Tomalin, M. ; Gales, Mark J.F. ; Liu, Xueqin Amy ; Sim, Khe Chai ; Sinha, Roopak ; Wang, Lingfeng ; Woodland, Philip C. ; Yu, Kaiyuan
         
        
            Author_Institution : 
Dept. of Eng., Cambridge Univ., UK
         
        
        
        
        
            Abstract : 
This paper describes the development of the CU-HTK Mandarin speech-to-text (STT) system and assesses its performance as part of a transcription-translation pipeline that converts broadcast Mandarin audio into English text. Recent improvements to the STT system are described; these reduce the character error rate (CER) by 14.3% absolute on a broadcast conversation (BC) task and by 5.1% absolute on a broadcast news (BN) task. The STT output is then post-processed into sentence-like segments and translated into English text using a statistical machine translation (SMT) system. The performance of the transcription-translation pipeline is evaluated using the translation edit rate (TER) and BLEU metrics. It is shown that improving both the STT system and the post-STT segmentation can lower TER scores by up to 5.3% absolute and increase BLEU scores by up to 2.7% absolute.
         
        
            Keywords : 
language translation; natural language processing; speech processing; CU-HTK Mandarin speech-to-text; Mandarin-English translation; broadcast conversation; broadcast news; character error rate; speech transcription; statistical machine translation; transcription-translation pipeline; translation edit rate; Acoustic testing; Broadcasting; Computer numerical control; Lattices; Maximum likelihood linear regression; Natural languages; Pipelines; Speech; Surface-mount technology; Training data; Machine Translation; Sentence Boundary Detection; Speech Recognition;
         
        
        
        
            Conference_Title : 
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
         
        
            Conference_Location : 
Honolulu, HI
         
        
        
            Print_ISBN : 
1-4244-0727-3
         
        
        
            DOI : 
10.1109/ICASSP.2007.367172