Title :
Improved statistical models for SMT-based speaking style transformation
Author :
Neubig, Graham ; Akita, Yuya ; Mori, Shinsuke ; Kawahara, Tatsuya
Author_Institution :
Sch. of Inf., Kyoto Univ., Kyoto, Japan
Abstract :
Automatic speech recognition (ASR) results contain not only ASR errors, but also disfluencies and colloquial expressions that must be corrected to create readable transcripts. We take the approach of statistical machine translation (SMT) to “translate” from ASR results into transcript-style text. We introduce two novel modeling techniques in this framework: a context-dependent translation model, which allows for usage of context to accurately model translation probabilities, and log-linear interpolation of conditional and joint probabilities, which allows for frequently observed translation patterns to be given higher priority. The system is implemented using weighted finite state transducers (WFST). On an evaluation using ASR results and manual transcripts of meetings of the Japanese Diet (national congress), the proposed methods showed a significant increase in accuracy over traditional modeling techniques.
Keywords :
finite state machines; language translation; speech recognition; SMT based speaking style transformation; automatic speech recognition; context dependent translation model; log linear interpolation; statistical machine translation; weighted finite state transducer; Automatic speech recognition; Context modeling; Error correction; Informatics; Interpolation; Manuals; Parameter estimation; Probability; Surface-mount technology; Transducers; log-linear models; speaking style transformation; weighted finite state transducers;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5494997