Title :
Topic-Independent Speaking-Style Transformation of Language Model for Spontaneous Speech Recognition
Author :
Akita, Yuya ; Kawahara, Toshio
Author_Institution :
Acad. Center for Comput. & Media Studies, Kyoto Univ., Japan
Abstract :
For language modeling of spontaneous speech, we propose a novel approach, based on the statistical machine translation framework, which transforms a document-style model to the spoken style. For better coverage and more reliable estimation, incorporation of POS (part-of-speech) information is explored in addition to lexical information. In this paper, we investigate several methods that combine POS-based model or integrate POS information in the ME (maximum entropy) scheme. They achieve significant reduction in perplexity and WER in a meeting transcription task. Moreover, the model is applied to different domains or committee meetings of different topics. As a result, even larger perplexity reduction is achieved compared with the case tested in the same domain. The result demonstrates the generality and portability of the model.
Keywords :
language translation; maximum entropy methods; speech recognition; statistical analysis; document-style model; language model; lexical information; maximum entropy scheme; part-of-speech information; spontaneous speech recognition; statistical machine translation framework; topic-independent speaking-style transformation; Automatic speech recognition; Costs; Databases; Differential equations; Entropy; Natural languages; Probability; Speech recognition; Speech synthesis; Testing; automatic speech recognition; language model; spontaneous speech; statistical transformation;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367156