Title :
Improving arabic broadcast transcription using automatic topic clustering
Author :
Chu, Stephen M. ; Mangu, Lidia
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Latent Dirichlet Allocation (LDA) has been shown to be an effective model to augment n-gram language models in speech recognition applications. In this work, we aim to take advantage of the superior unsupervised learning ability of the framework, and use it to uncover topic structure embedded in the corpora in an entirely data-driven fashion. In addition, we describe a bi-level inference and classification method that allows topic clustering at the utterance level while preserving the document-level topic structures. We demonstrate the effectiveness of the proposed topic clustering pipeline in a state-of-the-art Arabic broadcast transcription system. Experiments show that optimizing LM in the LDA topic space leads to 5% reduction in language model perplexity. It is further shown that topic clustering and adaptation is able to attain 0.4% absolute word error rate reduction on the GALE Arabic task.
Keywords :
broadcasting; document handling; inference mechanisms; natural language processing; pattern classification; pattern clustering; speech recognition; unsupervised learning; GALE Arabic task; LDA topic space; augment n-gram language models; automatic topic clustering; bi-level inference; classification method; corpora; data-driven fashion; document-level topic structures; language model perplexity; latent Dirichlet allocation; speech recognition applications; state-of-the-art Arabic broadcast transcription system; superior unsupervised learning ability; topic clustering pipeline; utterance level; word error rate reduction; Adaptation models; Hidden Markov models; Interpolation; Optimization; Semantics; Speech; Training; Arabic ASR; language modeling; topic clustering;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288907