DocumentCode :
3163748
Title :
Improving arabic broadcast transcription using automatic topic clustering
Author :
Chu, Stephen M. ; Mangu, Lidia
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4449
Lastpage :
4452
Abstract :
Latent Dirichlet Allocation (LDA) has been shown to be an effective model to augment n-gram language models in speech recognition applications. In this work, we aim to take advantage of the superior unsupervised learning ability of the framework, and use it to uncover topic structure embedded in the corpora in an entirely data-driven fashion. In addition, we describe a bi-level inference and classification method that allows topic clustering at the utterance level while preserving the document-level topic structures. We demonstrate the effectiveness of the proposed topic clustering pipeline in a state-of-the-art Arabic broadcast transcription system. Experiments show that optimizing LM in the LDA topic space leads to 5% reduction in language model perplexity. It is further shown that topic clustering and adaptation is able to attain 0.4% absolute word error rate reduction on the GALE Arabic task.
Keywords :
broadcasting; document handling; inference mechanisms; natural language processing; pattern classification; pattern clustering; speech recognition; unsupervised learning; GALE Arabic task; LDA topic space; augment n-gram language models; automatic topic clustering; bi-level inference; classification method; corpora; data-driven fashion; document-level topic structures; language model perplexity; latent Dirichlet allocation; speech recognition applications; state-of-the-art Arabic broadcast transcription system; superior unsupervised learning ability; topic clustering pipeline; utterance level; word error rate reduction; Adaptation models; Hidden Markov models; Interpolation; Optimization; Semantics; Speech; Training; Arabic ASR; language modeling; topic clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288907
Filename :
6288907
Link To Document :
بازگشت