DocumentCode :
2768991
Title :
Robust topic inference for latent semantic language model adaptation
Author :
Heidel, Aaron ; Lee, Lin-shan
Author_Institution :
Nat. Taiwan Univ. Taipei, Taipei
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
177
Lastpage :
182
Abstract :
We perform topic-based, unsupervised language model adaptation under an N-best rescoring framework by using previous-pass system hypotheses to infer a topic mixture which is used to select topic-dependent LMs for interpolation with a topic-independent LM. Our primary focus is on techniques for improving the robustness of topic inference for a given utterance with respect to recognition errors, including the use of ASR confidence and contextual information from surrounding utterances. We describe a novel application of metadata-based pseudo-story segmentation to language model adaptation, and present good improvements to character error rate on multi-genre GALE Project data in Mandarin Chinese.
Keywords :
computational linguistics; indexing; inference mechanisms; interpolation; linguistics; unsupervised learning; N-best rescoring framework; interpolation; latent semantic language model adaptation; metadata-based pseudostory segmentation; previous-pass system hypotheses; topic inference; topic-based unsupervised language model adaptation; topic-independent LM; Adaptation model; Automatic speech recognition; Broadcasting; Computer science; Context modeling; Error analysis; Interpolation; Linear discriminant analysis; Natural languages; Robustness; language model adaptation; speech recognition; story segmentation; topic modeling; unsupervised adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430105
Filename :
4430105
Link To Document :
بازگشت