DocumentCode
699420
Title
Language modeling structures in audio transcription for retrieval of historical speeches
Author
Kurimo, Mikko ; Bowen Zhou ; Rongqing Huang ; Hansen, John H. L.
Author_Institution
Neural Networks Res. Centre, Helsinki Univ. of Technol., Helsinki, Finland
fYear
2004
fDate
6-10 Sept. 2004
Firstpage
557
Lastpage
560
Abstract
In this paper we apply speech recognition for automatic transcript generation for spoken document retrieval. The transcripts are used to compute an index for an archive of historical speeches and to provide the index, speech, and transcripts available for query based retrieval and browsing. In addition to acoustic variability, the task is challenging, because it covers a broad spectrum of different speaking styles and use of language. Language modeling is important for speech recognition to determine the prior probabilities of the compared word and sentence candidates in decoding. Various large text corpora are available in electronic format for language model training, but the open question is what and how should we include to improve the audio transcripts of this task. In this work we compare large overall language models to focused ones trained on selected subsets of the data, and to combinations between both. With respect to the potential index terms, improvements were obtained for transcripts that did not fit well to the scope of the large overall language model.
Keywords
information retrieval; probability; speech recognition; acoustic variability; audio transcription; automatic transcript generation; historical speeches; language modeling structures; large text corpora; query based retrieval; speech recognition; spoken document retrieval; Abstracts; Adaptation models; Indexes; Optical imaging; Scholarships; Speech; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2004 12th European
Conference_Location
Vienna
Print_ISBN
978-320-0001-65-7
Type
conf
Filename
7079950
Link To Document