مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised vocabulary selection for real-time speech recognition of lectures

DocumentCode :

3163593

Title :

Unsupervised vocabulary selection for real-time speech recognition of lectures

Author :

Maergner, Paul ; Waibel, Alex ; Lane, Ian

Author_Institution :

Carnegie Mellon Univ., Pittsburgh, PA, USA

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4417

Lastpage :

4420

Abstract :

In this work, we propose a novel method for vocabulary selection to automatically adapt automatic speech recognition systems to the diverse topics that occur in educational and scientific lectures. Utilizing materials that are available before the lecture begins, such as lecture slides, our proposed framework iteratively searches for related documents on the web and generates a lecture-specific vocabulary based on the resulting documents. In this paper, we propose a novel method for vocabulary selection where we first collect documents similar to an initial seed document and then rank the resulting vocabulary based on a score which is calculated using a combination of word features. This is a critical component for adaptation that has typically been overlooked in prior works. On the inter ACT German-English simultaneous lecture translation system our proposed approach significantly improved vocabulary coverage, reducing the out-of-vocabulary rate, on average by 57.0% and up to 84.9%, compared to a lecture-independent baseline. Furthermore, our approach reduced the word error rate, by 12.5% on average and up to 25.3%, compared to a lecture-independent baseline.

Keywords :

educational administrative data processing; natural languages; speech recognition; vocabulary; ACT German-English simultaneous lecture translation system; automatic speech recognition systems; educational lectures; lecture-independent baseline; lecture-specific vocabulary; real-time speech recognition; scientific lectures; unsupervised vocabulary selection; Accuracy; Adaptation models; Automatic speech recognition; Real time systems; Speech; Vocabulary; Vocabulary selection; automatic speech recognition; language model adaptation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288899

Filename :

6288899

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3163593