مرکز منطقه ای اطلاع رساني علوم و فناوري - Comparison of language models trained on written texts and speech transcripts in the context of automatic speech recognition

DocumentCode :

3686703

Title :

Comparison of language models trained on written texts and speech transcripts in the context of automatic speech recognition

Author :

Sebastian Dziadzio;Aleksandra Nabożny;Aleksander Smywiński-Pohl;Bartosz Ziółko

Author_Institution :

AGH University of Science and Technology, Faculty of Computer Science, Electronics and Telecommunications, Krakow, Poland

fYear :

2015

Firstpage :

193

Lastpage :

197

Abstract :

We investigate whether language models used in automatic speech recognition (ASR) should be trained on speech transcripts rather than on written texts. By calculating log-likelihood statistic for part-of-speech (POS) n-grams, we show that there are significant differences between written texts and speech transcripts. We also test the performance of language models trained on speech transcripts and written texts in ASR and show that using the former results in greater word error reduction rates (WERR), even if the model is trained on much smaller corpora. For our experiments we used the manually labeled one million subcorpus of the National Corpus of Polish and an HTK acoustic model.

Keywords :

"Speech","Computational modeling","Automatic speech recognition","Computer science","Acoustics","Tagging"

Publisher :

ieee

Conference_Titel :

Computer Science and Information Systems (FedCSIS), 2015 Federated Conference on

Type :

conf

DOI :

10.15439/2015F386

Filename :

7321441

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3686703