DocumentCode :
2347294
Title :
N-gram language models for Polish language. Basic concepts and applications in automatic speech recognition systems
Author :
Rapp, Bartosz
Author_Institution :
Lab. of Language & Speech Technol., Poznan
fYear :
2008
fDate :
20-22 Oct. 2008
Firstpage :
321
Lastpage :
324
Abstract :
Usage of language models in automatic speech recognition systems usually give significant quality and certainty improvement of recognition outcomes. On the other hand, wrongly chosen or trained language models can result in serious degradation not only recognition quality but also overall performance of the system. Proper selection of language material, system parameters and representation of the model itself is important task during language models construction process. This paper describes basic aspects of building, evaluating and applying language models for Polish language in automatic speech recognition systems, which are intended to be used by lawyer´s chambers, judiciary and law enforcements. Language modeling is a part of project which is still early stage of development and work is ongoing so only some basic concepts and ideas are presented in this paper.
Keywords :
natural languages; speech recognition; N-gram language models; Polish language; automatic speech recognition systems; language material; language models construction process; system parameters; Application software; Automatic speech recognition; Building materials; Computer science; Degradation; Information technology; Laboratories; Natural languages; Speech analysis; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
Conference_Location :
Wisia
Print_ISBN :
978-83-60810-14-9
Type :
conf
DOI :
10.1109/IMCSIT.2008.4747259
Filename :
4747259
Link To Document :
بازگشت