DocumentCode
555949
Title
Automatic speech recognition for polish in a computer game interface
Author
Janicki, Artur ; Wawer, Dariusz
Author_Institution
Inst. of Telecommun., Warsaw Univ. of Technol., Warsaw, Poland
fYear
2011
fDate
18-21 Sept. 2011
Firstpage
711
Lastpage
716
Abstract
The paper describes the process of designing a task-oriented continuous speech recognition system for Polish, based on CMU Sphinx 4, to be used in the voice interface of a computer game called Rally Navigator. The concept of the game is presented, the stages of creating the acoustic model and the language model are described in details, taking into account the specificity of the Polish language. Results of initial experiments show that as little as 15 minutes of audio material is enough to produce a highly effective single-speaker command-and-control ASR system for the computer game, providing the sentence recognition accuracy of 97.6%. Results of the system adaptation for a new speaker are presented. It is also showed that the statistic trigram-based language model with negative trigrams yields the best recognition results.
Keywords
computer games; natural language processing; speech recognition; speech-based user interfaces; statistical analysis; CMU Sphinx 4; Polish language; Rally Navigator; acoustic model; automatic speech recognition; computer game interface; single-speaker command-and-control ASR system; statistic trigram-based language model; task-oriented continuous speech recognition; voice interface; Accuracy; Acoustics; Computers; Games; Hidden Markov models; Speech; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Systems (FedCSIS), 2011 Federated Conference on
Conference_Location
Szczecin
Print_ISBN
978-1-4577-0041-5
Electronic_ISBN
978-83-60810-35-4
Type
conf
Filename
6078265
Link To Document