• DocumentCode
    555949
  • Title

    Automatic speech recognition for Polish in a computer game interface

  • Author

    Janicki, Artur ; Wawer, Dariusz

  • Author_Institution
    Inst. of Telecommun., Warsaw Univ. of Technol., Warsaw, Poland
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    711
  • Lastpage
    716
  • Abstract
    The paper describes the process of designing a task-oriented continuous speech recognition system for Polish, based on CMU Sphinx 4, to be used in the voice interface of a computer game called Rally Navigator. The concept of the game is presented, and the stages of creating the acoustic model and the language model are described in detail, taking into account the specifics of the Polish language. Results of initial experiments show that as little as 15 minutes of audio material is enough to produce a highly effective single-speaker command-and-control ASR system for the computer game, providing a sentence recognition accuracy of 97.6%. Results of the system adaptation for a new speaker are presented. It is also shown that the statistical trigram-based language model with negative trigrams yields the best recognition results.
  • Keywords
    computer games; natural language processing; speech recognition; speech-based user interfaces; statistical analysis; CMU Sphinx 4; Polish language; Rally Navigator; acoustic model; automatic speech recognition; computer game interface; single-speaker command-and-control ASR system; statistic trigram-based language model; task-oriented continuous speech recognition; voice interface; Accuracy; Acoustics; Computers; Games; Hidden Markov models; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Science and Information Systems (FedCSIS), 2011 Federated Conference on
  • Conference_Location
    Szczecin
  • Print_ISBN
    978-1-4577-0041-5
  • Electronic_ISBN
    978-83-60810-35-4
  • Type

    conf

  • Filename
    6078265