• DocumentCode
    2347294
  • Title

    N-gram language models for Polish language. Basic concepts and applications in automatic speech recognition systems

  • Author

    Rapp, Bartosz

  • Author_Institution
    Lab. of Language & Speech Technol., Poznan
  • fYear
    2008
  • fDate
    20-22 Oct. 2008
  • Firstpage
    321
  • Lastpage
    324
  • Abstract
    The use of language models in automatic speech recognition systems usually gives a significant improvement in the quality and certainty of recognition outcomes. On the other hand, wrongly chosen or poorly trained language models can seriously degrade not only recognition quality but also the overall performance of the system. Proper selection of the language material, the system parameters and the representation of the model itself is an important task in the language model construction process. This paper describes basic aspects of building, evaluating and applying language models for the Polish language in automatic speech recognition systems intended for use by lawyers' chambers, the judiciary and law enforcement. Language modeling is part of a project that is still at an early stage of development, and work is ongoing, so only some basic concepts and ideas are presented in this paper.
  • Keywords
    natural languages; speech recognition; N-gram language models; Polish language; automatic speech recognition systems; language material; language models construction process; system parameters; Application software; Automatic speech recognition; Building materials; Computer science; Degradation; Information technology; Laboratories; Natural languages; Speech analysis; Speech recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
  • Conference_Location
    Wisla
  • Print_ISBN
    978-83-60810-14-9
  • Type

    conf

  • DOI
    10.1109/IMCSIT.2008.4747259
  • Filename
    4747259