• DocumentCode
    323796
  • Title

    Serbo-Croatian LVCSR on the dictation and broadcast news domain

  • Author

    Scheytt, Peter ; Geutner, Petra ; Waibel, Alex

  • Author_Institution
    Interactive Syst. Labs., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    2
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    897
  • Abstract
    This paper describes the development of a Serbo-Croatian dictation and broadcast news speech recognizer. The intention is to generate an automatic text transcription of a news show, which will be submitted to a multilingual informedia database. We outline the complete system development process using the JanusRTk, beginning with data collection, design and training of the parameters, tuning and evaluation. We report on general recognition techniques like segmentation, adaptation and language model interpolation, as well as language specific problems, e.g. high OOV rate due to inflected word forms. We show that even with a low amount of acoustic training data, combined with Web based interpolated language models, it is sufficient to build up a fairly reliable automatic news transcription system, which yields a performance of 36.0% word error (WE)
  • Keywords
    acoustic signal processing; broadcasting; dictation; interpolation; natural languages; speech recognition; speech synthesis; JanusRTk; Serbo-Croatian LVCSR; Web based interpolated language models; acoustic training data; adaptation; automatic news transcription system; automatic text transcription; broadcast news; data collection; dictation; high OOV rate; inflected word forms; language model interpolation; multilingual informedia database; performance; segmentation; speech recognizer; system design; system development; word error; Audio recording; Automatic speech recognition; Databases; Interpolation; Loudspeakers; Natural languages; Sampling methods; Satellite broadcasting; Speech recognition; TV;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.675410
  • Filename
    675410