DocumentCode :
323796
Title :
Serbo-Croatian LVCSR on the dictation and broadcast news domain
Author :
Scheytt, Peter ; Geutner, Petra ; Waibel, Alex
Author_Institution :
Interactive Syst. Labs., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
897
Abstract :
This paper describes the development of a Serbo-Croatian dictation and broadcast news speech recognizer. The intention is to generate an automatic text transcription of a news show, which will be submitted to a multilingual informedia database. We outline the complete system development process using the JanusRTk, beginning with data collection, design and training of the parameters, tuning and evaluation. We report on general recognition techniques like segmentation, adaptation and language model interpolation, as well as language specific problems, e.g. high OOV rate due to inflected word forms. We show that even with a low amount of acoustic training data, combined with Web based interpolated language models, it is sufficient to build up a fairly reliable automatic news transcription system, which yields a performance of 36.0% word error (WE)
Keywords :
acoustic signal processing; broadcasting; dictation; interpolation; natural languages; speech recognition; speech synthesis; JanusRTk; Serbo-Croatian LVCSR; Web based interpolated language models; acoustic training data; adaptation; automatic news transcription system; automatic text transcription; broadcast news; data collection; dictation; high OOV rate; inflected word forms; language model interpolation; multilingual informedia database; performance; segmentation; speech recognizer; system design; system development; word error; Audio recording; Automatic speech recognition; Databases; Interpolation; Loudspeakers; Natural languages; Sampling methods; Satellite broadcasting; Speech recognition; TV;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675410
Filename :
675410
Link To Document :
بازگشت