DocumentCode :
1749650
Title :
Automatic transcription of compressed broadcast audio
Author :
Barras, Claude ; Lamel, Lori ; Gauvain, Jean-Luc
Author_Institution :
Lab. d´´Informatique pour la Mecanique et les Sci. de l´´Ingenieur, CNRS, Orsay, France
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
265
Abstract :
With increasing volumes of audio and video data broadcast over the Web, it is of interest to assess the performance of state-of-the-art automatic transcription systems on compressed audio data for media indexation applications. In this paper the performance of the LIMSI 10x French broadcast news transcription system is measured on a two-hour audio set for a range of MP3 and RealAudio codecs at various bit rates and the GSM codec used for European cellular phone communications. The word error rates are compared with those obtained on high quality PCM recordings prior to compression. For a 6.5 kbps. audio bit rate (the most commonly used on the Web), word error rates under 40% can be achieved, which makes automatic media monitoring systems over the Web a realistic task
Keywords :
Internet; broadcasting; data compression; error statistics; indexing; speech coding; speech recognition; 6.5 kbit/s; GSM codec; LIMSI IN French broadcast news transcription system; MP3 codecs; RealAudio codecs; automatic media monitoring systems; automatic transcription; compressed broadcast audio; media indexation applications; word error rates; Bit rate; Broadcasting; Cellular phones; Codecs; Digital audio players; Error analysis; GSM; Multimedia communication; Phase change materials; Video compression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.940818
Filename :
940818
Link To Document :
بازگشت