DocumentCode
1749650
Title
Automatic transcription of compressed broadcast audio
Author
Barras, Claude ; Lamel, Lori ; Gauvain, Jean-Luc
Author_Institution
Lab. d'Informatique pour la Mecanique et les Sci. de l'Ingenieur, CNRS, Orsay, France
Volume
1
fYear
2001
fDate
2001
Firstpage
265
Abstract
With increasing volumes of audio and video data broadcast over the Web, it is of interest to assess the performance of state-of-the-art automatic transcription systems on compressed audio data for media indexation applications. In this paper, the performance of the LIMSI 10x French broadcast news transcription system is measured on a two-hour audio set for a range of MP3 and RealAudio codecs at various bit rates, and for the GSM codec used for European cellular phone communications. The word error rates are compared with those obtained on high-quality PCM recordings prior to compression. For a 6.5 kbps audio bit rate (the most commonly used on the Web), word error rates under 40% can be achieved, which makes automatic media monitoring over the Web a realistic task.
Keywords
Internet; broadcasting; data compression; error statistics; indexing; speech coding; speech recognition; 6.5 kbit/s; GSM codec; LIMSI 10x French broadcast news transcription system; MP3 codecs; RealAudio codecs; automatic media monitoring systems; automatic transcription; compressed broadcast audio; media indexation applications; word error rates; Bit rate; Broadcasting; Cellular phones; Codecs; Digital audio players; Error analysis; GSM; Multimedia communication; Phase change materials; Video compression;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.940818
Filename
940818