Title :
Automatic processing of broadcast audio in multiple languages
Author :
Lori Lamel;Jean-Luc Gauvain
Author_Institution :
Spoken Language Processing Group, LIMSI-CNRS, B.P. 133, 91403 Orsay cedex, France
Abstract :
This paper addresses recent progress in LVCSR in multiple languages which has enabled the processing of broadcast audio for information access. At LIMSI, broadcast news transcription systems have been developed for seven languages. Automatic processing to access the content must take into account the specificities of audio data, such as needing to deal with the continuous data stream and an imperfect word transcription, and specificities of the language. Some near-term applications are audio data mining, structurization of audiovisual archives, selective dissemination of information and media monitoring.
Keywords :
"Hidden Markov models","Adaptation models","Media","Acoustics","Data mining","Speech","Speech recognition"
Conference_Titel :
Signal Processing Conference, 2002 11th European