Automatic processing of broadcast audio in multiple languages

Author

Lori Lamel;Jean-Luc Gauvain

Author_Institution

Spoken Language Processing Group, LIMSI-CNRS, B.P. 133, 91403 Orsay cedex, France

fYear

2002

Firstpage

Lastpage

Abstract

This paper addresses recent progress in large vocabulary continuous speech recognition (LVCSR) in multiple languages, which has enabled the processing of broadcast audio for information access. At LIMSI, broadcast news transcription systems have been developed for seven languages. Automatic processing to access the content must take into account the specific characteristics of audio data, such as the need to handle a continuous data stream and an imperfect word transcription, as well as the characteristics of each language. Near-term applications include audio data mining, structuring of audiovisual archives, selective dissemination of information, and media monitoring.

Keywords

"Hidden Markov models","Adaptation models","Media","Acoustics","Data mining","Speech","Speech recognition"

Publisher

IEEE

Conference_Title

Signal Processing Conference, 2002 11th European

ISSN

2219-5491

Type

conf

Filename

7072229

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3653635