DocumentCode :
417668
Title :
Speech transcription in multiple languages
Author :
Lamel, L. ; Gauvain, J.-L. ; Adda, G. ; Adda-Decker, M. ; Canseco, L. ; Chen, L. ; Galibert, O. ; Messaoudi, A. ; Schwenk, H.
Author_Institution :
Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
Volume :
3
fYear :
2004
fDate :
17-21 May 2004
Abstract :
The paper summarizes recent work underway at LIMSI on speech-to-text transcription in multiple languages. The research has been oriented towards the processing of broadcast audio and conversational speech for information access. Broadcast news transcription systems have been developed for seven languages, and it is planned to address several other languages in the near term. Research on conversational speech has mainly focused on the English language, with some initial work on French, Arabic and Spanish. Automatic processing must take into account the characteristics of the audio data, such as needing to deal with the continuous data stream, specificities of the language and the use of an imperfect word transcription for accessing the information content. Our experience thus far indicates that at today´s word error rates, the techniques used in one language can be successfully ported to other languages, and most of the language specificities concern lexical and pronunciation modeling.
Keywords :
natural languages; speech recognition; text analysis; broadcast audio; broadcast news transcription systems; continuous data stream; conversational speech; information access; lexical modeling; multiple languages; pronunciation modeling; speech recognition; speech transcription; speech-to-text transcription; word error rates; word transcription; Adaptation model; Broadcasting; Error analysis; Hidden Markov models; Loudspeakers; Multimedia communication; Natural languages; Speech processing; Speech recognition; Streaming media;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326655
Filename :
1326655
Link To Document :
بازگشت