• DocumentCode
    417668
  • Title
    Speech transcription in multiple languages
  • Author
    Lamel, L.; Gauvain, J.-L.; Adda, G.; Adda-Decker, M.; Canseco, L.; Chen, L.; Galibert, O.; Messaoudi, A.; Schwenk, H.
  • Author_Institution
    Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
  • Volume
    3
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    The paper summarizes recent work underway at LIMSI on speech-to-text transcription in multiple languages. The research has been oriented towards the processing of broadcast audio and conversational speech for information access. Broadcast news transcription systems have been developed for seven languages, and it is planned to address several other languages in the near term. Research on conversational speech has mainly focused on the English language, with some initial work on French, Arabic and Spanish. Automatic processing must take into account the characteristics of the audio data, such as needing to deal with the continuous data stream, specificities of the language and the use of an imperfect word transcription for accessing the information content. Our experience thus far indicates that at today's word error rates, the techniques used in one language can be successfully ported to other languages, and most of the language specificities concern lexical and pronunciation modeling.
  • Keywords
    natural languages; speech recognition; text analysis; broadcast audio; broadcast news transcription systems; continuous data stream; conversational speech; information access; lexical modeling; multiple languages; pronunciation modeling; speech transcription; speech-to-text transcription; word error rates; word transcription; Adaptation model; Broadcasting; Error analysis; Hidden Markov models; Loudspeakers; Multimedia communication; Natural languages; Speech processing; Speech recognition; Streaming media
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type
    conf
  • DOI
    10.1109/ICASSP.2004.1326655
  • Filename
    1326655