DocumentCode
417667
Title
Speech recognition in multiple languages and domains: the 2003 BBN/LIMSI EARS system
Author
Schwartz, R. ; Colthurst, T. ; Duta, N. ; Gish, H. ; Iyer, R. ; Kao, C.-L. ; Liu, D. ; Kimball, O. ; Ma, J. ; Makhoul, J. ; Matsoukas, S. ; Nguyen, L. ; Noamany, M. ; Prasad, R. ; Xiang, B. ; Xu, D.-X. ; Gauvain, J.-L. ; Lamel, L. ; Schwenk, H. ; Adda, G.
Author_Institution
BBN Technol., Cambridge, MA, USA
Volume
3
fYear
2004
fDate
17-21 May 2004
Abstract
We report on the results of the first evaluations for the BBN/LIMSI system under the new DARPA EARS program. The evaluations were carried out for conversational telephone speech (CTS) and broadcast news (BN) for three languages: English, Mandarin, and Arabic. In addition to providing system descriptions and evaluation results, the paper highlights methods that worked well across the two domains and those few that worked well on one domain but not the other. For the BN evaluations, which had to be run under 10 times real-time, we demonstrated that a joint BBN/LIMSI system with a time constraint achieved better results than either system alone.
Keywords
hidden Markov models; natural languages; speech recognition; Arabic language; EARS system; English language; HMM; Mandarin language; broadcast news; conversational telephone speech; effective affordable reusable speech-to-text; multiple domain speech recognition; multiple language speech recognition; recognition word error rate reduction; Broadcasting; Collaborative work; Ear; Hidden Markov models; Natural languages; Real time systems; Speech recognition; Telephony; Testing; Time factors;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326654
Filename
1326654
Link To Document