DocumentCode :
455184
Title :
Unsupervised Training on Large Amounts of Broadcast News Data
Author :
Ma, Jeff ; Matsoukas, Spyros ; Kimball, Owen ; Schwartz, Richard
Volume :
3
fYear :
2006
fDate :
14-19 May 2006
Abstract :
This paper presents our recent effort that aims at improving our Arabic broadcast news (BN) recognition system by using thousands of hours of un-transcribed Arabic audio in the way of unsupervised training. Unsupervised training is first carried out on the 1,900-hour English topic detection and tracking (TDT) data and is compared with the lightly-supervised training method that we have used for the DARPA EARS evaluations. The comparison shows that unsupervised training produces a 21.7% relative reduction in word error rate (WER), which is comparable to the gain obtained with light supervision methods. The same unsupervised training strategy carried out on a similar amount of Arabic BN data produces an 11.6% relative gain. The gain, though considerable, is substantially smaller than what is observed on the English data. Our initial work towards understanding the reasons for this difference is also described
Keywords :
natural languages; speech recognition; unsupervised learning; Arabic broadcast news recognition system; English topic detection and tracking; broadcast news data; un-transcribed Arabic audio; unsupervised training; word error rate; Availability; Broadcast technology; Broadcasting; Ear; Error analysis; Maximum likelihood decoding; Natural languages; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660839
Filename :
1660839
Link To Document :
بازگشت