DocumentCode
455184
Title
Unsupervised Training on Large Amounts of Broadcast News Data
Author
Ma, Jeff ; Matsoukas, Spyros ; Kimball, Owen ; Schwartz, Richard
Volume
3
fYear
2006
fDate
14-19 May 2006
Abstract
This paper presents our recent effort that aims at improving our Arabic broadcast news (BN) recognition system by using thousands of hours of un-transcribed Arabic audio in the way of unsupervised training. Unsupervised training is first carried out on the 1,900-hour English topic detection and tracking (TDT) data and is compared with the lightly-supervised training method that we have used for the DARPA EARS evaluations. The comparison shows that unsupervised training produces a 21.7% relative reduction in word error rate (WER), which is comparable to the gain obtained with light supervision methods. The same unsupervised training strategy carried out on a similar amount of Arabic BN data produces an 11.6% relative gain. The gain, though considerable, is substantially smaller than what is observed on the English data. Our initial work towards understanding the reasons for this difference is also described
Keywords
natural languages; speech recognition; unsupervised learning; Arabic broadcast news recognition system; English topic detection and tracking; broadcast news data; un-transcribed Arabic audio; unsupervised training; word error rate; Availability; Broadcast technology; Broadcasting; Ear; Error analysis; Maximum likelihood decoding; Natural languages; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660839
Filename
1660839
Link To Document