• DocumentCode
    455184
  • Title

    Unsupervised Training on Large Amounts of Broadcast News Data

  • Author

    Ma, Jeff ; Matsoukas, Spyros ; Kimball, Owen ; Schwartz, Richard

  • Volume
    3
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    This paper presents our recent effort that aims at improving our Arabic broadcast news (BN) recognition system by using thousands of hours of un-transcribed Arabic audio in the way of unsupervised training. Unsupervised training is first carried out on the 1,900-hour English topic detection and tracking (TDT) data and is compared with the lightly-supervised training method that we have used for the DARPA EARS evaluations. The comparison shows that unsupervised training produces a 21.7% relative reduction in word error rate (WER), which is comparable to the gain obtained with light supervision methods. The same unsupervised training strategy carried out on a similar amount of Arabic BN data produces an 11.6% relative gain. The gain, though considerable, is substantially smaller than what is observed on the English data. Our initial work towards understanding the reasons for this difference is also described
  • Keywords
    natural languages; speech recognition; unsupervised learning; Arabic broadcast news recognition system; English topic detection and tracking; broadcast news data; un-transcribed Arabic audio; unsupervised training; word error rate; Availability; Broadcast technology; Broadcasting; Ear; Error analysis; Maximum likelihood decoding; Natural languages; Testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660839
  • Filename
    1660839