• DocumentCode
    294581
  • Title

    Improved topic spotting through statistical modelling of keyword dependencies

  • Author

    Wright, Jerry H. ; Carey, Michael J. ; Parris, Eluned S.

  • Author_Institution
    Ensigma Ltd., Chepstow, UK
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    313
  • Abstract
    Keywords are chosen on the basis of their usefulness for discriminating a topic from background speech. Good topic recognition can be achieved with a small set of well-chosen keywords, but particular combinations of keywords often achieve better discrimination than can be accounted for by regarding them as independent. This paper describes a higher-order statistical approach involving models of keyword-topic interdependence. A linear-logistic model brings some improvement in performance, but better results are obtained using log-linear contingency table models. Although the potential number of these is very large, good models tend to be simple and are suggested by heuristic measures inferred from the training data. The approach is tested using a broadcast radio database
  • Keywords
    higher order statistics; radio broadcasting; speech processing; speech recognition; background speech; broadcast radio database; discrimination; heuristic measures; higher-order statistical approach; keyword dependencies; keyword-topic interdependence; linear-logistic mode; log-linear contingency table models; performance; statistical modelling; topic recognition; topic spotting; training data; Character generation; Databases; Frequency measurement; Logistics; Radio broadcasting; Smoothing methods; Speech; Testing; Training data; Yield estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479536
  • Filename
    479536