DocumentCode
294581
Title
Improved topic spotting through statistical modelling of keyword dependencies
Author
Wright, Jerry H. ; Carey, Michael J. ; Parris, Eluned S.
Author_Institution
Ensigma Ltd., Chepstow, UK
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
313
Abstract
Keywords are chosen on the basis of their usefulness for discriminating a topic from background speech. Good topic recognition can be achieved with a small set of well-chosen keywords, but particular combinations of keywords often achieve better discrimination than can be accounted for by regarding them as independent. This paper describes a higher-order statistical approach involving models of keyword-topic interdependence. A linear-logistic model brings some improvement in performance, but better results are obtained using log-linear contingency table models. Although the potential number of these is very large, good models tend to be simple and are suggested by heuristic measures inferred from the training data. The approach is tested using a broadcast radio database
Keywords
higher order statistics; radio broadcasting; speech processing; speech recognition; background speech; broadcast radio database; discrimination; heuristic measures; higher-order statistical approach; keyword dependencies; keyword-topic interdependence; linear-logistic mode; log-linear contingency table models; performance; statistical modelling; topic recognition; topic spotting; training data; Character generation; Databases; Frequency measurement; Logistics; Radio broadcasting; Smoothing methods; Speech; Testing; Training data; Yield estimation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479536
Filename
479536
Link To Document