Title :
A system for unrestricted topic retrieval from radio news broadcasts
Author_Institution :
UBILAB, Union Bank of Switzerland, Zurich, Switzerland
Abstract :
The “topic classification” systems described in the speech literature typically partition a collection of spoken messages into a small number of pre-defined topics. As such, they are only useful if the set of message topics does not vary over time. However, the techniques of textual information retrieval (IR) have long allowed for retrieval by arbitrary subject from a document collection. This paper describes experiments in unrestricted retrieval from a collection of radio news broadcasts. A hybrid message indexing strategy, which combines conventional word recognition with a fast lattice-based wordspotter, allows for the retrieval of news reports concerning any subject. The results show that retrieval can be carried out extremely quickly and that high accuracy is possible, even when the recognition output contains errors.
Keywords :
indexing; information retrieval systems; radio broadcasting; speech recognition; telecommunication computing; document collection; experiments; fast lattice-based wordspotter; hybrid message indexing; message topics; news reports; radio news broadcasts; speech literature; spoken messages; textual information retrieval; topic classification systems; unrestricted topic retrieval system; word recognition; Content based retrieval; Dictionaries; Indexing; Information retrieval; Loudspeakers; Radio broadcasting; Speech; Testing; Training data; Vocabulary;
Conference_Title :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.540412