Title :
Taking Topic Detection From Evaluation to Practice
Author :
Allan, James ; Harding, Stephen ; Fisher, David ; Bolivar, Alvaro ; Guzman-Lara, Sergio ; Amstutz, Peter
Author_Institution :
University of Massachusetts Amherst
Abstract :
The Topic Detection and Tracking (TDT) research community investigates information retrieval methods for organizing a constantly arriving stream of news articles by the events that they discuss. Our best system for the open evaluations of TDT has used an approach that turned out to be problematic when the cluster detection technology was deployed in a real world setting. To avoid generating "garbage" clusters, we had to revert to a different approach and to explore engineering solutions that were not motivated by the model. Our experiences also led us to propose extensions to the formal TDT evaluation.
Keywords :
Clustering algorithms; Computer science; Cost function; Error correction; Event detection; Failure analysis; Information retrieval; NIST; Organizing; Performance evaluation;
Conference_Titel :
System Sciences, 2005. HICSS '05. Proceedings of the 38th Annual Hawaii International Conference on
Print_ISBN :
0-7695-2268-8
DOI :
10.1109/HICSS.2005.576