Title :
A simple probabilistic approach to classification and routing
Author :
Leistensnider, James ; Wilding, Jonathan
Author_Institution :
Manage. & Data Syst., Lockheed Martin Corp., Philadelphia, PA, USA
Abstract :
The Language Exploitation Group in Management and Data Systems has developed classification and routing software which calculates the most likely category of a document and creates ranked lists of documents relevant to user-defined categories. The basic algorithm used is to represent categories as multinomial distributions based upon information gathered from training sets of relevant documents, and to use probability to classify and route new documents into these categories. The software has been proven by its entry in the international TREC5 competition, and the software is being extended with both contract and IR and D efforts
Keywords :
classification; document handling; classification; document classification; information retrieval; lists of documents; multinomial distributions; routing; text processing; training sets; user-defined categories; Art; Contracts; Data systems; Frequency; Heart; Information retrieval; Routing; Software development management; Text processing; Writing;
Conference_Titel :
MILCOM 97 Proceedings
Conference_Location :
Monterey, CA
Print_ISBN :
0-7803-4249-6
DOI :
10.1109/MILCOM.1997.646719