DocumentCode :
885372
Title :
R65-25 Training a Computer to Assign Descriptors to Documents: Experiments in Automatic Indexing
Author :
Bobrow, D.G.
Author_Institution :
Dept. of Elec. Engrg., Mass. Inst. Tech.
Issue :
2
fYear :
1965
fDate :
4/1/1965 12:00:00 AM
Firstpage :
278
Lastpage :
278
Abstract :
Summary form only given. This work describes a technique for utilizing a computer program to assign to technical papers relevant descriptors from a fixed set of such terms. The authors chose a "representative" sample of about one hundred papers from a collection of 10,000 papers previously indexed by analysts at the Defense Documentation Center. The significant content words (those not on a list of stop words to be ignored) of the title and abstract of each paper were extracted, and paired with all the descriptors for that paper. From all the pairs obtained from this teaching sample, and the relative frequency of occurrence of each descriptor, a co-occurrence value for each pair was computed, and for "validated" descriptors (those appearing at least three times in the teaching sample), this co-occurrence data was retained. The remaining descriptor names were kept on a list of "candidate" descriptors.
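The teaching step summarized in the abstract can be sketched roughly as follows. This is an illustrative reconstruction, not the authors' program: the abstract does not give the co-occurrence formula, so the score here (the fraction of sample papers carrying a descriptor that also contain the word) is an assumption, as are the stop-word list, the tokenization, counting each word once per paper, and the names `content_words` and `train`. Only the threshold of three occurrences for "validated" descriptors comes from the abstract.

```python
from collections import Counter

# Hypothetical stop-word list; the abstract does not reproduce the real one.
STOP_WORDS = {"a", "an", "and", "for", "in", "of", "on", "the", "to"}

# Descriptors seen at least this many times are "validated" (per the abstract).
MIN_OCCURRENCES = 3


def content_words(text):
    """Return lowercased words of `text` with punctuation and stop words removed."""
    words = (w.strip(".,;:()\"'").lower() for w in text.split())
    return [w for w in words if w and w not in STOP_WORDS]


def train(sample):
    """Build co-occurrence data from (title_and_abstract, descriptors) pairs.

    Returns (cooccurrence, validated, candidates).
    """
    pair_counts = Counter()
    descriptor_counts = Counter()
    for text, descriptors in sample:
        words = set(content_words(text))  # assumption: count a word once per paper
        for d in set(descriptors):
            descriptor_counts[d] += 1
            for w in words:
                pair_counts[(w, d)] += 1

    validated = {d for d, n in descriptor_counts.items() if n >= MIN_OCCURRENCES}
    candidates = set(descriptor_counts) - validated

    # Assumed score: share of papers with descriptor d that also contain word w.
    # Co-occurrence data is retained only for validated descriptors.
    cooccurrence = {
        (w, d): n / descriptor_counts[d]
        for (w, d), n in pair_counts.items()
        if d in validated
    }
    return cooccurrence, validated, candidates


if __name__ == "__main__":
    # Toy teaching sample, invented for illustration only.
    sample = [
        ("Radar signal detection", ["radar"]),
        ("Detection of radar echoes", ["radar", "signals"]),
        ("Pulse radar design", ["radar"]),
        ("Lens design for the telescope", ["optics"]),
    ]
    cooc, validated, candidates = train(sample)
    print(sorted(validated), sorted(candidates))
```

In this toy sample only "radar" reaches the threshold, so the other descriptor names are kept as candidates and no co-occurrence data is stored for them.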
fLanguage :
English
Journal_Title :
Electronic Computers, IEEE Transactions on
Publisher :
IEEE
ISSN :
0367-7508
Type :
jour
DOI :
10.1109/PGEC.1965.263978
Filename :
4038433