Title :
Feature and Prototype Evolution for Nearest Neighbor Classification of Web Documents
Author :
Cheatham, Michelle ; Rizki, Mateen
Author_Institution :
Inf. Directorate, Air Force Res. Lab., Wright-Patterson AFB, OH
Abstract :
A nearest neighbor classifier (NNC) approaches the problem of text classification by computing a similarity metric between feature vector representations of an unknown document and a set of known prototype documents. The accuracy and speed of the NNC are dependent upon the choices of features and prototypes. In this paper, we consider the use of a genetic algorithm to optimize the feature and prototype sets for an NNC. We also examine whether simultaneously evolving the feature and prototype sets produces better results than sequential optimization
Keywords :
Internet; classification; genetic algorithms; text analysis; Web documents; feature evolution; genetic algorithm; nearest neighbor classification; prototype evolution; text classification; Computer science; Design engineering; Genetic algorithms; HTML; Laboratories; Military computing; Nearest neighbor searches; Prototypes; Space exploration; Text categorization;
Conference_Titel :
Information Technology: New Generations, 2006. ITNG 2006. Third International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
0-7695-2497-4
DOI :
10.1109/ITNG.2006.64