Title :
Genetic Heuristic Development: Feature selection for author identification
Author :
Adams, J. ; Williams, Henry ; Carter, Jenny ; Dozier, Gerry
Author_Institution :
North Carolina A&T State Univ., Greensboro, NC, USA
Abstract :
Author identification is the process of recognizing an author based on a sample of text. Feature selection is the process of selecting the most salient features required for recognition. In many cases, this results in an increase in recognition accuracy. In this paper, we apply Genetic and Evolutionary Feature Selection with Machine Learning (GEFeSML) to author identification. We then introduce Genetic Heuristic Development (GHD), a process to improve the matching process. GHD uses subsets of features found by GEFeSML to create a high performing heuristic for feature selection. This technique successfully increases recognition accuracy while significantly reducing the number of features required for recognition.
Keywords :
biometrics (access control); genetic algorithms; learning (artificial intelligence); GEFeSML; GHD; author identification; author recognition; biometric recognition; genetic and evolutionary feature selection; genetic heuristic development; machine learning; Accuracy; Biometrics (access control); Feature extraction; Frequency modulation; Genetics; Sociology; Statistics; Author identification; Biometrics; Classification algorithms; Evolutionary Computation; Feature Extraction; Genetic Algorithms (GAs); Heuristic algorithms;
Conference_Titel :
Computational Intelligence in Biometrics and Identity Management (CIBIM), 2013 IEEE Workshop on
Conference_Location :
Singapore
DOI :
10.1109/CIBIM.2013.6607911