Title :
A predictive model using improved Normalized Point Wise Mutual Information (INPMI)
Author :
Sasikala, S. ; Geetha, S. ; Christopher, A.B.A. ; Balamurugan, S.A.A.
Author_Institution :
Anna Univ., Chennai, India
Abstract :
In machine learning, selection of optimal features for the classifier is a critical problem. In order to address this problem a novel feature selection method named “Improved Normalized Point wise Mutual Information (INPMI)” is proposed. The proposed INPMI method coupled with Sequential forward search (SFS) finds the best feature subset to aid feature selection process. The key properties of evaluating feature subset i.e. relevancy and redundancy are analysed well. The classifiers like Naive Bayes, Support Vector Machine and J48 are used to determine the accuracy for the choice of features selected. Experimental results with benchmark medical datasets from UCI (University of California Irvine) machine learning database show that proposed INPMI-NB model with SFS, INPMI-SVM model with SFS, INPMI-J48model with SFS achieves 98.36 %, 98.90 %, 94.53 % classification accuracy and selects 22 features for erythemato-squamous diseases. Also the proposed work is evaluated on a World Aircraft dataset to prove its generalization ability. Experimental results prove that the proposed INPMI method outperforms the existing systems.
Keywords :
Bayes methods; diseases; feature selection; generalisation (artificial intelligence); learning (artificial intelligence); medical diagnostic computing; pattern classification; support vector machines; INPMI-J48 model; INPMI-NB model; INPMI-SVM model; SFS; UCI machine learning database; University of California Irvine machine learning database; World Aircraft dataset; benchmark medical datasets; classifiers; erythemato-squamous diseases; feature selection method; feature selection process; feature subset evaluation; generalization ability; improved normalized point wise mutual information; medical data mining; naive Bayes; optimal feature selection; predictive model; sequential forward search; support vector machine; Accuracy; Classification algorithms; Diseases; Joints; Medical diagnostic imaging; Mutual information; Probability distribution; Erythemato-squamous diseases; Feature selection; Improved Normalized Point wise MutualInformation(INPMI); Medical Data mining;
Conference_Titel :
ICT and Knowledge Engineering (ICT&KE), 2013 11th International Conference on
Conference_Location :
Bangkok
Print_ISBN :
978-1-4799-2294-9
DOI :
10.1109/ICTKE.2013.6756284