DocumentCode :
1944614
Title :
Regression in the Presence Missing Data Using Ensemble Methods
Author :
Hassan, Mostafa M. ; Atiya, Amir F. ; El-Gayar, Neamat ; El-Fouly, Raafat
Author_Institution :
Dept. of Comput. Eng., Cairo Univ., Cairo
fYear :
2007
fDate :
12-17 Aug. 2007
Firstpage :
1261
Lastpage :
1265
Abstract :
We consider the problem of missing data, and develop ensemble-network models for handling the missing data. The proposed method is based on utilizing the inherent uncertainty of the missing records in generating diverse training sets for the ensemble´s networks. The proposed method is based on generating the missing values using their probability density. We repeat this procedure many time thereby creating several complete data sets. A network is trained for each of these data sets, therefore obtaining an ensemble of networks. Several variants are proposed, including the univariate approach and the multivariate approach, which differ in the way missing values are generated. Simulation results confirm the general superiority of the proposed methods compared to the conventional approaches.
Keywords :
data handling; neural nets; probability; regression analysis; ensemble-network model; missing data handling; missing records; probability density; regression; Information technology; Learning systems; Linear regression; Machine learning algorithms; Maximum likelihood estimation; Neural networks; Parameter estimation; Statistics; Training data; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2007. IJCNN 2007. International Joint Conference on
Conference_Location :
Orlando, FL
ISSN :
1098-7576
Print_ISBN :
978-1-4244-1379-9
Electronic_ISBN :
1098-7576
Type :
conf
DOI :
10.1109/IJCNN.2007.4371139
Filename :
4371139
Link To Document :
بازگشت