Title of article :
Simultaneous variable selection and outlier identification in linear regression using the mean-shift outlier model
Author/Authors :
Sung-Soo Kim، نويسنده , , Sung H. Park & W.J. Krzanowski، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2008
Abstract :
We provide a method for simultaneous variable selection and outlier identification using the mean-shift
outlier model. The procedure consists of two steps: the first step is to identify potential outliers, and the
second step is to perform all possible subset regressions for the mean-shift outlier model containing the
potential outliers identified in step 1. This procedure is helpful for model selection while simultaneously
considering outlier identification, and can be used to identify multiple outliers. In addition, we can evaluate
the impact on the regression model of simultaneous omission of variables and interesting observations. In an
example, we provide detailed output from the R system, and compare the results with those using posterior
model probabilities as proposed by Hoeting et al. [Comput. Stat. Data Anal. 22 (1996), pp. 252–270] for
simultaneous variable selection and outlier identification.
Keywords :
Multiple outliers , Variable selection , Mean-shift outlier model , all-subset regressions
Journal title :
JOURNAL OF APPLIED STATISTICS
Journal title :
JOURNAL OF APPLIED STATISTICS