DocumentCode
909956
Title
Estimation by the nearest neighbor rule
Author
Cover, Thomas M.
Volume
14
Issue
1
fYear
1968
fDate
1/1/1968 12:00:00 AM
Firstpage
50
Lastpage
55
Abstract
Let
denote the Bayes risk (minimum expected loss) for the problem of estimating
, given an observed random variable
, joint probability distribution
, and loss function
. Consider the problem in which the only knowledge of
is that which can be inferred from samples
, where the
are independently identically distributed according to
. Let the nearest neighbor estimate of the parameter
associated with an observation
be defined to be the parameter
associated with the nearest neighbor
to
. Let R be the large sample risk of the nearest neighbor rule. It will be shown, for a wide range of probability distributions, that
for metric loss functions and
for squared-error loss functions. A simple estimator using the nearest
neighbors yields
in the squared-error loss case. In this sense, it can be said that at least haft the information in the infinite training set is contained in the nearest neighbor. This paper is an extension of earlier work[q from the problem of classification by the nearest neighbor rule to that of estimation. However, the unbounded loss functions in the estimation problem introduce additional problems concerning the convergence of the unconditional risk. Thus some work is devoted to the investigation of natural conditions on the underlying distribution assuring the desired convergence.
denote the Bayes risk (minimum expected loss) for the problem of estimating
, given an observed random variable
, joint probability distribution
, and loss function
. Consider the problem in which the only knowledge of
is that which can be inferred from samples
, where the
are independently identically distributed according to
. Let the nearest neighbor estimate of the parameter
associated with an observation
be defined to be the parameter
associated with the nearest neighbor
to
. Let R be the large sample risk of the nearest neighbor rule. It will be shown, for a wide range of probability distributions, that
for metric loss functions and
for squared-error loss functions. A simple estimator using the nearest
neighbors yields
in the squared-error loss case. In this sense, it can be said that at least haft the information in the infinite training set is contained in the nearest neighbor. This paper is an extension of earlier work[q from the problem of classification by the nearest neighbor rule to that of estimation. However, the unbounded loss functions in the estimation problem introduce additional problems concerning the convergence of the unconditional risk. Thus some work is devoted to the investigation of natural conditions on the underlying distribution assuring the desired convergence.Keywords
Bayes procedures; Decision procedures; Estimation; Pattern classification; Contracts; Convergence; Extraterrestrial measurements; Nearest neighbor searches; Neural networks; Parameter estimation; Probability distribution; Random variables; Risk analysis; Yield estimation;
fLanguage
English
Journal_Title
Information Theory, IEEE Transactions on
Publisher
ieee
ISSN
0018-9448
Type
jour
DOI
10.1109/TIT.1968.1054098
Filename
1054098
Link To Document