• DocumentCode
    1743021
  • Title

    Data description in subspaces

  • Author

    Tax, David M J ; Duin, Robert P W

  • Author_Institution
    Fac. of Appl. Sci., Delft Univ. of Technol., Netherlands
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    672
  • Abstract
    We investigate how the boundary of a data set can be obtained in case of (very) low sample sizes. This boundary can be used to detect if new objects resemble the data set and therefore make the subsequent classification more confident. When a large number of training objects is available it is possible to directly estimate the density. After thresholding the probability density a boundary around the data is obtained. However, in the case of very low sample sizes, extrapolations have to be performed. In this paper we propose a simple method based on nearest neighbor distances which is capable of finding data boundaries in these low sample sizes. It appears to be especially useful when the data is distributed in subspaces
  • Keywords
    extrapolation; learning systems; normal distribution; pattern classification; statistical analysis; data boundary; data description; extrapolation; machine learning; nearest neighbor distances; normal distribution; pattern classification; probability density; Covariance matrix; Extrapolation; Gaussian distribution; Nearest neighbor searches; Object detection; Pattern recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2000. Proceedings. 15th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-0750-6
  • Type

    conf

  • DOI
    10.1109/ICPR.2000.906164
  • Filename
    906164