• Title of article

    Clustering confidence sets

  • Author/Authors

    Serban، نويسنده , , Nicoleta، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2009
  • Pages
    16
  • From page
    109
  • To page
    124
  • Abstract
    In this article, we present a novel approach to clustering finite or infinite dimensional objects observed with different uncertainty levels. The novelty lies in using confidence sets rather than point estimates to obtain cluster membership and the number of clusters based on the distance between the confidence set estimates. The minimal and maximal distances between the confidence set estimates provide confidence intervals for the true distances between objects. The upper bounds of these confidence intervals can be used to minimize the within clustering variability and the lower bounds can be used to maximize the between clustering variability. We assign objects to the same cluster based on a min–max criterion and we separate clusters based on a max–min criterion. We illustrate our technique by clustering a large number of curves and evaluate our clustering procedure with a synthetic example and with a specific application.
  • Keywords
    Clustering error rate , Gap sequence , Compustat Global database , Q-ratio , Single-linkage tree , Distance confidence interval , Simultaneous confidence sets
  • Journal title
    Journal of Statistical Planning and Inference
  • Serial Year
    2009
  • Journal title
    Journal of Statistical Planning and Inference
  • Record number

    2219776