Title of article :
Classification performance resulting from a 2-means
Author/Authors :
Ruwet، نويسنده , , C. and Haesbroeck، نويسنده , , G.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2013
Abstract :
The k-means procedure is probably one of the most common nonhierachical clustering techniques. From a theoretical point of view, it is related to the search for the k principal points of the underlying distribution. In this paper, the classification resulting from that procedure for k=2 is shown to be optimal under a balanced mixture of two spherically symmetric and homoscedastic distributions. Then, the classification efficiency of the 2-means rule is assessed using the second order influence function and compared to the classification efficiencies of Fisher and Logistic discriminations. Influence functions are also considered here to compare the robustness to infinitesimal contamination of the 2-means method w.r.t. the generalized 2-means technique.
Keywords :
k-means , Principal points , Robustness , Influence function , Cluster analysis , Asymptotic loss , Error rate
Journal title :
Journal of Statistical Planning and Inference
Journal title :
Journal of Statistical Planning and Inference