Title :
Influential data subsets labelling: A fuzzy relation approach
Author_Institution :
Dept. of Math. Sci., Univ. Teknol. Malaysia, Skudai, Malaysia
Abstract :
The selection of candidate for influential data subsets is an important step in regression analysis in order to construct a model of high quality. One of the most popular and widely used methods for labelling influential data subsets is 1-clustering method. This is an indexed hierarchical clustering based on the notion of subdominant ultrametric (SDU) of a dissimilarity matrix. In the literature, there are many different algorithms available to construct SDU. However, the computational complexity of those algorithms is high; the running time is very slow especially when the sample size is large. In this paper a method based on fuzzy relation approach, which allows us to construct a promising algorithm to obtain SDU even for a large number of data, is introduced. An example will be presented and discussed to illustrate the advantage of the proposed algorithm.
Keywords :
computational complexity; fuzzy set theory; matrix algebra; pattern clustering; regression analysis; 1-clustering method; SDU; computational complexity; dissimilarity matrix; fuzzy relation approach; indexed hierarchical clustering; influential data subset labelling; regression analysis; running time; subdominant ultrametric; Clustering algorithms; Computational complexity; Convergence; Couplings; Indexes; Labeling; Mathematical model; Dissimilarity index; hat matrix; min-max transitive closure; sub-dominant ultrametric; ultrametric distance;
Conference_Titel :
Statistics in Science, Business, and Engineering (ICSSBE), 2012 International Conference on
Conference_Location :
Langkawi
Print_ISBN :
978-1-4673-1581-4
DOI :
10.1109/ICSSBE.2012.6396633