Influential data subsets labelling: A fuzzy relation approach

Author

Djauhari, M.A.

Author_Institution

Dept. of Math. Sci., Univ. Teknol. Malaysia, Skudai, Malaysia

fYear

2012

fDate

10-12 Sept. 2012

Firstpage

1

Lastpage

3

Abstract

The selection of candidate for influential data subsets is an important step in regression analysis in order to construct a model of high quality. One of the most popular and widely used methods for labelling influential data subsets is 1-clustering method. This is an indexed hierarchical clustering based on the notion of subdominant ultrametric (SDU) of a dissimilarity matrix. In the literature, there are many different algorithms available to construct SDU. However, the computational complexity of those algorithms is high; the running time is very slow especially when the sample size is large. In this paper a method based on fuzzy relation approach, which allows us to construct a promising algorithm to obtain SDU even for a large number of data, is introduced. An example will be presented and discussed to illustrate the advantage of the proposed algorithm.

Keywords

computational complexity; fuzzy set theory; matrix algebra; pattern clustering; regression analysis; 1-clustering method; SDU; computational complexity; dissimilarity matrix; fuzzy relation approach; indexed hierarchical clustering; influential data subset labelling; regression analysis; running time; subdominant ultrametric; Clustering algorithms; Computational complexity; Convergence; Couplings; Indexes; Labeling; Mathematical model; Dissimilarity index; hat matrix; min-max transitive closure; sub-dominant ultrametric; ultrametric distance;

fLanguage

English

Publisher

ieee

Conference_Titel

Statistics in Science, Business, and Engineering (ICSSBE), 2012 International Conference on

Conference_Location

Langkawi

Print_ISBN

978-1-4673-1581-4

Type

conf

DOI

10.1109/ICSSBE.2012.6396633

Filename

6396633