Title :
Robust information clustering
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore
fDate :
31 July-4 Aug. 2005
Abstract :
We focus on the scenario of robust information clustering (RIC) based on the minimax optimization of mutual information (MI). The minimization of MI leads to the standard mass constrained deterministic annealing clustering which is an empirical risk minimization algorithm. The maximization of MI works out an upper bound of the empirical risk via the identification of outliers (noisy data points). Furthermore, we estimate the real risk VC-bound and determine an optimal cluster number of the RIC based on the structural risk minimization (SRM) principle. One of the main advantages of the minimax optimization of MI is that it is a nonparametric approach, which identifies the outliers through the robust density estimate and forms a simple data clustering algorithm based on the square error of the Euclidean distance.
Keywords :
learning (artificial intelligence); least mean squares methods; minimax techniques; pattern clustering; Euclidean distance; empirical risk minimization algorithm; mass constrained deterministic annealing clustering; minimax optimization; mutual information; nonparametric approach; robust information clustering; square error; Annealing; Clustering algorithms; Data structures; Distortion measurement; Euclidean distance; Minimax techniques; Mutual information; Noise robustness; Risk management; Upper bound;
Conference_Titel :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN :
0-7803-9048-2
DOI :
10.1109/IJCNN.2005.1556041