DocumentCode :
2724796
Title :
Mining Subspace Correlations
Author :
Harpaz, Rave ; Haralick, Robert
Author_Institution :
Graduate Center, City Univ. of New York, NY
fYear :
2007
fDate :
March 1 2007-April 5 2007
Firstpage :
335
Lastpage :
342
Abstract :
In recent applications of clustering such as gene expression microarray analysis, collaborative filtering, and Web mining, object similarity is no longer measured by physical distance, but rather by the behavior patterns objects manifest or the magnitude of correlations they induce. Current state of the art algorithms aiming at this type of clustering typically postulate specific cluster models that are able to capture only specific behavior patterns or correlations, and omit the possibility that other information carrying patterns or correlations may coexist in the data. We cast the problem of searching for pattern clusters or clusters that induce large correlations in some subset of features into the problem of searching for groups of points embedded in lines. The advantage of this approach is that is allows the clustering of different patterns or correlations simultaneously. It also allows the clustering of patterns and correlations that are overlooked by existing methods. A formal stochastic line cluster model is presented and its connection to correlation is established. Based on this model an algorithm, which uses feature selection to search for line clusters embedded in subspaces of the data is presented
Keywords :
data mining; pattern clustering; stochastic processes; behavior patterns; clustering algorithms; feature selection; formal stochastic line cluster model; object similarity; pattern clusters; subspace correlation mining; Application software; Clustering algorithms; Clustering methods; Collaboration; Computational intelligence; Data mining; Gene expression; Laboratories; Pattern recognition; USA Councils;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0705-2
Type :
conf
DOI :
10.1109/CIDM.2007.368893
Filename :
4221317
Link To Document :
بازگشت