DocumentCode :
3382237
Title :
Combining statistical independence testing, visual attribute selection and automated analysis to find relevant attributes for classification
Author :
May, Thorsten ; Davey, James ; Kohlhammer, Jörn
Author_Institution :
Fraunhofer Inst. for Comput. Graphics Res., Darmstadt, Germany
fYear :
2010
fDate :
25-26 Oct. 2010
Firstpage :
239
Lastpage :
240
Abstract :
We present an iterative strategy for finding a relevant subset of attributes for the purpose of classification in high-dimensional, heterogeneous data sets. The attribute subset is used for the construction of a classifier function. In order to cope with the challenge of scalability, the analysis is split into an overview of all attributes and a detailed analysis of small groups of attributes. The overview provides generic information on statistical dependencies between attributes. With this information the user can select groups of attributes and an analytical method for their detailed analysis. The detailed analysis involves the identification of redundant attributes (via classification or regression) and the creation of summarizing attributes (via clustering or dimension reduction). Our strategy does not prescribe specific analytical methods. Instead, we recursively combine the results of different methods to find or generate a subset of attributes to use for classification.
Keywords :
attribute grammars; data analysis; pattern classification; attributes subset finding; automated analysis; classifier function; iterative strategy; statistical independence testing; visual attribute selection; Clustering methods; Data visualization; Electronic mail; Testing; USA Councils; Visualization; Data Clustering; Data Filtering; Dimensionality Reduction; High-Dimensional Data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Visual Analytics Science and Technology (VAST), 2010 IEEE Symposium on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
978-1-4244-9488-0
Electronic_ISBN :
978-1-4244-9487-3
Type :
conf
DOI :
10.1109/VAST.2010.5654445
Filename :
5654445
Link To Document :
بازگشت