DocumentCode :
1126026
Title :
Feature subset selection and feature ranking for multivariate time series
Author :
Yoon, Hyunjin ; Yang, Kiyoung ; Shahabi, Cyrus
Author_Institution :
Dept. of Comput. Sci., Univ. of Southern California, Los Angeles, CA, USA
Volume :
17
Issue :
9
fYear :
2005
Firstpage :
1186
Lastpage :
1198
Abstract :
Feature subset selection (FSS) is a known technique to preprocess the data before performing any data mining tasks, e.g., classification and clustering. FSS provides both cost-effective predictors and a better understanding of the underlying process that generated the data. We propose a family of novel unsupervised methods for feature subset selection from multivariate time series (MTS) based on common principal component analysis, termed CLeVer. Traditional FSS techniques, such as recursive feature elimination (RFE) and Fisher criterion (FC), have been applied to MTS data sets, e.g., brain computer interface (BCI) data sets. However, these techniques may lose the correlation information among features, while our proposed techniques utilize the properties of the principal component analysis to retain that information. In order to evaluate the effectiveness of our selected subset of features, we employ classification as the target data mining task. Our exhaustive experiments show that CLeVer outperforms RFE, FC, and random selection by up to a factor of two in terms of the classification accuracy, while taking up to 2 orders of magnitude less processing time than RFE and FC.
Keywords :
data analysis; data mining; feature extraction; pattern classification; principal component analysis; time series; unsupervised learning; BCI data sets; CLeVer; FC; FSS; Fisher criterion; MTS data sets; RFE; brain computer interface; data mining; feature extraction; feature ranking; feature representation; feature subset selection; multivariate time series; pattern classification; principal component analysis; recursive feature elimination; unsupervised methods; Application software; Brain computer interfaces; Data mining; Electroencephalography; Feature extraction; Frequency selective surfaces; Humans; Principal component analysis; Time measurement; Time series analysis; Index Terms- Data mining; feature evaluation and selection; feature extraction or construction; feature representation.; time series analysis;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2005.144
Filename :
1490526
Link To Document :
بازگشت