DocumentCode :
3151734
Title :
Feature selection for composite hypothesis testing with small samples: Fundamental limits and algorithms
Author :
Dayu Huang ; Meyn, Sean
Author_Institution :
CSL & ECE, Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
1917
Lastpage :
1920
Abstract :
This paper considers the problem of feature selection for composite hypothesis testing: The goal is to select, from m candidate features, r relevant ones for distinguishing the null hypothesis from the composite alternative hypothesis; the training data are given as L sequences of observations, of which each is an n-sample sequence coming from one distribution in the alternative hypothesis. What is the fundamental limit for successful feature selection? Are there any algorithms that achieve this limit? We investigate this problem in a small-sample high-dimensional setting, with n = o(m), and obtain a tight pair of achievability and converse results: (i) There exists a function f(L, n, r,m) such that if f(L, n, r,m) ↓ 0, then no asymptotically consistent feature selection algorithm exists; (ii) We propose a feature selection algorithm that is asymptotically consistent whenever f(L, n, r,m) ↑ ∞.
Keywords :
computational complexity; learning (artificial intelligence); sampling methods; statistical testing; achievability; asymptotically consistent feature selection algorithm; composite alternative hypothesis; composite hypothesis testing; fundamental limit; n-sample sequence; null hypothesis; small-sample high-dimensional setting; supervised learning; training data; Algorithm design and analysis; Complexity theory; Reactive power; Testing; Training data; US Government; USA Councils; Feature selection; composite hypothesis testing; high-dimensional model; small sample; supervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288279
Filename :
6288279
Link To Document :
بازگشت