مرکز منطقه ای اطلاع رساني علوم و فناوري - Sparse Bayesian approach for feature selection

DocumentCode :

1800027

Title :

Sparse Bayesian approach for feature selection

Author :

Chang Li ; Huanhuan Chen

Author_Institution :

Sch. of Comput. Sci. & Technol., Univ. of Sci. & Technol. of China, Hefei, China

fYear :

2014

fDate :

9-12 Dec. 2014

Firstpage :

Lastpage :

Abstract :

This paper employs sparse Bayesian approach to enable the Probabilistic Classification Vector Machine (PCVM) to select a relevant subset of features. Because of probabilistic outputs and the ability to automatically optimize the regularization items, the sparse Bayesian framework has shown great advantages in real-world applications. However, the Gaussian priors that introduce the same prior to different classes may lead to instability in the classifications. An improved Gaussian prior, whose sign is determined by the class label, is adopt in PCVM. In this paper, we present a joint classifier and feature learning algorithm: Feature Selection Probabilistic Classification Vector Machine (FPCVM). The improved Gaussian priors, named as truncated Gaussian prior, are introduced into the feature space for feature selection, and into the sample space to generate sparsity to the weight parameters, respectively. The expectation-maximization (EM) algorithm is employed to obtain a maximum a posteriori (MAP) estimation of these parameters. In experiments, both the accuracy of classification and performance of feature selection are evaluated on synthetic datasets, benchmark datasets and high-dimensional gene expression datasets.

Keywords :

Bayes methods; Gaussian processes; expectation-maximisation algorithm; feature selection; genetics; learning (artificial intelligence); pattern classification; probability; support vector machines; EM algorithm; FPCVM; MAP estimation; benchmark datasets; expectation-maximization algorithm; feature selection classification; feature selection performance; feature selection probabilistic classification vector machine; feature space; high-dimensional gene expression datasets; maximum a posteriori estimation; sparse Bayesian approach; synthetic datasets; truncated Gaussian prior; weight parameters; Bayes methods; Joints; Kernel; Mathematical model; Probabilistic logic; Support vector machines; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computational Intelligence in Big Data (CIBD), 2014 IEEE Symposium on

Conference_Location :

Orlando, FL

Type :

conf

DOI :

10.1109/CIBD.2014.7011521

Filename :

7011521

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1800027