DocumentCode
140273
Title
Multi-level gene/MiRNA feature selection using deep belief nets and active learning
Author
Ibrahim, Roliana ; Yousri, Noha A. ; Ismail, Muhammad Ali ; El-Makky, Nagwa M.
Author_Institution
Comput. & Syst. Eng. Dept., Alexandria Univ., Alexandria, Egypt
fYear
2014
fDate
26-30 Aug. 2014
Firstpage
3957
Lastpage
3960
Abstract
Selecting the most discriminative genes/miRNAs has been raised as an important task in bioinformatics to enhance disease classifiers and to mitigate the dimensionality curse problem. Original feature selection methods choose genes/miRNAs based on their individual features regardless of how they perform together. Considering group features instead of individual ones provides a better view for selecting the most informative genes/miRNAs. Recently, deep learning has proven its ability in representing the data in multiple levels of abstraction, allowing for better discrimination between different classes. However, the idea of using deep learning for feature selection is not widely used in the bioinformatics field yet. In this paper, a novel multi-level feature selection approach named MLFS is proposed for selecting genes/miRNAs based on expression profiles. The approach is based on both deep and active learning. Moreover, an extension to use the technique for miRNAs is presented by considering the biological relation between miRNAs and genes. Experimental results show that the approach was able to outperform classical feature selection methods in hepatocellular carcinoma (HCC) by 9%, lung cancer by 6% and breast cancer by around 10% in F1-measure. Results also show the enhancement in F1-measure of our approach over recently related work in [1] and [2].
Keywords
RNA; bioinformatics; cancer; cellular biophysics; data structures; feature selection; genetics; genomics; learning (artificial intelligence); lung; molecular biophysics; F1-measure; active learning; bioinformatics; breast cancer; data representation; deep belief nets; dimensionality curse problem; disease classifier enhancement; hepatocellular carcinoma; lung cancer; multilevel gene-MiRNA feature selection; Accuracy; Bioinformatics; Breast cancer; Gene expression; Lungs; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE
Conference_Location
Chicago, IL
ISSN
1557-170X
Type
conf
DOI
10.1109/EMBC.2014.6944490
Filename
6944490
Link To Document