DocumentCode :
6967
Title :
Multi-Label Learning With Fuzzy Hypergraph Regularization for Protein Subcellular Location Prediction
Author :
Jing Chen ; Yuan Yan Tang ; Chen, C.L.P. ; Bin Fang ; Yuewei Lin ; Zhaowei Shang
Author_Institution :
Fac. of Sci. & Technol., Univ. of Macau, Macau, China
Volume :
13
Issue :
4
fYear :
2014
fDate :
Dec. 2014
Firstpage :
438
Lastpage :
447
Abstract :
Protein subcellular location prediction aims to predict the location where a protein resides within a cell using computational methods. Considering the main limitations of the existing methods, we propose a hierarchical multi-label learning model FHML for both single-location proteins and multi-location proteins. The latent concepts are extracted through feature space decomposition and label space decomposition under the nonnegative data factorization framework. The extracted latent concepts are used as the codebook to indirectly connect the protein features to their annotations. We construct dual fuzzy hypergraphs to capture the intrinsic high-order relations embedded in not only feature space, but also label space. Finally, the subcellular location annotation information is propagated from the labeled proteins to the unlabeled proteins by performing dual fuzzy hypergraph Laplacian regularization. The experimental results on the six protein benchmark datasets demonstrate the superiority of our proposed method by comparing it with the state-of-the-art methods, and illustrate the benefit of exploiting both feature correlations and label correlations.
Keywords :
Laplace equations; benchmark testing; biology computing; cellular biophysics; feature extraction; fuzzy systems; graph theory; learning (artificial intelligence); molecular biophysics; proteins; FHML; codebook; computational methods; dual fuzzy hypergraph Laplacian regularization; dual fuzzy hypergraphs; extracted latent concepts; feature correlations; feature space decomposition extraction; fuzzy hypergraph regularization; hierarchical multilabel learning; intrinsic high-order relations; label space decomposition; multilocation proteins; nonnegative data factorization framework; protein benchmark datasets; protein resides; protein subcellular location prediction; single-location proteins; subcellular location annotation information; Amino acids; Correlation; Feature extraction; Laplace equations; Predictive models; Proteins; Vectors; Dictionary learning; hypergraph regularization; multi-label learning; protein subcellular localization;
fLanguage :
English
Journal_Title :
NanoBioscience, IEEE Transactions on
Publisher :
ieee
ISSN :
1536-1241
Type :
jour
DOI :
10.1109/TNB.2014.2341111
Filename :
6869041
Link To Document :
بازگشت