Title :
Multi-Label Learning With Fuzzy Hypergraph Regularization for Protein Subcellular Location Prediction
Author :
Jing Chen ; Yuan Yan Tang ; Chen, C.L.P. ; Bin Fang ; Yuewei Lin ; Zhaowei Shang
Author_Institution :
Fac. of Sci. & Technol., Univ. of Macau, Macau, China
Abstract :
Protein subcellular location prediction aims to predict the location where a protein resides within a cell using computational methods. Considering the main limitations of the existing methods, we propose a hierarchical multi-label learning model FHML for both single-location proteins and multi-location proteins. The latent concepts are extracted through feature space decomposition and label space decomposition under the nonnegative data factorization framework. The extracted latent concepts are used as the codebook to indirectly connect the protein features to their annotations. We construct dual fuzzy hypergraphs to capture the intrinsic high-order relations embedded in not only feature space, but also label space. Finally, the subcellular location annotation information is propagated from the labeled proteins to the unlabeled proteins by performing dual fuzzy hypergraph Laplacian regularization. The experimental results on the six protein benchmark datasets demonstrate the superiority of our proposed method by comparing it with the state-of-the-art methods, and illustrate the benefit of exploiting both feature correlations and label correlations.
Keywords :
Laplace equations; benchmark testing; biology computing; cellular biophysics; feature extraction; fuzzy systems; graph theory; learning (artificial intelligence); molecular biophysics; proteins; FHML; codebook; computational methods; dual fuzzy hypergraph Laplacian regularization; dual fuzzy hypergraphs; extracted latent concepts; feature correlations; feature space decomposition extraction; fuzzy hypergraph regularization; hierarchical multilabel learning; intrinsic high-order relations; label space decomposition; multilocation proteins; nonnegative data factorization framework; protein benchmark datasets; protein resides; protein subcellular location prediction; single-location proteins; subcellular location annotation information; Amino acids; Correlation; Feature extraction; Laplace equations; Predictive models; Proteins; Vectors; Dictionary learning; hypergraph regularization; multi-label learning; protein subcellular localization;
Journal_Title :
NanoBioscience, IEEE Transactions on
DOI :
10.1109/TNB.2014.2341111