DocumentCode :
3521894
Title :
Prediction of Protein Quaternary Structure by a Novel Manifold Learning Algorithm
Author :
Wang, Tong ; Wang, Jian ; Yao, Lixiu
Author_Institution :
Inst. of Comput. & Inf., Shanghai Second Polytech. Univ., Shanghai, China
fYear :
2011
fDate :
28-29 May 2011
Firstpage :
1
Lastpage :
3
Abstract :
With the explosion of protein sequences generated in the Post-Genomic Age, it is urgent to develop an automated method to predict protein quaternary structure. To explore this problem, we adopted an approach based on a sequence encoding descriptor by fusing PseAA (Pseudo Amino Acid) and DC (Dipeptide Composition) representing a protein sample. Here, a completely different approach, manifold learning algorithm MVP (Maximum variance projection) is introduced to extract the key features from the high-dimensional feature space. The dimension-reduced descriptor vector thus obtained is a compact representation of the original high dimensional vector. Our jackknife test results indicate that it is very promising to use the dimensionality reduction approaches to cope with complicated problems in biological systems, such as predicting the quaternary structure of proteins.
Keywords :
bioinformatics; biological techniques; data reduction; feature extraction; learning (artificial intelligence); molecular biophysics; molecular configurations; proteins; MVP; PseAA-DC fusion; automated protein quaternary structure prediction; dimension reduced descriptor vector; dimensionality reduction approach; dipeptide composition; feature extraction; high dimensional feature space; manifold learning algorithm; maximum variance projection; protein sequences; pseudo-amino acid; sequence encoding descriptor; Amino acids; Biological systems; Classification algorithms; Feature extraction; Prediction algorithms; Proteins; Support vector machine classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems and Applications (ISA), 2011 3rd International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-9855-0
Electronic_ISBN :
978-1-4244-9857-4
Type :
conf
DOI :
10.1109/ISA.2011.5873419
Filename :
5873419
Link To Document :
بازگشت