DocumentCode
3664372
Title
Protein sub-nuclear location by fusing AAC and PSSM features based on sequence information
Author
Shuhui Liu;Shunfang Wang;Haiyan Ding
Author_Institution
School of Information Science and Engineering, Yunnan University, Kunming 650091, China
fYear
2015
fDate
5/1/2015 12:00:00 AM
Firstpage
236
Lastpage
239
Abstract
To achieve good performance on Protein sub-nuclear location, one needs to extract a powerful representation containing rich information for identification. Various favorable techniques have been proposed, but it is believed that the single representations, containing one-sided information of protein sequence, are insufficient for discrimination. To this end, we in this paper propose the fused representations by integrating two single representations, the amino acid composition (AAC) and the position specific scoring matrix (PSSM). Due to two forms of PSSM, PsePSSM and GreyPSSM, two integrated representations, called briefly AACPsePSSM and AACGreyPSSM, are given. To evaluate the proposed representations, a benchmark data set is employed and the classical K nearest neighbor (KNN) is adopted classifier. And the experimental results show our proposed fusion representations outperform AAC and PSSM.
Keywords
"Protein sequence","Accuracy","Amino acids","Benchmark testing","Feature extraction","Data mining"
Publisher
ieee
Conference_Titel
Electronics Information and Emergency Communication (ICEIEC), 2015 5th International Conference on
Print_ISBN
978-1-4799-7283-8
Type
conf
DOI
10.1109/ICEIEC.2015.7284529
Filename
7284529
Link To Document