Title :
Similarity analysis based on sparse representation for protein sequence comparison
Author :
Lina Yang ; Yuan Yan Tang ; Yulong Wang ; Huiwu Luo ; Jianjia Pan ; Haoliang Yuan ; Xianwei Zheng ; Chunli Li ; Ting Shu
Author_Institution :
Dept. of Comput. & Inf. Sci., Univ. of Macau, Macau, China
Abstract :
This paper propose a least square-based sparse representation algorithm to analyze similarity comparison of protein sequences in the area of bioinformatics and molecular biology, which helps the prediction and classification of protein structure and function. The protein sequences are represented into the 1-dimensional feature vectors by their biochemical quantities. Then using the least square method to form the feature vector. Through the similarity calculation, the distance matrix can be obtained, by which, the phylogenic tree can be constructed.We apply this approach by analyzing the ND5 (NADH dehydrogenase subunit 5) protein cluster dataset. The experimental results show that the proposed model is more accurate than the Su´s model,and it is closer with some known biological facts.
Keywords :
bioinformatics; data structures; least squares approximations; pattern classification; proteins; vectors; bioinformatics; feature vector; least square-based sparse representation algorithm; molecular biology; protein sequence; protein structure classification; protein structure prediction; similarity analysis; Amino acids; Encoding; Matching pursuit algorithms; Protein sequence; Training; Feature extraction; l1-regularized least squares; protein sequence analysis; sparse representation;
Conference_Titel :
Cybernetics (CYBCONF), 2015 IEEE 2nd International Conference on
Conference_Location :
Gdynia
Print_ISBN :
978-1-4799-8320-9
DOI :
10.1109/CYBConf.2015.7175964