Title :
Normalized Feature Vectors: A Novel Alignment-Free Sequence Comparison Method Based on the Numbers of Adjacent Amino Acids
Author :
De-Shuang Huang ; Hong-Jie Yu
Author_Institution :
Sch. of Electron. & Inf. Eng., Tongji Univ., Shanghai, China
Abstract :
Based on the 400 possible pairs of adjacent amino acids (AAA), we map each protein primary sequence of length L into a 400 by (L-1) matrix M. From the singular value decomposition (SVD) of this matrix, we then derive a normalized 400-tuple mathematical descriptor D for the sequence. The resulting 400-dimensional normalized feature vectors (NFVs) support quantitative analysis of protein sequences. Using this normalized representation, we analyze sequence similarity on two data sets: 1) ND5 sequences from nine species and 2) transferrin sequences from 24 vertebrates. We also compare our results with those of related works. The two experiments indicate that the proposed NFV-AAA approach performs well in sequence similarity analysis.
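The pipeline described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the entries of M are defined or how D is read off from the SVD, so two choices below are assumptions made only for illustration and may differ from the authors' method: column j of M holds the running counts of each adjacent pair up to position j, and D is taken as the leading left singular vector scaled to unit length. The names `pair_matrix`, `feature_vector`, and `distance` are hypothetical, and the Euclidean distance is likewise an assumed similarity measure.

```python
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
# Map each of the 20 x 20 = 400 adjacent pairs to a row index.
PAIR_INDEX = {a + b: 20 * i + j
              for i, a in enumerate(AMINO_ACIDS)
              for j, b in enumerate(AMINO_ACIDS)}


def pair_matrix(seq):
    """Build a 400 x (L-1) matrix from a protein sequence.

    ASSUMPTION: entry (i, j) is the number of times pair i occurs
    among the first j+1 adjacent pairs (running counts). The source
    only states the matrix shape, not how entries are defined.
    """
    length = len(seq)
    if length < 2:
        raise ValueError("sequence needs at least two residues")
    indicator = np.zeros((400, length - 1))
    for pos in range(length - 1):
        # Raises KeyError for non-standard residue codes such as 'X'.
        indicator[PAIR_INDEX[seq[pos:pos + 2]], pos] = 1.0
    return np.cumsum(indicator, axis=1)


def feature_vector(seq):
    """Return a unit-length 400-dimensional descriptor.

    ASSUMPTION: the descriptor is the leading left singular vector.
    Its sign is arbitrary, so absolute values are used.
    """
    u, _, _ = np.linalg.svd(pair_matrix(seq), full_matrices=False)
    d = np.abs(u[:, 0])
    return d / np.linalg.norm(d)


def distance(seq_a, seq_b):
    """Euclidean distance between two descriptors (assumed measure)."""
    return float(np.linalg.norm(feature_vector(seq_a) - feature_vector(seq_b)))
```

Because every descriptor has the same length regardless of L, sequences of different lengths can be compared directly without alignment, for example with `distance("ACDEFGHIK", "ACDEFGHIKLMN")`.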
Keywords :
biology computing; molecular biophysics; proteins; proteomics; singular value decomposition; vectors; ND5 sequences; NFV; SVD; adjacent amino acids; alignment-free sequence; normalized feature vectors; protein primary sequence; protein sequences; similarity analysis; singular values decomposition; transferrin sequences; Amino acids; Bioinformatics; Educational institutions; Feature extraction; Proteins; Vectors; Adjacent amino acids; alignment free; normalized feature vector; singular value decomposition (SVD); Algorithms; Amino Acid Sequence; Amino Acids; Animals; Computational Biology; Databases, Protein; Humans; Phylogeny; Sequence Analysis, Protein; Vertebrates;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2013.10