DocumentCode :
2583356
Title :
Protein motif searching through similar enriched Parikh vector identification
Author :
Huang, Xiaolu ; Ali, Hesham ; Sadanandam, Anguraj ; Singh, Rakesh
Author_Institution :
Dept. of Comput. Sci., Nebraska Univ., Omaha, NE, USA
fYear :
2005
fDate :
19-21 Oct. 2005
Firstpage :
285
Lastpage :
289
Abstract :
Biological researches have shown that some protein regions sharing similar functions or structures have inversed-ordered or highly dispersed sequence similarities and some intra-sequence similarities such as palindrome repeats also play important roles in protein folding. The current protein analysis tools cannot detect these "nontraditional" similarities. Although some tools can be modified for searching intra-sequence inversed or forward ordered similarities, their maximally optimal path processes will miss many suboptimal biologically meaningful similarities. The similar enriched Parikh vector searching (SRPVS) algorithm searches similarities by separating the subsequence composition and order information. The SRPVS first breaks sequences into groups of predefined-sized subsequences, each represented by an enriched Parikh vector (RPV); then similar RPV pairs (SRPV) are searched in each nonoverlapping RPV pair based on various order restrictions - forward, inversed, or shuffled. In this study, SRPVS has been applied to the protein ligand motif finding and the intra-sequence protein inversed repeats finding.
Keywords :
biology computing; molecular biophysics; molecular configurations; proteins; vectors; highly dispersed sequence similarities; intra-sequence protein inversed repeats finding; intra-sequence similarities; inversed-ordered sequence similarities; order information; palindrome repeats; protein analysis tools; protein folding; protein ligand motif finding; protein motif searching; similar enriched Parikh vector identification; subsequence composition; Amino acids; Biology; Computer science; Displays; Evolution (biology); Extracellular; Head; Hidden Markov models; Pathology; Protein sequence;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
Print_ISBN :
0-7695-2476-1
Type :
conf
DOI :
10.1109/BIBE.2005.49
Filename :
1544482
Link To Document :
بازگشت