DocumentCode :
3519208
Title :
Protein Sequence Motif Super-Rule-Tree (SRT) Structure Constructed by Hybrid Hierarchical K-Means Clustering Algorithm
Author :
Chen, Bernard ; He, Jieyue ; Pellicer, Steven ; Pan, Yi
Author_Institution :
Comput. Sci. Dept., Univ. of Central Arkansas, Conway, AK
fYear :
2008
fDate :
3-5 Nov. 2008
Firstpage :
98
Lastpage :
103
Abstract :
Protein sequence motifs information is crucial to the analysis of biologically significant regions. The conserved regions have the potential to determine the role of the proteins. Many algorithms or techniques to discover motifs require a predefined fixed window size in advance. Due to the fixed size, these approaches often deliver a number of similar motifs simply shifted by some bases or including mismatches. To confront the mismatched motifs problem, we use the super-rule concept to construct a Super-Rule-Tree (SRT) by a modified HHK clustering which requires no parameter setup to identify the similarities and dissimilarities between the motifs. By analyzing the motifs results generated by our approach, they are not only significant in sequence area but secondary structure similarity. We believe new proposed HHK clustering algorithm and SRT can play an important role in similar researches which requires predefined fixed window size.
Keywords :
bioinformatics; molecular configurations; pattern clustering; proteins; proteomics; statistics; trees (mathematics); hybrid hierarchical k-means clustering algorithm; mismatched motifs problem; modified HHK clustering; protein motif discovery; protein sequence motif SRT structure; secondary structure; super rule tree structure; Algorithm design and analysis; Association rules; Bioinformatics; Biology; Clustering algorithms; Computer science; Helium; Information analysis; Protein sequence; USA Councils; Hybrid Hierarchical K-means clustering algorithm; Super-Rule-Tree (SRT); protein sequence motif;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine, 2008. BIBM '08. IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-0-7695-3452-7
Type :
conf
DOI :
10.1109/BIBM.2008.11
Filename :
4684879
Link To Document :
بازگشت