Title :
A Feature Selection Algorithm for Detecting Subtype Specific Functional Sites from Protein Sequences for Smad Receptor Binding
Author :
Marchiori, Elena ; Pirovano, Walter ; Heringa, Jaap ; Feenstra, K. Anton
Author_Institution :
Centre for Integrative Bioinformatics, Vrije Univ., Amsterdam
Abstract :
Multiple sequence alignments are often used to reveal functionally important residues within a protein family. In particular, they can be very useful for identification of key residues that determine functional differences between protein subclasses (subtype specific sites). This paper proposes a new algorithm for selecting subtype specific sites from a set of aligned protein sequences. The algorithm combines a feature selection technique with neighbor position information for selecting and ranking a set of putative relevant sites. The algorithm is applied to a dataset of protein sequences from the MH2 domain of the SMAD family of transcriptor factors. Validation of the results on the basis of the known interaction and function of the sites shows that the algorithm successfully identifies the known (from literature) subtype specific sites and new putative ones
Keywords :
biology computing; feature extraction; proteins; set theory; MH2 domain; feature selection algorithm; multiple sequence alignment; protein sequence; putative relevant site; smad receptor binding; Adhesives; Amino acids; Bioinformatics; Boosting; Cellular networks; Entropy; Genomics; Proteins; Signal processing; State estimation;
Conference_Titel :
Machine Learning and Applications, 2006. ICMLA '06. 5th International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7695-2735-3
DOI :
10.1109/ICMLA.2006.7