Title : 
An adaptation of Pfam profiles to predict protein sub-cellular localization in Gram positive bacteria
         
        
            Author : 
Arango-Argoty, G.A. ; Ruiz-Munoz, J.F. ; Jaramillo-Garzon, Jorge Alberto ; Castellanos-Dominguez, C.G.
         
        
            Author_Institution : 
Signal Process. & Recognition Group, Univ. Nac. de Colombia, Manizales, Colombia
         
        
        
            fDate : 
Aug. 28 2012-Sept. 1 2012
         
        
        
        
            Abstract : 
Predicting the sub-cellular localization of a protein can provide useful information to uncover its molecular functions. In this sense, numerous prediction techniques have been developed, which usually have been focused on global information of the protein or sequence alignments. However, several studies have shown that the functional nature of proteins is ruled by conserved sub-sequence patterns known as domains. In this paper, an alternative methodology (PfamFeat) for gram-positive bacterial sub-cellular localization was developed. PfamFeat is based on information provided by Pfam database, which stores a series of HMM-profiles describing common protein domains. The likelihood of a sequence, to be generated by a given HMM-profile, can be used to characterize sequences in order to use pattern recognition techniques. Success rates obtained with a simple one-nearest neighbor classifier demonstrate that this method is competitive with popular sub-cellular prediction algorithms and it constitutes a promising research trend.
         
        
            Keywords : 
bioinformatics; cellular biophysics; medical signal processing; microorganisms; molecular biophysics; molecular configurations; pattern recognition; proteins; signal classification; Gram positive bacteria; Pfam profiles; PfamFeat method; molecular functions; one-nearest neighbor classifier; pattern recognition techniques; protein domains; protein sub-cellular localization; sequence alignments; sub-cellular prediction algorithms; Databases; Extracellular; Hidden Markov models; Microorganisms; Proteins; Training; Algorithms; Computational Biology; Gram-Positive Bacteria; Subcellular Fractions;
         
        
        
        
            Conference_Titel : 
Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE
         
        
            Conference_Location : 
San Diego, CA
         
        
        
            Print_ISBN : 
978-1-4244-4119-8
         
        
            Electronic_ISBN : 
1557-170X
         
        
        
            DOI : 
10.1109/EMBC.2012.6347252