Title :
The development of a proteomic analyzing pipeline to identify proteins with multiple RRMs and predict their domain boundaries
Author :
Ko, Kyung Dae ; Liu, Chunmei ; Rwebangira, Mugizi Robert ; Burge, Legand ; Southerland, William
Author_Institution :
Dept. of Syst. Comput. Sci., Howard Univ., Washington, DC, USA
Abstract :
The RNA-recognition motif (RRM) is the most abundant RNA-binding domain involved in many post-transcriptional processes. Since RRM-containing proteins have different functions with similar domain architecture, it is challenging to implement an automated annotation tool for these proteins in proteomic analysis. In this study, we implemented a proteomic analyzing pipeline to identify proteins with multiple RRMs and predict their domain boundaries using specific PSSMs, domain architectures, and proteins with the same entity name. After clustering sequences on the basis of their evolutionary distances, a reference group is selected comparing domain architectures. Then, candidate proteins are collected in a proteome using specific PSSMs from seed alignments in PFAM. Finally, target proteins are identified using multiple alignments and phyolgenetic trees between candidate and reference proteins. Therefore, we identified 33 proteins close to 12 types of RRM containing proteins and their domain boundaries among 508 candidates from 33610 sequences in a human proteome.
Keywords :
DNA; biology computing; domain boundaries; molecular configurations; proteins; proteomics; RNA-binding domain; RNA-recognition motif; automated annotation tool; clustering sequences; domain architectures; domain boundaries; evolutionary distances; human proteome; multiple alignments; phyolgenetic trees; post-transcriptional processes; proteins; proteomic analyzing pipeline; Databases; Hidden Markov models; Humans; Pipelines; Proteins; Proteomics; Sensitivity;
Conference_Titel :
Bioinformatics and Biomedicine Workshops (BIBMW), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1612-6
DOI :
10.1109/BIBMW.2011.6112401