DocumentCode
2010898
Title
E2D: A Novel Tool for Annotating Protein Domains in Expressed Sequence Tags
Author
Lee, Guo-Hsing ; Chuang, Nai-Yu ; Lin, Wen-Dar ; Hsiao, Chung-Der ; Lee, Hahn-Ming ; Ho, Jan-Ming
Author_Institution
Inst. of Inf. Sci., Acad. Sinica, Taipei
fYear
2006
fDate
28-29 Sept. 2006
Firstpage
1
Lastpage
6
Abstract
The vast number of expressed sequence tags (ESTs) in public databases provides an important resource for comparative and functional genomics. A variety of methods based on homology search or domain profile search have been developed to functionally annotate protein domains in ESTs. However, these methods either ignore potentially valuable information from the homologues beyond the top N hits, or they are extremely time consuming. We provide an efficient and novel tool, called E2D (EST to Domain), which functionally annotates anonymous ESTs by recognizing potential domains from the enlarged hit proteins. Comparison with InterProScan shows that E2D is more efficient and effective for domain recognition. Additionally, we achieve 87.5% agreement with existing GO function annotations in TIGR through domain-GO mapping, which demonstrates the efficacy of our approach
Keywords
biology computing; genetics; proteins; domain profile search; domain recognition; expressed sequence tags; functional genomics; homology search; protein domain annotation; Assembly; Bioinformatics; Databases; Genomics; Hidden Markov models; Large-scale systems; Libraries; Pipelines; Protein engineering; Sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Bioinformatics and Computational Biology, 2006. CIBCB '06. 2006 IEEE Symposium on
Conference_Location
Toronto, Ont.
Print_ISBN
1-4244-0624-2
Electronic_ISBN
1-4244-0624-2
Type
conf
DOI
10.1109/CIBCB.2006.330967
Filename
4133203
Link To Document