Title :
GeneTide - Terra Incognita Discovery Endeavor mining ESTs and expression data to elucidate known and de-novo GeneCards® genes
Author :
Shklar, Maxim ; Shmueli, Orit ; Strichman-Almashanu, Liora ; Shmoish, Michael ; Lancet, Doron ; Safran, Marilyn
Author_Institution :
Dept. of Molecular Genetics, Weizmann Inst. of Sci., Rehovot, Israel
Abstract :
The construction of a complete EST-based gene index is an intricate task yet to be accomplished. GeneTide, the Gene Terra Incognita Discovery Endeavor (http://genecards.weizmann.ac.il/genetide/), which is the newest addition to the GeneCards suite of databases, comprehensively maps >4.5 of the ∼5.5 million human ESTs currently available at dbEST with either known or newly defined putative human genes. The association is accomplished via data mining genomic resources, and integrating using a unified scoring scheme. Groups of unassociated transcripts serve as a basis for defining EST-based gene candidates (EGCs). These EGCs are annotated with various parameters, including expression data, to determine their validity as possible de-novo genes. An immediate application of GeneTide to microarray annotation has increased, in a specific example, the number of annotated Affymetrix HGU95A-E probe sets by 50% in comparison to previous attempts.
Keywords :
biology computing; data mining; genetics; Affymetrix HGU95A-E probe sets; Gene Terra Incognita Discovery Endeavor; GeneTide; data mining; de novo GeneCards genes; expressed sequence tags-based gene index; expression data; putative human genes; Bioinformatics; Biological information theory; Biology; Data mining; Databases; Genetics; Genomics; Humans; Probes; Throughput;
Conference_Titel :
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN :
0-7695-2194-0
DOI :
10.1109/CSB.2004.1332466