DocumentCode
2682292
Title
Enhancing gene detection with computer generated intergenic regions
Author
Caballero, Juan ; Glusman, Gustavo
Author_Institution
Inst. for Syst. Biol., Seattle, WA, USA
fYear
2009
fDate
17-21 May 2009
Firstpage
1
Lastpage
4
Abstract
Coding and non-coding gene prediction is still a challenge. Diverse computer-based tools have been created to screen sequences using elaborate strategies for gene prediction. Many of these implement various statistical tests to measure the plausibility of a prediction, but until now a comprehensive negative control did not exist. We developed an algorithm that generates sequences with the characteristics of the intergenic regions of a genome, including nucleotide composition and typical inserted elements such as interspersed repeats, low-complexity sequences and pseudogenes. We also ran some gene prediction programs on the artificial sequences and on real intergenic regions to compare the two.
Keywords
biology computing; genetics; genomics; organic compounds; statistical testing; coding gene prediction; computer-based tool; enhancing gene detection; gene prediction; genome intergenic region; noncoding gene prediction; nucleotide composition; pseudogenes; statistical test; Bioinformatics; Biology computing; Character generation; DNA; Genetic mutations; Genomics; Organisms; Sequences; System testing; Systems biology
fLanguage
English
Publisher
IEEE
Conference_Titel
Genomic Signal Processing and Statistics, 2009. GENSIPS 2009. IEEE International Workshop on
Conference_Location
Minneapolis, MN
Print_ISBN
978-1-4244-4761-9
Electronic_ISBN
978-1-4244-4762-6
Type
conf
DOI
10.1109/GENSIPS.2009.5174347
Filename
5174347