• DocumentCode
    2682292
  • Title

    Enhancing gene detection with computer generated intergenic regions

  • Author

    Caballero, Juan ; Glusman, Gustavo

  • Author_Institution
    Inst. for Syst. Biol., Seattle, WA, USA
  • fYear
    2009
  • fDate
    17-21 May 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Coding and non-coding gene prediction is still a challenge. Diverse computer-based tools have been created to screen sequences using elaborate strategies for gene prediction. Many of these implement various statistical tests to measure the plausibility of the prediction but until now, a comprehensive negative control did not exist. We developed an algorithm that generates sequences with characteristics of the intergenic regions of a genome, including nucleotide composition and typical inserted elements like interspersed repeats, low complexity sequences and pseudogenes. We also challenged some gene prediction programs to compare the artificial sequences with real intergenic regions.
  • Keywords
    biology computing; genetics; genomics; organic compounds; statistical testing; coding gene prediction; computer-based tool; enhancing gene detection; gene prediction; genome intergenic region; noncoding gene prediction; nucleotide composition; pseudogenes; statistical test; Bioinformatics; Biology computing; Character generation; DNA; Genetic mutations; Genomics; Organisms; Sequences; System testing; Systems biology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genomic Signal Processing and Statistics, 2009. GENSIPS 2009. IEEE International Workshop on
  • Conference_Location
    Minneapolis, MN
  • Print_ISBN
    978-1-4244-4761-9
  • Electronic_ISBN
    978-1-4244-4762-6
  • Type

    conf

  • DOI
    10.1109/GENSIPS.2009.5174347
  • Filename
    5174347