• DocumentCode
    2341181
  • Title

    Automated identification of single nucleotide polymorphisms from sequencing data

  • Author

    Takahashi, Masazumi ; Matsuda, Fumihiko ; Margetic, Nino ; Lathrop, Mark

  • Author_Institution
    Centre Nat. de Genotypage, Evry, France
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    87
  • Lastpage
    93
  • Abstract
    Single nucleotide polymorphisms (SNPs) provide abundant information about genetic variation. Large scale discovery of high frequency SNPs is being undertaken using various methods. However, the publicly available SNP data are not always accurate, and therefore should be verified. If only a particular gene locus is concerned, locus-specific polymerase chain reaction amplification may be useful. Problem of this method is that the secondary peak has to be measured. We have analyzed trace data from conventional sequencing equipment and found an applicable rule to discern SNPs from noise. We have developed software that integrates this function to automatically identify SNPs. The software works accurately for high quality sequences and also can detect SNPs in low quality sequences. Further, it can determine allele frequency, display this information as a bar graph and assign corresponding nucleotide combinations. It is very useful for identifying de novo SNPs in a DNA fragment of interest.
  • Keywords
    DNA; biology computing; noise; pattern recognition; polymorphism; allele frequency determination; automated identification; bar graph; genetic variation; high-frequency SNP; large-scale discovery; locus-specific polymerase chain reaction amplification; noise; nucleotide combination assignment; sequencing data; single nucleotide polymorphisms; trace data; Bioinformatics; DNA; Data analysis; Diseases; Frequency estimation; Genetics; Large-scale systems; Polymers; Sequences; Software quality;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics Conference, 2002. Proceedings. IEEE Computer Society
  • Print_ISBN
    0-7695-1653-X
  • Type

    conf

  • DOI
    10.1109/CSB.2002.1039332
  • Filename
    1039332