DocumentCode
2341181
Title
Automated identification of single nucleotide polymorphisms from sequencing data
Author
Takahashi, Masazumi ; Matsuda, Fumihiko ; Margetic, Nino ; Lathrop, Mark
Author_Institution
Centre Nat. de Genotypage, Evry, France
fYear
2002
fDate
2002
Firstpage
87
Lastpage
93
Abstract
Single nucleotide polymorphisms (SNPs) provide abundant information about genetic variation. Large scale discovery of high frequency SNPs is being undertaken using various methods. However, the publicly available SNP data are not always accurate, and therefore should be verified. If only a particular gene locus is concerned, locus-specific polymerase chain reaction amplification may be useful. Problem of this method is that the secondary peak has to be measured. We have analyzed trace data from conventional sequencing equipment and found an applicable rule to discern SNPs from noise. We have developed software that integrates this function to automatically identify SNPs. The software works accurately for high quality sequences and also can detect SNPs in low quality sequences. Further, it can determine allele frequency, display this information as a bar graph and assign corresponding nucleotide combinations. It is very useful for identifying de novo SNPs in a DNA fragment of interest.
Keywords
DNA; biology computing; noise; pattern recognition; polymorphism; allele frequency determination; automated identification; bar graph; genetic variation; high-frequency SNP; large-scale discovery; locus-specific polymerase chain reaction amplification; noise; nucleotide combination assignment; sequencing data; single nucleotide polymorphisms; trace data; Bioinformatics; DNA; Data analysis; Diseases; Frequency estimation; Genetics; Large-scale systems; Polymers; Sequences; Software quality;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics Conference, 2002. Proceedings. IEEE Computer Society
Print_ISBN
0-7695-1653-X
Type
conf
DOI
10.1109/CSB.2002.1039332
Filename
1039332
Link To Document