Title :
Selection of DNA Markers
Author :
Hoogeboom, Hendrik Jan ; Kosters, Walter A. ; Laros, Jeroen F J
Author_Institution :
Univ. Leiden, Leiden
Abstract :
Given a genome, i.e., a long string over a fixed finite alphabet, the problem is to find short (dis)similar substrings. This computationally intensive task has many biological applications. We first describe an algorithm to detect substrings that have edit distances to a fixed substring at most equal to a given. We then propose an algorithm that finds the set of all substrings that have edit distances larger than to all others. Several applications are given, where attention is paid to practical biological issues such as hairpins and GC percentage. An experiment shows the potential of the methods.
Keywords :
DNA; biology computing; molecular biophysics; DNA marker selection; computationally intensive task; edit distance; fixed substring; genome; substrings detection; Algorithm design and analysis; Assembly; Bioinformatics; Biology computing; DNA; Genomics; Helium; Internet; Neural networks; Sequences; DNA; DNA, edit distance, primers, substrings, uniqueness; edit distance; primers; substrings; uniqueness;
Journal_Title :
Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on
DOI :
10.1109/TSMCC.2007.906060