Title :
Sequencing-by-hybridization revisited: the analog-spectrum proposal
Author :
Preparata, Franco P.
Author_Institution :
Dept. of Comput. Sci., Brown Univ., Providence, RI, USA
Abstract :
All published approaches to DNA sequencing by hybridization (SBH) consist of the biochemical acquisition of the spectrum of a target sequence (the set of its subsequences conforming to a given probing pattern) followed by the algorithmic reconstruction of the sequence from its spectrum. In the "standard" or "uniform" approach, the probing pattern is a string of length L and the length of reliably reconstructible sequences is known to be mlen = O(2L). For a fixed microarray area, higher sequencing performance can be achieved by inserting nonprobing gaps ("wild-cards") in the probing pattern. The reconstruction, however, must cope with the emergence of fooling probes due to the gaps and algorithmic failure occurs when the spectrum becomes too densely populated, although we can achieve mcomp = O(4L). Despite the combinatorial success of gapped probing, all current approaches are based on a biochemically unrealistic spectrum-acquisition model (digital-spectrum). The reality of hybridization is much more complex. Departing from the conventional model, in this paper, we propose an alternative, called the analog-spectrum model, which more closely reflects the biochemical process. This novel modeling reestablishes probe length as the performance-governing factor, adopting "semidegenerate bases" as suitable emulators of currently inadequate universal bases. One important conclusion is that accurate biochemical measurements are pivotal to the success of SBH. The theoretical proposal presented in this paper should be a convincing stimulus for the needed biotechnological work.
Keywords :
DNA; biochemistry; biological techniques; molecular biophysics; physiological models; DNA sequencing; analog-spectrum model; biochemical acquisition; biochemically unrealistic spectrum-acquisition model; fooling probes; gapped probing; hybridization; probe length; probing pattern; DNA; Labeling; Laboratories; Libraries; Optical wavelength conversion; Probes; Proposals; Sequences; Thermodynamics; Visualization; Index Terms- DNA sequencing; analog spectrum; gapped probes; microarrays; semidegenerate bases.; sequencing-by-hybridization; thermodynamics of hybridization; Algorithms; Computational Biology; DNA; Nucleic Acid Hybridization; Oligonucleotide Array Sequence Analysis; Sequence Analysis, DNA; Thermodynamics; Transition Temperature;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2004.12