Title :
Thermodynamic Post-Processing versus GC-Content Pre-Processing for DNA Codes Satisfying the Hamming Distance and Reverse-Complement Constraints
Author :
Tulpan, Dan ; Smith, Derek H. ; Montemanni, Roberto
Author_Institution :
Nat. Res. Council Canada, Inf. & Commun. Technol., Moncton, NB, Canada
Abstract :
Stochastic, meta-heuristic and linear construction algorithms for the design of DNA strands satisfying Hamming distance and reverse-complement constraints often use a GC-content constraint to pre-process the DNA strands. Since GC-content is a poor predictor of DNA strand hybridization strength the strands can be filtered by post-processing using thermodynamic calculations. An alternative approach is considered here, where the algorithms are modified to remove consideration of GC-content and rely on post-processing alone to obtain large sets of DNA strands with satisfactory melting temperatures. The two approaches (pre-processing GC-content and post-processing melting temperatures) are compared and are shown to be complementary when large DNA sets are desired. In particular, the second approach can give significant improvements when linear constructions are used.
Keywords :
DNA; Hamming codes; biology computing; melting; molecular biophysics; molecular configurations; stochastic processes; thermodynamics; DNA codes; DNA strand hybridization strength; GC-content preprocessing; Hamming distance; large DNA sets; linear construction algorithms; linear constructions; meta-heuristic algorithms; post-processing melting temperatures; reverse-complement constraints; stochastic algorithms; thermodynamic calculations; thermodynamic post-processing; Algorithm design and analysis; Bioinformatics; Computational biology; DNA; Hamming distance; Thermodynamics; DNA design; Hamming distance; linear codes; reverse-complement; stochastic local search;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2014.2299815