DocumentCode
945737
Title
A statistical model for microarrays, optimal estimation algorithms, and limits of performance
Author
Vikalo, Haris ; Hassibi, Babak ; Hassibi, Arjang
Author_Institution
Dept. of Electr. Eng., California Inst. of Technol., Pasadena, CA, USA
Volume
54
Issue
6
fYear
2006
fDate
6/1/2006
Firstpage
2444
Lastpage
2455
Abstract
DNA microarray technology relies on the hybridization process, which is stochastic in nature. Currently, probabilistic cross hybridization of nonspecific targets and the shot noise (Poisson noise) originating from the binding of specific targets are among the main obstacles to achieving high accuracy in DNA microarray analysis. In this paper, statistical techniques are used to model the hybridization and cross-hybridization processes and, based on the model, optimal algorithms are employed to detect the targets and to estimate their quantities. To verify the theory, two sets of microarray experiments are conducted: one with oligonucleotide targets and the other with complementary DNA (cDNA) targets in the presence of biological background. Both experiments indicate that, by appropriately modeling the cross-hybridization interference, a significant improvement in accuracy over conventional methods such as direct readout can be obtained. This supports the claim that the accuracy of microarrays can become exclusively noise limited, rather than interference (i.e., cross-hybridization) limited. The techniques presented in this paper could considerably increase the signal-to-noise ratio (SNR), dynamic range, and resolution of DNA and protein microarrays as well as other affinity-based biosensors. A preliminary study of the Cramér-Rao bound for estimating the target concentrations suggests that, in some regimes, cross hybridization may even be beneficial, a result with potential ramifications for probe design, which is currently focused on minimizing cross hybridization. Finally, in its current form, the proposed method is best suited to low-density arrays arising in diagnostics, single nucleotide polymorphism (SNP) detection, toxicology, etc. How to scale it to high-density arrays (with many thousands of spots) is an interesting challenge.
Keywords
DNA; modelling; proteins; statistical analysis; Cramer-Rao bound; DNA microarrays; SNR; complementary DNA; cross-hybridization interference; cross-hybridization process; oligonucleotide targets; optimal estimation algorithms; protein microarrays; signal-to-noise ratio; statistical model; Biological system modeling; Biosensors; DNA; Dynamic range; Interference; Probes; Proteins; Signal resolution; Signal to noise ratio; Stochastic resonance; Cross hybridization; DNA microarrays; Poisson noise; maximum a posteriori; maximum likelihood; minimum-mean-square-error (MMSE) estimation; quantum-limited SNR; shot noise; statistical modeling;
fLanguage
English
Journal_Title
Signal Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1053-587X
Type
jour
DOI
10.1109/TSP.2006.873716
Filename
1634847