• DocumentCode
    945737
  • Title

    A statistical model for microarrays, optimal estimation algorithms, and limits of performance

  • Author

    Vikalo, Haris ; Hassibi, Babak ; Hassibi, Arjang

  • Author_Institution
    Dept. of Electr. Eng., California Inst. of Technol., Pasadena, CA, USA
  • Volume
    54
  • Issue
    6
  • fYear
    2006
  • fDate
    6/1/2006 12:00:00 AM
  • Firstpage
    2444
  • Lastpage
    2455
  • Abstract
    DNA microarray technology relies on the hybridization process, which is stochastic in nature. Currently, probabilistic cross hybridization of nonspecific targets, as well as the shot noise (Poisson noise) originating from specific targets binding, are among the main obstacles for achieving high accuracy in DNA microarray analysis. In this paper, statistical techniques are used to model the hybridization and cross-hybridization processes and, based on the model, optimal algorithms are employed to detect the targets and to estimate their quantities. To verify the theory, two sets of microarray experiments are conducted: one with oligonucleotide targets and the other with complementary DNA (cDNA) targets in the presence of biological background. Both experiments indicate that, by appropriately modeling the cross-hybridization interference, significant improvement in the accuracy over conventional methods such as direct readout can be obtained. This substantiates the fact that the accuracy of microarrays can become exclusively noise limited, rather than interference (i.e., cross-hybridization) limited. The techniques presented in this paper potentially increase considerably the signal-to-noise ratio (SNR), dynamic range, and resolution of DNA and protein microarrays as well as other affinity-based biosensors. A preliminary study of the Cramer-Rao bound for estimating the target concentrations suggests that, in some regimes, cross hybridization may even be beneficial-a result with potential ramifications for probe design, which is currently focused on minimizing cross hybridization. Finally, in its current form, the proposed method is best suited to low-density arrays arising in diagnostics, single nucleotide polymorphism (SNP) detection, toxicology, etc. How to scale it to high-density arrays (with many thousands of spots) is an interesting challenge.
  • Keywords
    DNA; modelling; proteins; statistical analysis; Cramer-Rao bound; DNA microarrays; SNR; complementary DNA; cross-hybridization interference; cross-hybridization process; oligonucleotide targets; optimal estimation algorithms; protein microarrays; signal-to-noise ratio; statistical model; Biological system modeling; Biosensors; DNA; Dynamic range; Interference; Probes; Proteins; Signal resolution; Signal to noise ratio; Stochastic resonance; Cross hybridization; DNA microarrays; Poisson noise; maximum a posteriori; maximum likelihood; minimum-mean-square-error (MMSE) estimation; quantum-limited SNR; shot noise; statistical modeling;
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1053-587X
  • Type

    jour

  • DOI
    10.1109/TSP.2006.873716
  • Filename
    1634847