DocumentCode :
1550365
Title :
Image Analysis and Length Estimation of Biomolecules Using AFM
Author :
Sundstrom, A. ; Cirrone, S. ; Paxia, S. ; Hsueh, Chien-Hsin ; Kjolby, R. ; Gimzewski, J.K. ; Reed, Jeff ; Mishra, Bud
Author_Institution :
Courant Inst. of Math. Sci., New York Univ., New York, NY, USA
Volume :
16
Issue :
6
fYear :
2012
Firstpage :
1200
Lastpage :
1207
Abstract :
There are many examples of problems in pattern analysis for which it is often possible to obtain systematic characterizations, if in addition a small number of useful features or parameters of the image are known a priori or can be estimated reasonably well. Often, the relevant features of a particular pattern analysis problem are easy to enumerate, as when statistical structures of the patterns are well understood from the knowledge of the domain. We study a problem from molecular image analysis, where such a domain-dependent understanding may be lacking to some degree and the features must be inferred via machine-learning techniques. In this paper, we propose a rigorous, fully automated technique for this problem. We are motivated by an application of atomic force microscopy (AFM) image processing needed to solve a central problem in molecular biology, aimed at obtaining the complete transcription profile of a single cell, a snapshot that shows which genes are being expressed and to what degree. Reed et al. (“Single molecule transcription profiling with AFM,” Nanotechnology, vol. 18, no. 4, 2007) showed that the transcription profiling problem reduces to making high-precision measurements of biomolecule backbone lengths, correct to within 20-25 bp (6-7.5 nm). Here, we present an image processing and length estimation pipeline using AFM that comes close to achieving these measurement tolerances. In particular, we develop a biased length estimator on trained coefficients of a simple linear regression model, biweighted by a Beaton-Tukey function, whose feature universe is constrained by James-Stein shrinkage to avoid overfitting. In terms of extensibility and addressing the model selection problem, this formulation subsumes the models we studied.
Keywords :
DNA; atomic force microscopy; biology computing; cellular biophysics; image processing; learning (artificial intelligence); molecular biophysics; molecular configurations; AFM; Beaton-Tukey function; James-Stein shrinkage; atomic force microscopy; biased length estimator; biomolecule backbone length; biomolecule image analysis; biomolecule length estimation; domain-dependent understanding; fully automated technique; gene; high-precision measurement; image processing; length estimation pipeline; linear regression model; machine-learning techniques; model selection problem; molecular biology; molecular image analysis; single cell transcription profile; Atomic force microscopy; DNA; Image processing; Length measurement; Linear regression; Machine learning; RNA; Supervised learning; Atomic force microscopy (AFM); Beaton–Tukey; DNA; RNA; biased estimation; biomolecule; biweight; cDNA; digital contour; image processing; length estimation; linear regression; machine learning; single molecule; supervised learning; Artificial Intelligence; DNA; Image Processing, Computer-Assisted; Linear Models; Microscopy, Atomic Force; Pattern Recognition, Automated;
fLanguage :
English
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
Publisher :
ieee
ISSN :
1089-7771
Type :
jour
DOI :
10.1109/TITB.2012.2206819
Filename :
6228535
Link To Document :
بازگشت