DocumentCode :
2800646
Title :
Swift: Scalable weighted iterative sampling for flow cytometry clustering
Author :
Naim, Iftekhar ; Datta, Suprakash ; Sharma, Gaurav ; Cavenaugh, James S. ; Mosmann, Tim R.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Rochester, Rochester, NY, USA
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
509
Lastpage :
512
Abstract :
Flow cytometry (FC) is a powerful technology for rapid multivariate analysis and functional discrimination of cells. Current FC platforms generate large, high-dimensional datasets which pose a significant challenge for traditional manual bivariate analysis. Automated multivariate clustering, though highly desirable, is also stymied by the critical requirement of identifying rare populations that form rather small clusters, in addition to the computational challenges posed by the large size and dimensionality of the datasets. In this paper, we address these twin challenges by developing a two-stage scalable multivariate parametric clustering algorithm. In the first stage, we model the data as a mixture of Gaussians and use an iterative weighted sampling technique to estimate the mixture components successively in order of decreasing size. In the second stage, we apply a graph-based hierarchical merging technique to combine Gaussian components with significant overlaps into the final number of desired clusters. The resulting algorithm offers a reduction in complexity over conventional mixture modeling while simultaneously allowing for better detection of small populations. We demonstrate the effectiveness of our method both on simulated data and actual flow cytometry datasets.
Keywords :
Gaussian processes; biological techniques; biology computing; cellular biophysics; graphs; iterative methods; merging; statistical analysis; Gaussian mixture model; cells; flow cytometry; graph-based hierarchical merging technique; scalable weighted iterative sampling; two-stage scalable multivariate parametric clustering algorithm; Biological system modeling; Cells (biology); Clustering algorithms; Computational biology; Gaussian processes; Immune system; Iterative algorithms; Merging; Power engineering and energy; Sampling methods; Flow cytometry; Gaussian mixture model; clustering; expectation-maximization; sampling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495653
Filename :
5495653
Link To Document :
بازگشت