• DocumentCode
    2800646
  • Title

    Swift: Scalable weighted iterative sampling for flow cytometry clustering

  • Author

    Naim, Iftekhar ; Datta, Suprakash ; Sharma, Gaurav ; Cavenaugh, James S. ; Mosmann, Tim R.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Rochester, Rochester, NY, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    509
  • Lastpage
    512
  • Abstract
    Flow cytometry (FC) is a powerful technology for rapid multivariate analysis and functional discrimination of cells. Current FC platforms generate large, high-dimensional datasets which pose a significant challenge for traditional manual bivariate analysis. Automated multivariate clustering, though highly desirable, is also stymied by the critical requirement of identifying rare populations that form rather small clusters, in addition to the computational challenges posed by the large size and dimensionality of the datasets. In this paper, we address these twin challenges by developing a two-stage scalable multivariate parametric clustering algorithm. In the first stage, we model the data as a mixture of Gaussians and use an iterative weighted sampling technique to estimate the mixture components successively in order of decreasing size. In the second stage, we apply a graph-based hierarchical merging technique to combine Gaussian components with significant overlaps into the final number of desired clusters. The resulting algorithm offers a reduction in complexity over conventional mixture modeling while simultaneously allowing for better detection of small populations. We demonstrate the effectiveness of our method both on simulated data and actual flow cytometry datasets.
  • Keywords
    Gaussian processes; biological techniques; biology computing; cellular biophysics; graphs; iterative methods; merging; statistical analysis; Gaussian mixture model; cells; flow cytometry; graph-based hierarchical merging technique; scalable weighted iterative sampling; two-stage scalable multivariate parametric clustering algorithm; Biological system modeling; Cells (biology); Clustering algorithms; Computational biology; Gaussian processes; Immune system; Iterative algorithms; Merging; Power engineering and energy; Sampling methods; Flow cytometry; Gaussian mixture model; clustering; expectation-maximization; sampling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495653
  • Filename
    5495653