Title :
Integration of Clustering and Multidimensional Scaling to Determine Phylogenetic Trees as Spherical Phylograms Visualized in 3 Dimensions
Author :
Yang Ruan ; House, Geoffrey L. ; Ekanayake, S. ; Schutte, Ursel ; Bever, James D. ; Haixu Tang ; Fox, G.
Author_Institution :
Sch. of Inf. & Comput., Indiana Univ., Bloomington, IN, USA
Abstract :
Phylogenetic analysis is commonly used to analyze genetic sequence data from fungal communities, while ordination and clustering techniques commonly are used to analyze sequence data from bacterial communities. However, few studies have attempted to link these two independent approaches. In this paper, we propose a method, which we call spherical phylogram (SP), to display the phylogenetic tree within the clustering and visualization result from a pipeline called DACIDR. In comparison with traditional tree display methods, the correlations between the tree and the clustering can be observed directly. In addition, we propose an algorithm called interpolative joining (IJ) to construct and visualize the SP in 3D space. In the experiments, we used the sum of branch lengths to quantify the general fit between the clustering and the phylogenetic tree in SP and Mantel tests to determine how well the same grouping of sequences was preserved between the clustering and the SP. Our results show that DACIDR has a classification accuracy that is similar to a phylogenetic tree generated using a multiple sequence alignment, while having much lower computational cost.
Keywords :
biology computing; data visualisation; evolution (biological); genetics; interpolation; microorganisms; pattern classification; pattern clustering; 3D space; DACIDR; IJ; Mantel tests; SP; bacterial communities; branch lengths; classification accuracy; clustering techniques; fungal communities; genetic sequence data; interpolative joining; multidimensional scaling; ordination; phylogenetic analysis; phylogenetic tree; sequence alignment; sequence data analysis; spherical phylogram; tree display methods; visualization; Clustering algorithms; Equations; Interpolation; Mathematical model; Phylogeny; Three-dimensional displays; Visualization; Environmental Genomics; Microbial Communities; Multidimensional Scaling; Phylogenetic Tree;
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
Conference_Location :
Chicago, IL
DOI :
10.1109/CCGrid.2014.126