DocumentCode :
3264509
Title :
Selected String Representation for Whole Genomes
Author :
Wu, Xiaomeng ; Lin, Guohui
Author_Institution :
Department of Computing Science, University of Alberta Edmonton, Alberta T6G 2E8, Canada
fYear :
2005
fDate :
14-15 Nov. 2005
Firstpage :
1
Lastpage :
8
Abstract :
The increase in the amount of available genomic data has made phylogenetic analysis possible at the whole genome scale. However, such a huge amount of data imposes computational challenges in both memory consumption and CPU usage. One novel proposal in this paper is to extract sequence patterns that are biologically meaningful. Using these patterns, whole genomes can be mapped into a significantly lower dimensional space and subsequent studies using these representations become computationally feasible. Experiments on two datasets of 64 vertebrate mitochondrial genomes and 99 prokaryote whole genomes demonstrate that the selected sequence patterns result in good quality evolutionary distances in terms of the final phylogeny.
Keywords :
Amino Acid Composition; Phylogeny; String Composition; String Selection; Amino acids; Bioinformatics; Biology computing; Computational complexity; Data mining; Frequency; Genomics; Peptides; Phylogeny; Proposals; Amino Acid Composition; Phylogeny; String Composition; String Selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology, 2005. CIBCB '05. Proceedings of the 2005 IEEE Symposium on
Print_ISBN :
0-7803-9387-2
Type :
conf
DOI :
10.1109/CIBCB.2005.1594905
Filename :
1594905
Link To Document :
بازگشت