DocumentCode
3408286
Title
Shannon information in complete genomes
Author
Chang, Chang-Heng ; Hsieh, Li-Ching ; Chen, Ta-Yuan ; Chen, Hong-Da ; Luo, Liaofu ; Lee, Hoong-Chien
Author_Institution
Nat. Central Univ., Chung-li, Taiwan
fYear
2004
fDate
16-19 Aug. 2004
Firstpage
20
Lastpage
30
Abstract
Shannon information in the genomes of all completely sequenced prokaryotes and eukaryotes is measured for word lengths of two to ten letters. It is found that, in a scale-dependent way, the Shannon information in complete genomes is much greater than that in matching random sequences: thousands of times greater in the case of short words. Furthermore, with the exception of the 14 chromosomes of Plasmodium falciparum, the Shannon information in all available complete genomes belongs to a universality class given by an extremely simple formula. The data are consistent with a model for genome growth composed of two main ingredients: random segmental duplications that increase the Shannon information in a scale-independent way, and random point mutations that preferentially reduce the larger-scale Shannon information. The inference drawn from the present study is that the large-scale and coarse-grained growth of genomes was selectively neutral, and this suggests an independent corroboration of Kimura's neutral theory of evolution.
Keywords
biology computing; genetics; microorganisms; molecular biophysics; Kimura neutral evolution theory; Plasmodium falciparum; Shannon information; chromosomes; complete genomes; completely sequenced prokaryotes; eukaryotes; random point mutations; random segmental duplications; Bioinformatics; Biological cells; Electronic mail; Entropy; Frequency; Genetic mutations; Genomics; Length measurement; Physics; Random sequences
fLanguage
English
Publisher
IEEE
Conference_Titel
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN
0-7695-2194-0
Type
conf
DOI
10.1109/CSB.2004.1332413
Filename
1332413