Title of article :
Clustering of Protein Domains in the Human Genome
Author/Authors :
Lianne R Mayor، نويسنده , , Keiran P Fleming، نويسنده , , Arne Müller، نويسنده , , David J. Balding، نويسنده , , Michael J.E. Sternberg، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2004
Pages :
14
From page :
991
To page :
1004
Abstract :
We present a systematic study of the clustering of genes within the human genome based on homology inferred from both sequence and structural similarity. The 3D-Genomics automated proteome annotation pipeline () was utilised to infer homology for each protein domain in the genome, for the 26 superfamilies most highly represented in the Structural Classification Of Proteins (SCOP) database. This approach enabled us to identify homologues that could not be detected by sequence-based methods alone. For each superfamily, we investigated the distribution, both within and among chromosomes, of genes encoding at least one domain within the superfamily. The results indicate a diversity of clustering behaviours: some superfamilies showed no evidence of any clustering, and others displayed significant clustering either within or among chromosomes, or both. Removal of tandem repeats reduced the levels of clustering observed, but some superfamilies still displayed highly significant clustering. Thus, our study suggests that either the process of gene duplication, or the evolution of the resulting clusters, differs between structural superfamilies.
Keywords :
Tandem Repeats , protein domains , Bioinformatics , genome evolution , Gene clustering
Journal title :
Journal of Molecular Biology
Serial Year :
2004
Journal title :
Journal of Molecular Biology
Record number :
1243773
Link To Document :
بازگشت