Title of article :
Analysis on the Distribution of Bases in 1487 Human Protein Coding Sequences
Author/Authors :
Zhang، نويسنده , , Chun-Ting and Zhan، نويسنده , , Yong، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 1994
Pages :
5
From page :
161
To page :
165
Abstract :
The occurrence frequencies of bases A, C, G and T, denoted by a, c, g and t, respectively, in 1487 human protein coding sequences have been calculated and analyzed. The analysis has been performed by a diagrammatic method presented recently, in which each coding sequence is represented by a point in 3-D space. The distribution of points gives the observer an overall and intuitive picture of the base frequencies. The distance between a point and the origin of the co-ordinate, which corresponds to the case of a = c = g = t = 1/4, is called the radical distance. The radical distribution of 1487 points in 3-D space has been found to be normal, with the center basically coinciding with the origin of the co-ordinate. We have found that among 1487 coding sequences, an empirical rule a2 + c2 + g2 + t2 < 1/3 holds for 1486 sequences. The only sequence in which the above rule does not hold is the one coding for the human parathymosin protein. The composition of amino acids and the structural class of this protein has been studied in some detail.
Journal title :
Journal of Theoretical Biology
Serial Year :
1994
Journal title :
Journal of Theoretical Biology
Record number :
1532333
Link To Document :
بازگشت