Title of article :
A top-down linguistic approach to the analysis of genomic sequences: The metabotropic glutamate receptors 1 and 5 in human and in mouse as a case study
Author/Authors :
Menconi، نويسنده , , Giulia and Puliti، نويسنده , , Aldamaria and Sbrana، نويسنده , , Isabella and Conti، نويسنده , , Valerio and Marangoni، نويسنده , , Roberto، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2011
Abstract :
This paper presents a top-down strategy to detect features in genomic sequences. The strategyʹs core is to exploit dictionary-based compression algorithms and analyse the content of the automatically generated dictionary. We classify the different over-represented segments and in the case study we correlate them to experimentally identified or theoretically forecasted biological features. A large spectrum analysis reveals that the only feature co-located with the a priori extracted segments is the torsional flexibility of DNA, while non-B DNA configurations are anti-localized and other features are mostly independent of the extracted sequences. This analysis unravels complex relationships between the linguistic structures investigated under our approach and some known biological features.
Keywords :
Over-represented segments , DNA flexibility , Combinatorics on words
Journal title :
Journal of Theoretical Biology
Journal title :
Journal of Theoretical Biology