DocumentCode
3663120
Title
A stochastic model for genomic interspersed duplication
Author
Farzad Farnoud;Moshe Schwartz;Jehoshua Bruck
Author_Institution
Electrical Engineering, California Institute of Technology, Pasadena, 91125, USA
fYear
2015
fDate
6/1/2015
Firstpage
904
Lastpage
908
Abstract
Mutation processes such as point mutation, insertion, deletion, and duplication (including tandem and interspersed duplication) play an important role in evolution, as they lead to genomic diversity and thus to phenotypic variation. In this work, we study the expressive power of interspersed duplication, i.e., its ability to generate diversity, via a simple but fundamental stochastic model in which the length and location of the duplicated substring and the insertion point of the copy are chosen randomly. We investigate the properties of the set of high-probability sequences in these stochastic systems. In particular, we provide results on the asymptotic behavior of the frequencies of symbols and strings in a sequence evolving through interspersed duplication. The study of such systems is an important step towards the design and analysis of more realistic and sophisticated models of genomic mutation processes.
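The duplication step described in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the authors' model: the abstract says the length, location, and insertion point are chosen randomly but does not give the distributions, so the uniform choices below, the length cap `max_len`, and all function names are hypothetical.

```python
import random
from collections import Counter


def interspersed_duplication_step(seq, rng, max_len=3):
    """Copy a random substring of seq and insert the copy at a random point.

    Assumptions (not taken from the paper): the length is uniform on
    1..min(max_len, len(seq)), the start position is uniform over valid
    starts, and the insertion point is uniform over the len(seq) + 1 gaps.
    """
    n = len(seq)
    length = rng.randint(1, min(max_len, n))   # length of the duplicated substring
    start = rng.randint(0, n - length)         # location of the substring
    insert_at = rng.randint(0, n)              # gap where the copy is inserted
    copy = seq[start:start + length]
    return seq[:insert_at] + copy + seq[insert_at:]


def simulate(seed_seq, steps, seed=0, max_len=3):
    """Apply the duplication step repeatedly, starting from seed_seq."""
    rng = random.Random(seed)
    seq = seed_seq
    for _ in range(steps):
        seq = interspersed_duplication_step(seq, rng, max_len)
    return seq


def symbol_frequencies(seq):
    """Return the empirical frequency of each symbol in seq."""
    counts = Counter(seq)
    return {symbol: count / len(seq) for symbol, count in counts.items()}


if __name__ == "__main__":
    evolved = simulate("ACGT", steps=1000, seed=1)
    print(len(evolved), symbol_frequencies(evolved))
```

Tracking `symbol_frequencies` over increasing numbers of steps gives an empirical view of the kind of asymptotic frequency behavior the abstract refers to. The cap `max_len` is there only to keep the sequence length growing slowly in this sketch.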
Keywords
"Genomics","Bioinformatics","Stochastic processes","Approximation methods","DNA","Electronic mail","Analytical models"
Publisher
IEEE
Conference_Titel
Information Theory (ISIT), 2015 IEEE International Symposium on
Electronic_ISBN
2157-8117
Type
conf
DOI
10.1109/ISIT.2015.7282586
Filename
7282586