DocumentCode :
3602349
Title :
Inferring Sequential Order of Somatic Mutations during Tumorgenesis based on Markov Chain Model
Author :
Hao Kang ; Kwang-Hyun Cho ; Zhang, Xiaohua Douglas ; Tao Zeng ; Luonan Chen
Author_Institution :
Sch. of Life Sci. & Technol., Shanghai Tech Univ., Shanghai, China
Volume :
12
Issue :
5
fYear :
2015
Firstpage :
1094
Lastpage :
1103
Abstract :
Tumors are developed and worsen with the accumulated mutations on DNA sequences during tumorigenesis. Identifying the temporal order of gene mutations in cancer initiation and development is a challenging topic. It not only provides a new insight into the study of tumorigenesis at the level of genome sequences but also is an effective tool for early diagnosis of tumors and preventive medicine. In this paper, we develop a novel method to accurately estimate the sequential order of gene mutations during tumorigenesis from genome sequencing data based on Markov chain model as TOMC (Temporal Order based on Markov Chain), and also provide a new criterion to further infer the order of samples or patients, which can characterize the severity or stage of the disease. We applied our method to the analysis of tumors based on several high-throughput datasets. Specifically, first, we revealed that tumor suppressor genes (TSG) tend to be mutated ahead of oncogenes, which are considered as important events for key functional loss and gain during tumorigenesis. Second, the comparisons of various methods demonstrated that our approach has clear advantages over the existing methods due to the consideration on the effect of mutation dependence among genes, such as co-mutation. Third and most important, our method is able to deduce the ordinal sequence of patients or samples to quantitatively characterize their severity of tumors. Therefore, our work provides a new way to quantitatively understand the development and progression of tumorigenesis based on high throughput sequencing data.
Keywords :
DNA; Markov processes; bioinformatics; cancer; genetics; genomics; inference mechanisms; patient diagnosis; tumours; DNA sequences; Markov chain model; TOMC; TSG; Temporal Order based on Markov Chain; cancer; co-mutation; disease severity; disease stage; early tumor diagnosis; functional gain; functional loss; gene mutations; genome sequences; genome sequencing data; high throughput sequencing data; mutation dependence; oncogenes; preventive medicine; sequential gene mutation order; sequential order; somatic mutations; tumor suppressor genes; tumorgenesis; tumorigenesis; Accuracy; Bioinformatics; Cancer; Diseases; Lungs; Markov processes; Tumors; First hitting time; Markov chain; Mutation order; first hitting time; markov chain;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2015.2424408
Filename :
7110359
Link To Document :
بازگشت