DocumentCode :
3542681
Title :
Beyond seed match: Improving miRNA target prediction using PAR-CLIP data
Author :
Lu, Mingzhu ; Chen, C. L Philip ; Huang, Yufei
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Texas at San Antonio, San Antonio, TX, USA
fYear :
2011
fDate :
4-6 Dec. 2011
Firstpage :
127
Lastpage :
130
Abstract :
Since miRNA plays an important role in post-transcript regulation, many computational approaches have been proposed for miRNA target prediction. Yet, the existing algorithms lack the capability to predict the true target when the perfect seed match presents in mRNA sequences and methods based on seed-match still suffer from a high false positive rate. Therefore, this paper proposes a new prediction method that exploits the data produced by the PAR-CLIP, which is a recent high throughput, high precision technology for genome-wide miRNA targets. This algorithm searches true miRNA targets among the candidates with seed-matches by using machine learning approaches. The target prediction results on top 20 expressed miRNAs in HEK293 cells of AGO1-4 proteins PAR-CLIP data show that given presence of seed pairing, the proposed method greatly outperforms the traditional miRNA target prediction algorithms and improve the precision significantly. Because biologists usually need to mutate the seed region to validation the miRNA targets, and only capable of conducting biological experiments on limited miRNA and mRNA sequences due to the time and cost, the proposed approach will make significant impact on the biology and healthcare fields.
Keywords :
RNA; biology computing; data handling; genomics; learning (artificial intelligence); AGO1-4 proteins; HEK293 cells; PAR-CLIP data; mRNA sequences; machine learning; miRNA sequences; miRNA target prediction algorithms; post-transcript regulation; seed match; seed region; single-stranded ~22 nucleotides noncoding RNA; Bioinformatics; Context; Feature extraction; Genomics; Prediction algorithms; Training data; Gaussian Process; MicroRNA target prediction; PAR-CLIP; machine learning; seed matches;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genomic Signal Processing and Statistics (GENSIPS), 2011 IEEE International Workshop on
Conference_Location :
San Antonio, TX
ISSN :
2150-3001
Print_ISBN :
978-1-4673-0491-7
Electronic_ISBN :
2150-3001
Type :
conf
DOI :
10.1109/GENSiPS.2011.6169461
Filename :
6169461
Link To Document :
بازگشت