DocumentCode :
1650326
Title :
Prediction of microRNA Hairpins using One-Class Support Vector Machines
Author :
Tran, Dang Hung ; Pham, Tho Hoan ; Satou, Kenji ; Ho, Tu Bao
Author_Institution :
Japan Adv. Inst. of Sci. & Technol., Kanazawa
fYear :
2008
Firstpage :
33
Lastpage :
36
Abstract :
MicroRNAs(miRNAs) are small molecular non- coding RNAs that have important roles in the post-transcriptional mechanism of animals and plants. They are commonly 21-25 nucleotides (nt) long and derived from 60-90 nt RNA hairpin structures, called miRNA hairpins. A larger amount of sequence segments in the human genome have been computationally identified with such a 60-90 nt hairpin, however a majority of them are not miRNA hairpins. Most computational methods so far for predicting miRNA hairpins were based on a two-class classifier to distinguish between miRNA hairpins and the sequence segments with a hairpin structures. The difficulty of these methods is how to select hairpins as the negative examples of miRNA hairpins in the classifier-training datasets since only few miRNA hairpins are available. Therefore, their classifier may be mis-trained due to some false negative examples of the training dataset. In this paper, we introduce a one-class support vector machine (SVM) method to predict miRNA hairpins from the hairpin structures. Different from existing methods for predicting miRNA hairpins, the one-class SVM model is trained only on the information of the miRNA class. We also illustrated some examples of predicting miRNA hairpins in human chromosomes 10, 15, and 21 where our method overcomes the above disadvantage of existing two- class methods.
Keywords :
biology computing; cellular biophysics; macromolecules; molecular biophysics; molecular configurations; support vector machines; animals; classifier-training datasets; human chromosomes; human genome; microRNA hairpin structures; molecular noncoding RNAs; nucleotides; one-class support vector machines; plants; sequence segments; Bioinformatics; Biological cells; Educational technology; Genomics; Humans; Predictive models; RNA; Sequences; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedical Engineering, 2008. ICBBE 2008. The 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-1747-6
Electronic_ISBN :
978-1-4244-1748-3
Type :
conf
DOI :
10.1109/ICBBE.2008.15
Filename :
4534895
Link To Document :
بازگشت