DocumentCode :
2454544
Title :
Using Randomised Vectors in Transcription Factor Binding Site Predictions
Author :
Rezwan, Faisal ; Sun, Yi ; Davey, Neil ; Adams, Rod ; Rust, Alistair G. ; Robinson, Mark
Author_Institution :
Sch. of Comput. Sci., Univ. of Hertfordshire, Hatfield, UK
fYear :
2010
fDate :
12-14 Dec. 2010
Firstpage :
523
Lastpage :
527
Abstract :
Finding the location of binding sites in DNA is a difficult problem. Although the location of some binding sites have been experimentally identified, other parts of the genome may or may not contain binding sites. This poses problems with negative data in a trainable classifier. Here we show that using randomized negative data gives a large boost in classifier performance when compared to the original labeled data.
Keywords :
DNA; Gaussian processes; biology computing; data analysis; genomics; pattern classification; support vector machines; DNA binding site location; Gaussian kernel; classifier performance; genome; randomised vector; randomized negative data; support vector machine; transcription factor binding site prediction; Bioinformatics; Genomics; Mice; Prediction algorithms; Support vector machines; Training; Binding Site; Classification; Genes; Support Vector Machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-9211-4
Type :
conf
DOI :
10.1109/ICMLA.2010.82
Filename :
5708880
Link To Document :
بازگشت