• DocumentCode
    2454544
  • Title

    Using Randomised Vectors in Transcription Factor Binding Site Predictions

  • Author

    Rezwan, Faisal ; Sun, Yi ; Davey, Neil ; Adams, Rod ; Rust, Alistair G. ; Robinson, Mark

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Hertfordshire, Hatfield, UK
  • fYear
    2010
  • fDate
    12-14 Dec. 2010
  • Firstpage
    523
  • Lastpage
    527
  • Abstract
    Finding the location of binding sites in DNA is a difficult problem. Although the location of some binding sites have been experimentally identified, other parts of the genome may or may not contain binding sites. This poses problems with negative data in a trainable classifier. Here we show that using randomized negative data gives a large boost in classifier performance when compared to the original labeled data.
  • Keywords
    DNA; Gaussian processes; biology computing; data analysis; genomics; pattern classification; support vector machines; DNA binding site location; Gaussian kernel; classifier performance; genome; randomised vector; randomized negative data; support vector machine; transcription factor binding site prediction; Bioinformatics; Genomics; Mice; Prediction algorithms; Support vector machines; Training; Binding Site; Classification; Genes; Support Vector Machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-1-4244-9211-4
  • Type

    conf

  • DOI
    10.1109/ICMLA.2010.82
  • Filename
    5708880