Title :
SVMotif: A Machine Learning Motif Algorithm
Author :
Kon, Mark ; Fan, Yue ; Holloway, Dustin ; DeLisi, Charles
Author_Institution :
Boston Univ., Boston
Abstract :
We describe SVMotif, a support vector machine-based learning algorithm for identification of cellular DNA transcription factor (TF) motifs extrapolated from known TF-gene interactions. An important aspect of this procedure is its ability to utilize negative target information (examples of likely non-targets) as well as positive information. Applications involve situations where clusters of genes are distinguished in experiments with known transcription factors without known binding locations. We apply this to yeast TF data with target identifications from ChlP-chip and other sources, and compare performance with Gibbs sampling methods such as BioProspector. We verify that in yeast this method implies well-defined and cross-validated statistical correlations between TF binding and secondary motifs whose binding properties (either with the primary TF or other possible promoters) are not certain, and discuss some implications of this. SVMotif can be a useful standalone method or a complement to existing techniques, and it will be made publicly available.
Keywords :
DNA; biology computing; cellular biophysics; correlation methods; learning (artificial intelligence); sampling methods; statistical analysis; support vector machines; BioProspector; ChlP-chip; Gibbs sampling methods; SVMotif; cellular DNA transcription factor motifs; machine learning motif algorithm; negative target information; statistical correlations; support vector machine; Bioinformatics; DNA; Fungi; Gene expression; Learning systems; Machine learning; Machine learning algorithms; Proteins; Regulators; Sampling methods;
Conference_Titel :
Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
Conference_Location :
Cincinnati, OH
Print_ISBN :
978-0-7695-3069-7
DOI :
10.1109/ICMLA.2007.105