DocumentCode :
1899974
Title :
Transcription Factor Discovery using Support Vector Machines and Heterogeneous Data
Author :
Barbe, J.F. ; Tewfik, Ahmed H. ; Khodursky, Arkady B.
Author_Institution :
Univ. of Minnesota, Minneapolis
fYear :
2007
fDate :
10-12 June 2007
Firstpage :
1
Lastpage :
4
Abstract :
In this work we analyze the suitability of expression and sequence data for discovery of co-regulatory relationships using Support Vector Machines. In addition, we try to assess the possibility of improving such results by heterogeneous data fusion and by estimating a probability of a correct classification. As shown in other studies, we have found that transcription co-expression is a good estimator for genetic co-regulation. We also have found some evidence that operator site sequence motifs can be used to estimate co-regulation, but the kernels used for feature extraction did not achieve classification rates comparable to expression data. Finally, the additional information provided by combining sequence and expression data can be exploited to estimate the probability of correct classification.
Keywords :
probability; support vector machines; coregulatory relationships; heterogeneous data; operator site sequence; probability estimation; sequence data; support vector machines; transcription factor discovery; DNA; Feature extraction; Hidden Markov models; Kernel; Polymers; Probability; RNA; Sequences; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genomic Signal Processing and Statistics, 2007. GENSIPS 2007. IEEE International Workshop on
Conference_Location :
Tuusula
Print_ISBN :
978-1-4244-0998-3
Electronic_ISBN :
978-1-4244-0999-0
Type :
conf
DOI :
10.1109/GENSIPS.2007.4365812
Filename :
4365812
Link To Document :
بازگشت