DocumentCode :
3319975
Title :
An Efficient Mining Algorithm to Predict Transcription Factor DNA Binding Preferences
Author :
Zhou, Kun ; Bu, Daocheng ; Xiong, Yun
Author_Institution :
Res. Center for Data Sci., Fudan Univ., Shanghai, China
fYear :
2011
fDate :
10-12 May 2011
Firstpage :
1
Lastpage :
5
Abstract :
Transcription factor (TF) DNA binding preferences provide vast amounts of information about essential processes inside transcription regulatory mechanisms. Therefore, identifying DNA binding preferences of transcription factors is of prime importance. Several computational approaches have been proposed in previous works to develop a quick solution against the more expensive experiments. However, these computational approaches limit themselves to using existing biological information only while ignore the relationship between these properties. In this paper we take into account the weight and correlations of these biological properties and propose a novel computational approach, PWC (i.e., Predict TF DNA Binding preferences based on the Weight and Correlations of biological properties). By utilizing tf-idf method (term frequency inverse document frequency) to compute feature weight and GVSM (Generalized Vector Space Model) to compute feature correlations, PWC can provide a powerful approach to infer the TF DNA binding preferences. Our performance study on real data shows that PWC is better than other previous algorithms.
Keywords :
DNA; bioinformatics; data mining; molecular biophysics; proteins; GVSM; PWC; TF DNA binding preference; biological information; feature correlation; generalized vector space model; mining algorithm; tf-idf method; transcription factor DNA binding preference; transcription regulatory mechanism; Biological system modeling; Classification algorithms; Computational modeling; Correlation; DNA; Proteins;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedical Engineering, (iCBBE) 2011 5th International Conference on
Conference_Location :
Wuhan
ISSN :
2151-7614
Print_ISBN :
978-1-4244-5088-6
Type :
conf
DOI :
10.1109/icbbe.2011.5780163
Filename :
5780163
Link To Document :
بازگشت