Title :
Identification of transcription factor binding sites based on the Chi-Square (x2) distance of a probabilistic vector model
Author :
Huang, Lun ; Al Bataineh, Mohammad ; Atkin, G.E. ; Mohammed, Ismaeel ; Zhang, Wei ; Parra, Maria ; Del Mar Perez, Maria
Author_Institution :
ECE Dept., Illinois Inst. of Technol., Chicago, IL, USA
Abstract :
This paper describes a new approach for locating signals, such as promoter sequences, in nucleic acid sequences. Transcription factor (TF) binding to its DNA target site is a fundamental regulatory interaction. The most common model used to represent TF binding specificities is a position weight matrix (PWM), which assumes independence between binding positions. However, in many cases, this simplifying assumption does not hold. In this paper, we present a Chi-square ( x2 ) distance model, which is based on the distance between the profiles of component vectors. It is a novel probabilistic method for modeling TF-DNA interactions. Our approach uses x2 distances to represent TF binding specificities. Simulation results show that the proposed approach identifies TF binding sites significantly better than the PWM model method.
Keywords :
DNA; biology computing; statistical distributions; Chi-Square distance; DNA target site; nucleic acid sequences; position weight matrix; probabilistic vector model; signal location; transcription factor binding; Chi-square distance; Transcription Factor; promoter;
Conference_Titel :
BioMedical Information Engineering, 2009. FBIE 2009. International Conference on Future
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-4690-2
Electronic_ISBN :
978-1-4244-4692-6
DOI :
10.1109/FBIE.2009.5405793