Title :
Enhancer prediction using distance aware kernels
Author :
Hoang Van Thanh ; Tu Minh Phuong
Author_Institution :
Dept. of Comput. Sci., Posts & Telecommun., Inst. of Technol., Hanoi, Vietnam
Abstract :
The regulation of gene expression is important for the development of living cells and their responses to environmental conditions. This mechanism is controlled, to a large extend, by transcription factors that bind to regulatory sequences, such as enhancers. The identification of enhancers is therefore important for understanding the regulatory networks within cells. In this paper, we propose new features and kernels that can be used with support vector machine (SVM) classifiers to predict enhancers from genomic sequences. These are based on general sequence features and kernels but are extended to incorporate the information about the distance between the features, thus can better capture the spatial preferences and combinatorial binding rules of transcription factors. Experiments on predicting enhancers in human and Caenorhabditis elegans show that, by combining the proposed features and kernels with SVM, our method achieves state-of-the-art accuracy and outperforms a leading enhancer prediction method.
Keywords :
biology computing; combinatorial mathematics; genomics; pattern classification; prediction theory; support vector machines; Caenorhabditis elegans; SVM classifiers; combinatorial binding rules; distance aware kernels; enhancer prediction method; environmental conditions; gene expression regulation; genomic sequences; living cells development; regulatory networks; regulatory sequences; sequence features; sequence kernels; support vector machine; transcription factors; Bioinformatics; DNA; Entropy; Genomics; Kernel; Support vector machines; Vectors; DNA sequence feature; Enhancer prediction; kernel function; support vector machines;
Conference_Titel :
Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2013 IEEE RIVF International Conference on
Conference_Location :
Hanoi
Print_ISBN :
978-1-4799-1349-7
DOI :
10.1109/RIVF.2013.6719867