DocumentCode
1566583
Title
Integration of knowledge-discovery and artificial-intelligence approaches for promoter recognition in DNA sequences
Author
Huang, Yin-Fu ; Wang, Chia-Ming
Author_Institution
Graduate Sch. of Comput. Sci. & Inf. Eng., National Yunlin Univ. of Sci. & Technol., Taiwan
Volume
1
fYear
2005
Firstpage
459
Abstract
Bioinformatics nowadays is a very attractive field. Many fascinating biological problems were still unsolved, even after a great amount of diverse genomic sequences have been sequenced for the coming of post genome era. Currently available programs are far from powerful enough to recognize the regulatory signals completely. Researches have looked for various types of patterns around the transcription start site (TSS) and tried to translate those as classification rules; however, they were not always good solutions. In this paper, we proposed a new hybrid learning system to recognize the regulatory elements (i.e., promoter) in deoxyribonucleic acid (DNA) sequences. The proposed hybrid system calculated the distributions of oligo-nucleotides statistics as positional weight matrices which contribute to discriminate promoters from non-promoters. This study can help to locate the expressive regions of DNA, to foretell and to realize the properties, structures, and functions of the proteins that are synthesized starting from the coding region of DNA. The benchmark datasets were evaluated using the leave-one-out method. The experimental results demonstrate that the proposed system has higher accuracy than others.
Keywords
DNA; biology computing; data mining; genetics; learning (artificial intelligence); DNA sequences; artificial intelligence; bioinformatics; classification rules; deoxyribonucleic acid sequences; genomic sequences; hybrid learning system; knowledge discovery; leave-one-out method; oligo-nucleotides statistics; positional weight matrices; promoter recognition; transcription start site; Bioinformatics; Computer science; DNA; Genetics; Genomics; Knowledge engineering; Polymers; Proteins; RNA; Sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology and Applications, 2005. ICITA 2005. Third International Conference on
Print_ISBN
0-7695-2316-1
Type
conf
DOI
10.1109/ICITA.2005.162
Filename
1488848
Link To Document