Title :
Promoter prediction using DNA numerical representation and neural network: Case study with three organisms
Author :
Arniker, Swarna Bai ; Kwan, Hon Keung ; Law, Ngai-Fong ; Lun, Daniel Pak-Kong
Author_Institution :
Directorate of Laser Syst., Res. Centre Imarat, Hyderabad, India
Abstract :
Promoter recognition in various organisms is an area of interest in bioinformatics. In this paper, a feed-forward neural network classifier is presented to predict promoters in three organisms using a DNA numerical representation approach. The proposed system was found to be able to predict promoters with a sensitivity of 87%, 87%, 99% while reducing false prediction rate for non-promoter sequences with a specificity of 92%, 94%, 99% for the human, Drosophila melanogaster, and Arabidopsis thaliana sequences respectively. The results show that feed-forward neural networks can extract the statistical characteristics of promoters efficiently, and that the 2-bit binary coding for DNA data is suitable for the Berkeley Human and Drosophila datasets and the 4-bit binary is suitable for the TAIR Arabidopsis thaliana data sets. Another result demonstrated here is that the proposed prediction system is reconfigurable and versatile with a reduced architecture and computational complexity.
Keywords :
DNA; bioinformatics; feedforward neural nets; statistical analysis; DNA numerical representation; TAIR Arabidopsis thaliana data sets; computational complexity; feed-forward neural network classifier; neural network; promoter prediction; statistical characteristics; Biological neural networks; DNA; Encoding; Genomics; Humans; Sensitivity; Arabidopsis thaliana; DNA numerical representation; Drosophila melanogaster; bioinformatics; neural networks; promoter recognition;
Conference_Titel :
India Conference (INDICON), 2011 Annual IEEE
Conference_Location :
Hyderabad
Print_ISBN :
978-1-4577-1110-7
DOI :
10.1109/INDCON.2011.6139397