Title :
DNA base-calling using artificial neural networks
Author :
Khan, Omniyah Gul M ; Assaleh, Khaled T. ; Husseini, Ghaleb A. ; Majdalawieh, Amin F. ; Woodward, Scott R.
Author_Institution :
Dept. of Electr. Eng., American Univ. of Sharjah, Sharjah, United Arab Emirates
Abstract :
Information acquired from any species genomic sequence is expected to contribute massively to advances in various fields, such as medicine, forensics and agriculture. This huge impact of DNA sequencing leads to the need for efficient automation of mapping chromatogram traces to their corresponding string of bases through base-calling. This paper attempts to solve the problem of base-calling by modeling traces using Artificial Neural Networks (ANN). Traces, belonging to Homo sapiens, Saccharomyces mikatae and Drosophila melanogaster, undergo pre-processing, which includes de-correlation, de-convolution and normalization, to minimize or eliminate data imperfections. Representative features are then extracted for training and testing the ANN base-caller. Results obtained are then compared with the existing standards, PHRED and ABI KB base-caller in terms of deletion, insertion and substitution errors. Simulation results indicate that the proposed model achieve a higher base-calling accuracy when compared to PHRED and a comparable performance when compared to ABI KB. The results obtained validate the potential of the proposed model for efficient DNA base-calling.
Keywords :
DNA; biology computing; feature extraction; genomics; neural nets; ABI KB base-caller; ANN base-caller; DNA base-calling; DNA sequencing; PHRED base-caller; artificial neural networks; mapping chromatogram automation; species genomic sequence; trace modelling; Artificial neural networks; DNA; Feature extraction; Hidden Markov models; Mathematical model; Testing; Training;
Conference_Titel :
Biomedical Engineering (MECBME), 2011 1st Middle East Conference on
Conference_Location :
Sharjah
Print_ISBN :
978-1-4244-6998-7
DOI :
10.1109/MECBME.2011.5752074