Title :
Cellular automata in protein coding region identification
Author :
Maji, Pradipta ; Parua, Samik ; Das, Sumanta ; Pal Chaudhuri, P.
Author_Institution :
Dept. of Comput. Sci. & Eng., Netaji Subhash Eng. Coll., Calcutta, India
Abstract :
Genes contain their information as a specific sequence of nucleotides or bases that are found in DNA molecules. These specific sequences of bases encode instructions on how to make proteins. But, the regions of these genes that code for proteins may occupy only a small region of the sequence. Identifying the coding region is of vital importance in understanding these genes. In this paper, we propose a cellular automata (CA) based pattern classifier to identify the coding region of a DNA sequence. CA is very simple, efficient, and produces more accurate classifier than that have previously been obtained for a range of different sequence lengths. Extensive experimental results establish that the proposed classifier is a cost-effective alternative in protein coding region identification problem.
Keywords :
DNA; cellular automata; genetics; medical computing; pattern classification; proteins; DNA molecules; DNA sequence; cellular automata; genetics; nucleotide sequences; pattern classification; protein coding region identification problem; Biological information theory; Biology computing; Computer applications; Computer science; DNA; Educational institutions; Gene expression; Information technology; Protein engineering; Sequences;
Conference_Titel :
Intelligent Sensing and Information Processing, 2005. Proceedings of 2005 International Conference on
Print_ISBN :
0-7803-8840-2
DOI :
10.1109/ICISIP.2005.1529502