DocumentCode :
2988356
Title :
Gene classification artificial neural system
Author :
Wu, Cathy H. ; Chen, Hsi-Lien
Author_Institution :
Dept. of Epidemiol. & Biomath., Texas Univ. Health Center, Tyler, TX, USA
fYear :
1995
fDate :
29-31 May 1995
Firstpage :
102
Lastpage :
107
Abstract :
A gene classification artificial neural system has been developed for rapid annotation of the molecular sequencing data being generated by the Human Genome Project. Three neural networks have been implemented, one full-scale system to classify protein sequences according to PIR (protein identification resources) superfamilies, one system to classify ribosomal RNA sequences into RDP (ribosomal database project) phylogenetic classes, and one pilot system to classify proteins according to Blocks motifs. The sequence encoding schema involved an n-gram hashing method to convert molecular sequences into neural input vectors, a SVD (singular value decomposition) method to compress vectors, and a term weighting method to extract motif information. The neural networks used were three-layered, feed-forward networks that employed backpropagation or counter-propagation learning paradigms. The system runs faster by one to two orders of magnitude than existing method and has a sensitivity of 85 to 100%. The gene classification artificial neural system is available on the Internet, and may be extended into a gene identification system for classifying indiscriminately sequenced DNA fragments
Keywords :
backpropagation; biology computing; encoding; feedforward neural nets; genetics; multilayer perceptrons; neural nets; pattern classification; string matching; Blocks motifs; Human Genome Project; Internet; PIR; RDP; SVD; backpropagation; counter-propagation; gene classification artificial neural system; gene identification; hashing; indiscriminately sequenced DNA fragments; molecular sequences; molecular sequencing data; motif information; protein identification resource superfamilies; protein sequence classification; rapid annotation; ribosomal RNA sequence classification; ribosomal database project phylogenetic classes; sequence encoding schema; singular value decomposition; term weighting; three-layered feedforward neural networks; vector compression; Artificial neural networks; Bioinformatics; Databases; Encoding; Genomics; Humans; Phylogeny; Proteins; RNA; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligence in Neural and Biological Systems, 1995. INBS'95, Proceedings., First International Symposium on
Conference_Location :
Herndon, VA
Print_ISBN :
0-8186-7116-5
Type :
conf
DOI :
10.1109/INBS.1995.404273
Filename :
404273
Link To Document :
بازگشت