DocumentCode :
313566
Title :
Motif neural network design for large-scale protein family identification
Author :
Wu, Cathy H. ; Zhao, Sheng ; Simmons, Kevin ; Shivakumar, Sailaja
Author_Institution :
Dept. of Epidemiology/Biomath., Texas Univ. Health Center, Tyler, TX, USA
Volume :
1
fYear :
1997
fDate :
9-12 Jun 1997
Firstpage :
86
Abstract :
This paper describes an application of artificial neural networks for the sequence analysis and management of the large and rapidly growing molecular databases. The neural network system, based on a motif identification neural design (MOTIFIND) that incorporates both global and motif sequence information, has been implemented for large-scale protein family identification. More than nine hundred backpropagation networks were trained, one for each protein family. The protein families were defined collectively by the ProSite and PIR databases. As a part of an integrated protein family identification system, the neural networks were used as filters to quickly detect potential new members in comprehensive searches against the two major protein sequence databases, SwissProt and PIR. The integrated system identified a large number of false negative members missed by both ProSite and PIR. The speed, sensitivity, general applicability, together with its capability to learn existing protein classification schemes, make the MOTIFIND neural network system an ideal tool for the full-scale protein family classification effort
Keywords :
backpropagation; biology computing; molecular biophysics; neural nets; pattern classification; proteins; query processing; very large databases; PIR database; ProSite database; SwissProt; backpropagation networks; molecular databases; motif neural network; protein classification; protein family identification; protein sequence databases; Artificial neural networks; Backpropagation; Bioinformatics; Databases; Genomics; Humans; Large-scale systems; Neural networks; Proteins; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks,1997., International Conference on
Conference_Location :
Houston, TX
Print_ISBN :
0-7803-4122-8
Type :
conf
DOI :
10.1109/ICNN.1997.611642
Filename :
611642
Link To Document :
بازگشت