Title :
Motif detection in protein sequences
Author :
Gao, Yuan ; Mathee, Kalai ; Narasimhan, Giri ; Wang, Xuning
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
We use methods from data mining and knowledge discovery to design an algorithm for detecting motifs in protein sequences. Based on this approach, we have implemented a program called “GYM”. The Helix-Turn-Helix Motif was used as a model system on which to test our program. The program was also extended to detect Homeodomain motifs. The detection results for the two motifs compare favorably with those of existing programs. In addition, the GYM program provides a lot of useful information about a given protein sequence.
Keywords :
biology computing; data mining; pattern recognition; proteins; GYM program; Helix-Turn-Helix Motif; Homeodomain motifs; knowledge discovery; motif detection; protein sequences; Algorithm design and analysis; DNA; Design methodology; Displays; Pharmaceuticals; Protein sequence; Sequences; Statistical analysis; System testing
Conference_Title :
String Processing and Information Retrieval Symposium, 1999 and International Workshop on Groupware
Conference_Location :
Cancun
Print_ISBN :
0-7695-0268-7
DOI :
10.1109/SPIRE.1999.796579