Title :
A Grammar Based Approach for Mining Bioinformatics Databases
Author :
Quest, Daniel ; Ali, Hesham H.
Author_Institution :
University of Nebraska at Omaha
Abstract :
In this paper we introduce a new formal approach for mining biological data sets. The proposed grammar based approach provides a flexible and powerful tool for advanced sequence comparison and data mining. The approach benefits from the power of regular grammars in allowing the use of advanced queries in comparing sequences and searching for motifs or interior-sequence attributes in biological databases. The formal grammar and the corresponding data mining engine is capable of extracting records from biological databases, filtering a subset of those records for mining, and then sorting those records based on similarity scheme designed by the user. This model is based on the objective (ontology) of the user and scoring is dynamic and is provided at runtime.
Keywords :
Bioinformatics; Biology; Computer science; Data mining; Databases; Educational institutions; Engines; Filtering; Information science; Robustness;
Conference_Titel :
System Sciences, 2005. HICSS '05. Proceedings of the 38th Annual Hawaii International Conference on
Print_ISBN :
0-7695-2268-8
DOI :
10.1109/HICSS.2005.17