Title :
Scientific data classification: a case study
Author :
Chirn, Gung-Wei ; Wang, Zhiyuan ; Wan, Jason T L
Author_Institution :
Dept. of Comput. & Inf. Sci., New Jersey Inst. of Technol., Newark, NJ, USA
Abstract :
Scientific data classification is the activity of determining whether or not an unlabeled scientific object belongs to an existing class. It is an important operation in the management of scientific databases. The authors present a case study for scientific data classification. Specifically, they develop a tool for DNA sequence classification. The tool works by generating and matching gapped fingerprints of DNA sequences. Experimental results obtained by applying our tool to classifying a set of Alu sequences demonstrate the good performance of the tool. While the reported research focuses on DNA classification, the techniques should generalize to any domain (e.g. multimedia) where data are naturally represented as sequences
Keywords :
DNA; biology computing; data handling; pattern classification; pattern matching; scientific information systems; sequences; Alu sequences; DNA sequence classification; data sequences; gapped DNA sequence fingerprint generation; gapped DNA sequence fingerprint matching; scientific data classification; scientific database management; unlabeled scientific object; Artificial intelligence; Bioinformatics; Classification algorithms; Computer aided software engineering; DNA computing; Databases; Fingerprint recognition; Information science; Research and development; Sequences;
Conference_Titel :
Tools with Artificial Intelligence, 1997. Proceedings., Ninth IEEE International Conference on
Conference_Location :
Newport Beach, CA
Print_ISBN :
0-8186-8203-5
DOI :
10.1109/TAI.1997.632259