DocumentCode
3478268
Title
A Fast Heuristic Algorithm for Similarity Search in Large DNA Databases
Author
Jeong, In-Seon ; Park, Kyoung-Wook ; Lim, Hyeong-Seok
fYear
2007
fDate
11-13 Oct. 2007
Firstpage
335
Lastpage
340
Abstract
Knowledge Discovery techniques seek to find new information about a domain. These techniques can either be manually performed by an expert, or automated using software algorithms (Machine Learning). However some domains (such as the field of lung function testing) contain volumes of data too vast for effective manual analysis, and require background knowledge too complex for Machine Learning algorithms. This study examines how the Multiple Classification Ripple-Down Rules (MCRDR) Knowledge Acquisition process can be adapted to develop a new Knowledge Discovery method, Exposed MCRDR. A prototype system was developed and tested in the domain of lung function. Preliminary results suggest that the EMCRDR method can be successfully applied to efficiently discover new knowledge in a complex domain. The study also reveals many potential areas of study and development for the MCRDR method, and Knowledge Acquisition and Knowledge Discovery methods in general.
Keywords
DNA; Databases; Heuristic algorithms; Knowledge acquisition; Lungs; Machine learning; Machine learning algorithms; Software algorithms; Software performance; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Frontiers in the Convergence of Bioscience and Information Technologies, 2007. FBIT 2007
Conference_Location
Jeju City
Print_ISBN
978-0-7695-2999-8
Type
conf
DOI
10.1109/FBIT.2007.147
Filename
4524129
Link To Document