DocumentCode :
819353
Title :
Knowledge-Driven Multidimensional Indexing Structure for Biomedical Media Database Retrieval
Author :
Scott, Grant ; Shyu, Chi-Ren
Author_Institution :
Dept. of Comput. Sci., Missouri Univ., Columbia, MO
Volume :
11
Issue :
3
fYear :
2007
fDate :
5/1/2007 12:00:00 AM
Firstpage :
320
Lastpage :
331
Abstract :
Today, biomedical media data are being generated at rates unimaginable only years ago. Content-based retrieval of biomedical media from large databases is becoming increasingly important to clinical, research, and educational communities. In this paper, we present the recently developed entropy balanced statistical (EBS) k-d tree and its applications to biomedical media, including a high-resolution computed tomography (HRCT) lung image database and the first real-time protein tertiary structure search engine. Our index utilizes statistical properties inherent in large-scale biomedical media databases for efficient and accurate searches. By applying concepts from pattern recognition and information theory, the EBS k-d tree is built through top-down decision tree induction. Experimentation shows similarity searches against a protein structure database of 53 363 structures consistently execute in less than 8.14 ms for the top 100 most similar structures. Additionally, we have shown improved retrieval precision over adaptive and statistical k-d trees. Retrieval precision of the EBS k-d tree is 81.6% for content-based retrieval of HRCT lung images and 94.9% at 10% recall for protein structure similarity search. The EBS k-d tree has enormous potential for use in biomedical applications embedded with ground-truth knowledge and multidimensional signatures
Keywords :
computerised tomography; content-based retrieval; database indexing; decision trees; entropy; information theory; lung; medical computing; molecular biophysics; molecular configurations; pattern recognition; proteins; statistical analysis; visual databases; biomedical media database retrieval; content-based retrieval; decision tree induction; entropy balanced statistical k-d tree; ground-truth knowledge; high-resolution computed tomography; information theory; knowledge-driven multidimensional indexing; large-scale biomedical media databases; lung image database; multidimensional signatures; pattern recognition; real-time protein tertiary structure search engine; retrieval precision; statistical properties; Computed tomography; Content based retrieval; Entropy; Image databases; Image retrieval; Indexing; Information retrieval; Lungs; Multidimensional systems; Proteins; Biomedical media; content-based retrieval; databases; indexing;
fLanguage :
English
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
Publisher :
ieee
ISSN :
1089-7771
Type :
jour
DOI :
10.1109/TITB.2006.880551
Filename :
4167901
Link To Document :
بازگشت