DocumentCode :
822467
Title :
SOPHIA: an interactive cluster-based retrieval system for the OHSUMED collection
Author :
Dobrynin, Vladimir ; Patterson, David ; Galushka, Mykola ; Rooney, Niall
Author_Institution :
Dept. of Programming Technol., St. Petersburg State Univ., Russia
Volume :
9
Issue :
2
fYear :
2005
fDate :
6/1/2005 12:00:00 AM
Firstpage :
256
Lastpage :
265
Abstract :
The ability to perform an exploratory search and retrieval of relevant documents from a large collection of domain-specific documents is an important requirement both in the field of medicine and in other areas. In this paper, we present an unsupervised distributional clustering technique called SOPHIA. SOPHIA provides a semantically meaningful visual clustering of the document corpus in conjunction with an intuitive interactive search facility. We assess the effectiveness of SOPHIA's cluster-based information retrieval for the MEDLINE test set collection known as OHSUMED.
Keywords :
information retrieval; information retrieval systems; interactive systems; medical information systems; MEDLINE testset collection; OHSUMED collection; SOPHIA; cluster-based information retrieval; document corpus; domain-specific document; exploratory search; interactive cluster-based retrieval system; interactive search facility; semantically meaningful visual clustering; unsupervised distributional clustering technique; Clustering algorithms; Euclidean distance; Helium; Indexing; Information retrieval; Knowledge engineering; Partitioning algorithms; Power system modeling; Testing; Vocabulary; Clustering; MEDLINE; information retrieval; Artificial Intelligence; Cluster Analysis; Information Storage and Retrieval; Vocabulary, Controlled
fLanguage :
English
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
Publisher :
IEEE
ISSN :
1089-7771
Type :
jour
DOI :
10.1109/TITB.2005.847184
Filename :
1435423