• DocumentCode
    822467
  • Title

    SOPHIA: an interactive cluster-based retrieval system for the OHSUMED collection

  • Author

    Dobrynin, Vladimir ; Patterson, David ; Galushka, Mykola ; Rooney, Niall

  • Author_Institution
    Dept. of Programming Technol., St. Petersburg State Univ., Russia
  • Volume
    9
  • Issue
    2
  • fYear
    2005
  • fDate
    6/1/2005 12:00:00 AM
  • Firstpage
    256
  • Lastpage
    265
  • Abstract
    The ability to perform an exploratory search and retrieval of relevant documents from a large collection of domain-specific documents is an important requirement both in the field of medicine and in other areas. In this paper, we present an unsupervised distributional clustering technique called SOPHIA. SOPHIA provides a semantically meaningful visual clustering of the document corpus in conjunction with an intuitive interactive search facility. We assess the effectiveness of SOPHIA's cluster-based information retrieval for the MEDLINE testset collection known as OHSUMED.
  • Keywords
    information retrieval; information retrieval systems; interactive systems; medical information systems; MEDLINE testset collection; OHSUMED collection; SOPHIA; cluster-based information retrieval; document corpus; domain-specific document; exploratory search; interactive cluster-based retrieval system; interactive search facility; semantically meaningful visual clustering; unsupervised distributional clustering technique; Clustering algorithms; Euclidean distance; Helium; Indexing; Information retrieval; Knowledge engineering; Partitioning algorithms; Power system modeling; Testing; Vocabulary; Clustering; MEDLINE; information retrieval; Artificial Intelligence; Cluster Analysis; Information Storage and Retrieval; Vocabulary, Controlled;
  • fLanguage
    English
  • Journal_Title
    Information Technology in Biomedicine, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1089-7771
  • Type

    jour

  • DOI
    10.1109/TITB.2005.847184
  • Filename
    1435423