• DocumentCode
    2079871
  • Title

    Visualizing large-scale RDF data using Subsets, Summaries, and Sampling in Oracle

  • Author

    Sundara, Seema ; Atre, Medha ; Kolovski, Vladimir ; Das, Souripriya ; Wu, Zhe ; Chong, Eugene Inseok ; Srinivasan, Jagannathan

  • Author_Institution
    Oracle, Nashua, NH, USA
  • fYear
    2010
  • fDate
    1-6 March 2010
  • Firstpage
    1048
  • Lastpage
    1059
  • Abstract
    The paper addresses the problem of visualizing large scale RDF data via a 3-S approach, namely, by using, (1) Subsets: to present only relevant data for visualisation; both static and dynamic subsets can be specified, (2) Summaries: to capture the essence of RDF data being viewed; summarized data can be expanded on demand thereby allowing users to create hybrid (summary-detail) fisheye views of RDF data, and (3) Sampling: to further optimize visualization of large-scale data where a representative sample suffices. The visualization scheme works with both asserted and inferred triples (generated using RDF(S) and OWL semantics). This scheme is implemented in Oracle by developing a plug-in for the Cytoscape graph visualization tool, which uses functions defined in a Oracle PL/SQL package, to provide fast and optimized access to Oracle Semantic Store containing RDF data. Interactive visualization of a synthesized RDF data set (LUBM 1 million triples), two native RDF datasets (Wikipedia 47 million triples and UniProt 700 million triples), and an OWL ontology (eClassOwl with a large class hierarchy including over 25,000 OWL classes, 5,000 properties, and 400,000 class-properties) demonstrates the effectiveness of our visualization scheme.
  • Keywords
    data visualisation; knowledge representation languages; meta data; ontologies (artificial intelligence); optimisation; 3S approach; Cytoscape graph visualization; OWL ontology; Oracle; PL package; RDF data; SQL package; data summarization; data visualization; resource description framework; Data visualization; Delay; Displays; Large-scale systems; OWL; Ontologies; Packaging; Resource description framework; Sampling methods; Wikipedia;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2010 IEEE 26th International Conference on
  • Conference_Location
    Long Beach, CA
  • Print_ISBN
    978-1-4244-5445-7
  • Electronic_ISBN
    978-1-4244-5444-0
  • Type

    conf

  • DOI
    10.1109/ICDE.2010.5447795
  • Filename
    5447795