• DocumentCode
    2000772
  • Title

    Querying Semistructured Data with Compression in Distributed Environments

  • Author

    Alom, B. M Monjurul ; Henskens, Frans ; Hannaford, Michael

  • Author_Institution
    Sch. of Electr. Eng. & Comput. Sci., Univ. of Newcastle, Callaghan, NSW
  • fYear
    2009
  • fDate
    27-29 April 2009
  • Firstpage
    1546
  • Lastpage
    1553
  • Abstract
    As data management applications grow more complex, they are likely to need efficient distributed query processing. In Distributed Database Systems complete replication consists of maintaining complete copies of the database at each site; this has advantages such as highest locality of reference, highest reliability, availability, and is best for reading. The most promising and dominant data format for data processing and representing on the Internet is the semistructured data form termed XML. XML data has no fixed schema; it evolved and is self describing which results in management difficulties compared to, for example relational data. It is therefore a major challenge for the database community to design query languages and storage methods that can retrieve semistructured data. In this paper, we present a storing and querying scheme for semistructured data views of relational form in distributed environments. The proposed technique stores path dictionary, word dictionary, attribute dictionary, and the complete compressed replication of semistructured data in each distributed site of the DDBS. The presented technique provides query performance improvement due to the compression of semistructured data.
  • Keywords
    Internet; XML; distributed databases; query processing; relational databases; Internet; XML data; attribute dictionary; data management application; data processing; database community; distributed database system; distributed environment; distributed query processing; dominant data format; eXtensible Markup Language; performance improvement; query languages; relational data; semistructured data; stores path dictionary; word dictionary; Availability; Data processing; Database systems; Dictionaries; Distributed databases; Internet; Maintenance; Query processing; Relational databases; XML; Bitmap Indexing; Dictionary; Distributed database; XML; XQuery;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: New Generations, 2009. ITNG '09. Sixth International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    978-1-4244-3770-2
  • Electronic_ISBN
    978-0-7695-3596-8
  • Type

    conf

  • DOI
    10.1109/ITNG.2009.221
  • Filename
    5070847