• DocumentCode
    2618011
  • Title

    A first step for building a document warehouse: Unification of XML documents

  • Author

    Ben Messaoud, Ines ; Feki, Jamel ; Zurfluh, Gilles

  • Author_Institution
    Lab. Miracl, Univ. of Sfax, Sfax, Tunisia
  • fYear
    2012
  • fDate
    16-18 May 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    The Web plays a key role for information publication and exchange between organizations. In this context, the XML format becomes a common standard for data representation and exchange. On the other hand, XML documents constitute an important source for decisional analyses since they help decision makers to better understand and control the evolution of their business processes. However, even though several XML documents may belong to a same domain, they may be described by multiple structures. In this paper, we present a method to unify XML document structures in order to build a global and generic perception/view of heterogeneous documents, to store them as a document warehouse, and finally, to query them easily. We also describe our software prototype USD (Unification of Structures of XML Documents) which supports the proposed method. We illustrate its functionalities through an example.
  • Keywords
    Internet; XML; data warehouses; document handling; software prototyping; USD; XML documents; business processes; data exchange; data representation; document warehouse; information exchange; information publication; software prototype; unification of structures of XML documents; Business; Dictionaries; Merging; Semantics; Unified modeling language; XML; Document Warehouse; XML document; similarity factor; unification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Research Challenges in Information Science (RCIS), 2012 Sixth International Conference on
  • Conference_Location
    Valencia
  • ISSN
    2151-1349
  • Print_ISBN
    978-1-4577-1936-3
  • Electronic_ISBN
    2151-1349
  • Type

    conf

  • DOI
    10.1109/RCIS.2012.6240440
  • Filename
    6240440