DocumentCode
1791730
Title
An open schema for XML data in Hive
Author
Wuheng Luo ; Bo Liu ; Watfa, Allie K.
Author_Institution
Enterprise Data Archit., Sears Holdings Hoffman Estates, Hoffman Estates, IL, USA
fYear
2014
fDate
27-30 Oct. 2014
Firstpage
25
Lastpage
31
Abstract
Big data in XML format poses a challenge to distributed data systems. This paper proposes an open Hive schema approach to XML data placement in Hadoop. Placing XML data in Hive with this generic schema to build column-oriented and OLAP-focused XML data warehouse of heterogeneous content has benefits for data access, maintenance and scalability.
Keywords
Big Data; XML; data mining; data warehouses; parallel processing; Big Data; Hadoop; OLAP-focused XML data warehouse; XML data placement; XML format; column-oriented XML data warehouse; data access; data maintenance; data scalability; distributed data systems; heterogeneous content; open Hive schema approach; Availability; Big data; Books; Companies; Data models; Distributed databases; XML; Hadoop; Hive; XML; column-oriented; open schema; schema-less;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location
Washington, DC
Type
conf
DOI
10.1109/BigData.2014.7004409
Filename
7004409
Link To Document