• DocumentCode
    610409
  • Title

    Big data integration

  • Author

    Dong, X.L. ; Srivastava, Divesh

  • Author_Institution
    AT&T Labs.-Res., Florham Park, NJ, USA
  • fYear
    2013
  • fDate
    8-12 April 2013
  • Firstpage
    1245
  • Lastpage
    1248
  • Abstract
    The Big Data era is upon us: data is being generated, collected and analyzed at an unprecedented scale, and data-driven decision making is sweeping through all aspects of society. Since the value of data explodes when it can be linked and fused with other data, addressing the big data integration (BDI) challenge is critical to realizing the promise of Big Data. BDI differs from traditional data integration in many dimensions: (i) the number of data sources, even for a single domain, has grown to be in the tens of thousands, (ii) many of the data sources are very dynamic, as a huge amount of newly collected data are continuously made available, (iii) the data sources are extremely heterogeneous in their structure, with considerable variety even for substantially similar entities, and (iv) the data sources are of widely differing qualities, with significant differences in the coverage, accuracy and timeliness of data provided. This seminar explores the progress that has been made by the data integration community on the topics of schema mapping, record linkage and data fusion in addressing these novel challenges faced by big data integration, and identifies a range of open problems for the community.
  • Keywords
    data integration; decision making; sensor fusion; BDI; big data integration; data driven decision making; data fusion; data integration community; data sources; record linkage; schema mapping; unprecedented scale; Big data; Couplings; Data integration; Databases; Educational institutions; Joining processes; Seminars;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2013 IEEE 29th International Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4673-4909-3
  • Electronic_ISBN
    1063-6382
  • Type

    conf

  • DOI
    10.1109/ICDE.2013.6544914
  • Filename
    6544914