• DocumentCode
    2847878
  • Title

    Integrating data from disparate sources: a mass collaboration approach

  • Author

    McCann, Robert ; Kramnik, Alexander ; Shen, Warren ; Varadarajan, Vanitha ; Sobulo, Olu ; Doan, AnHai

  • Author_Institution
    Illinois Univ., Chicago, IL, USA
  • fYear
    2005
  • fDate
    5-8 April 2005
  • Firstpage
    487
  • Lastpage
    488
  • Abstract
    The rapid growth of distributed data at enterprises and on the WWW has fueled significant interest in building data integration systems. Such a system provides users with a uniform query interface (called mediated schema) to a multitude of data sources, thus freeing them from manually querying each individual source. To address some problems in the MOBS (Mass Collaboration to Build Systems) project at the University of Illinois, we develop solutions that learn from the multitude of users in the integration environment to improve the accuracy of integration tools. The improved accuracy in turn can significantly reduce the workload of the system builder. In developing MOBS we address the following key challenges: (i) obtaining user participation, (ii) learning from user participation, and (iii) combining user answers.
  • Keywords
    distributed databases; query processing; user interfaces; MOBS project; data integration systems; disparate sources; distributed data; integration tools; mass collaboration approach; mediated schema; query interface; user participation; Collaboration; Collaborative tools; Collaborative work; Costs; Data engineering; Large scale integration; Monitoring; Recruitment; Sprites (computer); World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
  • ISSN
    1084-4627
  • Print_ISBN
    0-7695-2285-8
  • Type

    conf

  • DOI
    10.1109/ICDE.2005.81
  • Filename
    1410160