• DocumentCode
    1968220
  • Title
    A New Complex Schema Matching System
  • Author
    Qian, Ying ; Zhang, Haitao ; Song, Jinling ; Liu, Zhenglin
  • Author_Institution
    Network & Modern Educ. Technol. Center, HeBei Normal Univ. of Sci. & Technol., Qin Huangdao, China
  • fYear
    2010
  • fDate
    30-31 Jan. 2010
  • Firstpage
    195
  • Lastpage
    198
  • Abstract
    Schema matching, the problem of finding semantic correspondences between elements of two schemas, plays a key role in many applications, such as data warehousing and e-commerce. Existing approaches to automating schema matching mostly focus on computing direct element matches (1:1 matches) between two schemas. However, relationships between real-world schemas involve many complex matches in addition to 1:1 matches. This paper introduces a new complex schema matching system called NCSM. First, a preprocessor filters out unreasonable matches based on data types and values, and a match generator employs a set of special-purpose searchers, each exploring a specialized portion of the search space, to discover 1:1 and complex matches. Then a similarity estimator scores the candidate matches and a match selector chooses the optimal ones. Finally, to handle opaque columns in the schemas being matched, a complementary matcher further discovers matching relations between those columns. NCSM can thereby discover more general and reasonable matching pairs. Experiments show that NCSM not only discovers matches between schemas more comprehensively but also improves matching recall and precision in practice.
  • Keywords
    data handling; learning (artificial intelligence); pattern matching; relational databases; search problems; complex schema matching system; data warehouse; e-commerce; machine learning; opaque columns; search space; similarity estimator; Computer networks; Computer science education; Educational technology; Information technology; Marine technology; Matched filters; Oceans; Relational databases; Systems engineering education; Underwater communication; complex matching; machine learning; schema matching;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Innovative Computing & Communication, 2010 Intl Conf on and Information Technology & Ocean Engineering, 2010 Asia-Pacific Conf on (CICC-ITOE)
  • Conference_Location
    Macao
  • Print_ISBN
    978-1-4244-5634-5
  • Electronic_ISBN
    978-1-4244-5635-2
  • Type
    conf
  • DOI
    10.1109/CICC-ITOE.2010.57
  • Filename
    5439260