• DocumentCode
    3413719
  • Title

    A Framework for Mining Functional Dependencies from Large Distributed Databases

  • Author

    Ye, Feiyue ; Liu, Jixue ; Qian, Jin ; Xue, Xiaofeng

  • Author_Institution
    Coll. of Comput. Eng., Jiangsu Teachers Univ. of Technol., Changzhou, China
  • Volume
    3
  • fYear
    2010
  • fDate
    23-24 Oct. 2010
  • Firstpage
    109
  • Lastpage
    113
  • Abstract
    Discovering functional dependencies FDs from existing databases is important to knowledge discovery, machine learning and data quality assessment. A number of algorithms has been proposed in the literature for FD discovery. However these algorithms are designed to work with centralized databases. When they are applied to distributed databases, communication cost of transporting data from different sites makes the algorithms not efficient. In this paper, We analyze the characteristics of mining functional dependencies from large distributed database, and we propose an distributed mining framework for discovery FDs from distribute large databases. We develop a theorem that can prune candidate FDs effectively and extend the partition based approach for distributed databases.
  • Keywords
    data mining; distributed databases; very large databases; centralized database; data quality assessment; functional dependency mining; knowledge discovery; large distributed database; machine learning; partition based approach; Algorithm design and analysis; Computers; Data mining; Distributed databases; Partitioning algorithms; Relational databases; Data mining; Functional dependencies; Knowledge discovery; Mining functional dependencies;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on
  • Conference_Location
    Sanya
  • Print_ISBN
    978-1-4244-8432-4
  • Type

    conf

  • DOI
    10.1109/AICI.2010.262
  • Filename
    5656428