• DocumentCode
    2331507
  • Title

    A Modified Density Based Outlier Mining Algorithm for Large Dataset

  • Author

    Yang, Peng ; Huang, Biao

  • Author_Institution
    Chongqing Univ. of Arts & Sci., Chongqing
  • fYear
    2008
  • fDate
    20-20 Nov. 2008
  • Firstpage
    37
  • Lastpage
    40
  • Abstract
    Outlier mining is to discover the objects with exceptional behavior in dataset. It is an important challenge from the knowledge discovery standpoint and attracts much attention recently. The density based outlier mining algorithm is an effective approach to detect anomalous points. However, such algorithms usually need amounts of computations. In this paper, we propose a modified density based detection algorithm which utilizes the data partitioning method. Furthermore, it presents some speedup strategies such as the introduction of module information to avoid large number of unnecessary computations while finding outliers. The algorithm is applied on both synthetic and real datasets and the experimental results show that it is efficient for outlier detection in large dataset.
  • Keywords
    data mining; anomalous points detection; data partitioning; knowledge discovery; large dataset; outlier mining; Data engineering; Detection algorithms; Engineering management; Information management; Information technology; Intrusion detection; Partitioning algorithms; Seminars; Space technology; Technology management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Future Information Technology and Management Engineering, 2008. FITME '08. International Seminar on
  • Conference_Location
    Leicestershire, United Kingdom
  • Print_ISBN
    978-0-7695-3480-0
  • Type

    conf

  • DOI
    10.1109/FITME.2008.106
  • Filename
    4746436