• DocumentCode
    1654452
  • Title

    Data Cleansing Based on Mathematic Morphology

  • Author

    Tang, Sheng ; Chen, Si-Ping

  • Author_Institution
    Dept. of Biomed. Eng., Zhejiang Univ., Hangzhou
  • fYear
    2008
  • Firstpage
    755
  • Lastpage
    758
  • Abstract
    In the field of bioinformatics and medical image understanding, data noise frequently occurs and deteriorates the classification performance, therefore an effective data cleansing mechanism in the training data is often regarded as one of the major steps in the real world inductive learning applications. In this paper the related work on dealing with data noise is firstly reviewed, and then based on the principle of mathematic morphology, the morphological data cleansing algorithms are proposed and two concrete morphological algorithms, DILATE-DC and CLOSE-DC, are realized. The experiments which are arranged on 15 UCI datasets show that these morphological data cleansing algorithms can effectively improve the classification performance, comparing with other relative methods.
  • Keywords
    biology computing; learning by example; mathematical morphology; CLOSE-DC algorithm; DILATE-DC algorithm; bioinformatics; data cleansing; data noise; inductive learning; mathematic morphology; Bioinformatics; Biomedical engineering; Biomedical imaging; Clustering algorithms; Concrete; Filters; Mathematics; Morphology; Nearest neighbor searches; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering, 2008. ICBBE 2008. The 2nd International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-1747-6
  • Electronic_ISBN
    978-1-4244-1748-3
  • Type

    conf

  • DOI
    10.1109/ICBBE.2008.184
  • Filename
    4535064