DocumentCode
1654452
Title
Data Cleansing Based on Mathematic Morphology
Author
Tang, Sheng ; Chen, Si-Ping
Author_Institution
Dept. of Biomed. Eng., Zhejiang Univ., Hangzhou
fYear
2008
Firstpage
755
Lastpage
758
Abstract
In the field of bioinformatics and medical image understanding, data noise frequently occurs and deteriorates the classification performance, therefore an effective data cleansing mechanism in the training data is often regarded as one of the major steps in the real world inductive learning applications. In this paper the related work on dealing with data noise is firstly reviewed, and then based on the principle of mathematic morphology, the morphological data cleansing algorithms are proposed and two concrete morphological algorithms, DILATE-DC and CLOSE-DC, are realized. The experiments which are arranged on 15 UCI datasets show that these morphological data cleansing algorithms can effectively improve the classification performance, comparing with other relative methods.
Keywords
biology computing; learning by example; mathematical morphology; CLOSE-DC algorithm; DILATE-DC algorithm; bioinformatics; data cleansing; data noise; inductive learning; mathematic morphology; Bioinformatics; Biomedical engineering; Biomedical imaging; Clustering algorithms; Concrete; Filters; Mathematics; Morphology; Nearest neighbor searches; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedical Engineering, 2008. ICBBE 2008. The 2nd International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-1747-6
Electronic_ISBN
978-1-4244-1748-3
Type
conf
DOI
10.1109/ICBBE.2008.184
Filename
4535064
Link To Document