Title :
An analysis of a fuzzy dissimilarity measure to perform Escherichia coli source tracking
Author :
Suh, Hyo-Jin ; Keller, James M. ; Carson, C. Andrew
Author_Institution :
Dept. of Comput. Eng. & Comput. Sci., Missouri Univ., USA
Abstract :
To identify the source of Escherichia coli (E.coli) fecal bacterial contamination, we propose a fuzzy dissimilarity measure to calculate the similarity between the E.coli DNA patterns. The fuzzy dissimilarity measure preserves the dimension of the DNA patterns and at the same time allows variation among same host patterns. The fuzzy dissimilarity measure produces a dissimilarity matrix, a form of relational data. For classification of this type of data representation we present a weighted k-nearest neighbor algorithm. The weighted k.nearest neighbor technique uses the classical k-nearest neighbor rule but solves the problem of ´tie´ between multi-classes. In addition, we suggest an ensemble data set method for sample sets with a large range of class sizes. The proposed system showed potential as a stable system in detecting fecal bacterial hosts and as a base for future studies in interpreting DNA patterns.
Keywords :
DNA; fuzzy set theory; microorganisms; pattern classification; E.coli DNA patterns; Escherichia coli fecal bacterial contamination; Escherichia coli source tracking; classical k-nearest neighbor rule; data set method; detecting fecal bacterial hosts; dissimilarity matrix; fuzzy dissimilarity measurment analysis; fuzzy logic; fuzzy set theory; potential stable system; relational data; tie problem; weighted k-nearest neighbor algorithm; weighted k-nearest neighbor technique; Contamination; DNA; Fingerprint recognition; Fuzzy neural networks; Humans; Microorganisms; Neural networks; Performance analysis; Performance evaluation; Pollution measurement;
Conference_Titel :
Fuzzy Systems, 2003. FUZZ '03. The 12th IEEE International Conference on
Print_ISBN :
0-7803-7810-5
DOI :
10.1109/FUZZ.2003.1206540