DocumentCode
1603148
Title
A fuzzy classifier to taxonomically group DNA fragments within a metagenome
Author
Nasser, Sara ; Breland, Adrienne ; Harris, Frederick C., Jr. ; Nicolescu, Monica
Author_Institution
Univ. of Nevada, Reno, NV
fYear
2008
Firstpage
1
Lastpage
6
Abstract
Extracting microorganisms from their natural environment has become a popular technique. These metagenomic fragments lack enough information that can mark them into taxonomic groups. In this paper, we implement a fuzzy k-means classifier to separate fragments into taxonomic groups present in a metagenomic data set. The fuzzy classifier is used to group shotgun sequence fragments as small as 500 base pairs according to their DNA signatures, namely GC content and oligonucleotide frequencies. A comparison of using different signatures is done and we analyze results and compare them. The classifier is also tested to classify acid mine drainage metagenome into classes to represent the major Archea and Bacteria groups. The classification achieved an accuracy of 99% for acid mine drainage a published environmental genome sample.
Keywords
biocomputing; fuzzy systems; pattern classification; pattern clustering; DNA signatures; acid mine drainage; fuzzy k-means classifier; metagenomic data set; oligonucleotide frequencies; shotgun sequence fragments; taxonomically group DNA fragments; Assembly; Bioinformatics; DNA; Data mining; Genomics; Humans; Microorganisms; Organisms; Sampling methods; Sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Information Processing Society, 2008. NAFIPS 2008. Annual Meeting of the North American
Conference_Location
New York City, NY
Print_ISBN
978-1-4244-2351-4
Electronic_ISBN
978-1-4244-2352-1
Type
conf
DOI
10.1109/NAFIPS.2008.4531252
Filename
4531252
Link To Document