DocumentCode :
3745802
Title :
An Improved K-Means Algorithm for DNA Sequence Clustering
Author :
Nasssima Aleb;Narimane Labidi
Author_Institution :
USTHB-FEI, Univ. &
fYear :
2015
Firstpage :
39
Lastpage :
42
Abstract :
In recent years, billions of DNA and protein sequences are subject to sequencing. However, few of them have known structures and functions, most remain unknown. The solution to this problem is to link sequences between them rather than revisit each new sequence independently of other sequences. Thus, if we manage to assimilate a sequence S1 to another sequence S2 or to a group of previously studied sequences, this will allow us to directly deduce the structure, functions and phylogenetic classification of S2. The purpose of this work is to adapt clustering methods to the specific problem of classification of DNA sequences. We introduce a new method based on K-means clustering for DNA sequences clustering. We begin by explaining and motivating our approach, then we present obtained results.
Keywords :
"DNA","Clustering algorithms","Databases","Bioinformatics","Proteins","Sequential analysis","Clustering methods"
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2015 26th International Workshop on
ISSN :
1529-4188
Print_ISBN :
978-1-4673-7581-8
Electronic_ISBN :
2378-3915
Type :
conf
DOI :
10.1109/DEXA.2015.27
Filename :
7406266
Link To Document :
بازگشت