DocumentCode :
2737586
Title :
Learning Weight Assignment in Distance Function for Biological Sequence Feature Vector by Genetic Algorithm
Author :
Kuo, Huang-Cheng ; Tseng, Yu-Cheng ; Huang, Jen-Peng
Author_Institution :
Nat. Chiayi Univ., Chiayi
fYear :
2007
fDate :
5-7 Sept. 2007
Firstpage :
295
Lastpage :
295
Abstract :
Transforming sequences into numeric feature vectors is a promising method in bio-sequence similarity search. The process of transformation will lost some information of the origin sequence. In order to improve the accuracy, weight assignment can be used. Based on this notion, we proposed an adaptive weighting distance which is based on feature vector that contains three groups of features: count, RPD, and APD of a DNA sequence. In this paper, weighted LI distance is applied for computing the distance between two feature vectors. We compute the average of entropy of two sequences and then assign the weights to count, RPD and APD according to the weight table which is pointed by the variant of entropy of two sequences. The weight tables are then adjusted by genetic algorithms. Experiment shows that such adaptive weight distance mechanism helps reflect the distance between sequences.
Keywords :
biology computing; genetic algorithms; search problems; sequences; adaptive weighting distance; biological sequence feature vector; biosequence similarity search; entropy; genetic algorithm; weighted LI distance; Bioinformatics; Biology; Computer science; DNA; Dispersion; Entropy; Frequency; Genetic algorithms; Sequences; Spatial databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Innovative Computing, Information and Control, 2007. ICICIC '07. Second International Conference on
Conference_Location :
Kumamoto
Print_ISBN :
0-7695-2882-1
Type :
conf
DOI :
10.1109/ICICIC.2007.369
Filename :
4427940
Link To Document :
بازگشت