Title :
A novel method of recognizing short coding sequences of human genes
Author :
Luo, Jiawei ; Yang Li ; Guo, Jiachen
Author_Institution :
Sch. of Comput. & Commun., Hunan Univ., Changsha, China
Abstract :
In this paper, we present a novel feature representation of DNA sequences based on the graphical representation. Support vector machine (SVM) is applied to classify the coding/non-coding sequence in short human genes. In the process of identifying, we propose an improved self-similar map method to avoid the lack of negative samples sequence. According to the GC content we divide the dataset into several groups and identify these sequences respectively. Finally, the results show that the proposed method obtains a higher accuracy with fewer parameters.
Keywords :
DNA; biology computing; data visualisation; support vector machines; DNA sequences; graphical representation; human genes; improved self-similar map; short coding sequences; support vector machine; Bioinformatics; Encoding; Genomics; Informatics; Sensitivity; DNA; coding/non-coding sequence; gene recognition; graphical representation; support vector machine;
Conference_Titel :
Bio-Inspired Computing: Theories and Applications (BIC-TA), 2010 IEEE Fifth International Conference on
Conference_Location :
Changsha
Print_ISBN :
978-1-4244-6437-1
DOI :
10.1109/BICTA.2010.5645154