DocumentCode :
2193385
Title :
A new approach using geometric moments of distance matrix image for risk type prediction of human papillomaviruses
Author :
Xiao, Xuan ; Wang, Pu
Author_Institution :
Comput. Dept., Jing-De-Zhen Ceramic Inst., Jing-De-Zhen, China
fYear :
2011
fDate :
9-11 Sept. 2011
Firstpage :
52
Lastpage :
55
Abstract :
Abstract-High-risk types of human papillomaviruses (HPVs) cause cervical cancer,and the second most common tumor in women worldwide, and the HPV E6 protein is one of two viral oncoproteins that is expressed in virtually all HPV-positive cancers. Therefore, how can we identify whether it is a risk type of HPVs by means of E6 properties is very useful and necessary to the diagnosis and the remedy of cervical cancer. Using the pseudo amino acid (PseAA) composition to represent the sample of a protein can incorporate a considerable amount of sequence pattern information so as to improve the prediction quality for the classification of risk type. In this paper, based on the characters of hydrophobicity, hydrophilicity, side-chain mass, we present a novel approach-protein distance matrix image(DMI) to classify HPV risk types from E6 protein sequences.Based on the protein DMI , two geometric moments were extracted from each of the protein sequences concerned are adopted for its PseAA. It was demonstrated thru the jackknife cross-validation test that the overall success rate are 100%. The results showed that bioinformatics based on theoretical approaches can direct and simplify experimental studies.
Keywords :
bioinformatics; cancer; feature extraction; hydrophilicity; hydrophobicity; medical image processing; microorganisms; molecular biophysics; proteins; risk analysis; tumours; HPV E6 protein sequences; HPV-positive cancer; PseAA composition; bioinformatics; cervical cancer; cervical cancer diagnosis; geometric moments extraction; human papillomaviruses; hydrophilicity; hydrophobicity; jackknife cross-validation test; protein DMI; protein distance matrix image; pseudoamino acid composition; risk classification; sequence pattern information; side-chain mass; tumor; viral oncoprotein; Amino acids; Cervical cancer; Humans; Matrix converters; Protein sequence; Distance Matrix; E6 protein; Fuzzy K-nearest neighbor; HPV; High Risk; Low Risk;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronics, Communications and Control (ICECC), 2011 International Conference on
Conference_Location :
Ningbo
Print_ISBN :
978-1-4577-0320-1
Type :
conf
DOI :
10.1109/ICECC.2011.6067633
Filename :
6067633
Link To Document :
بازگشت