DocumentCode
2193385
Title
A new approach using geometric moments of distance matrix image for risk type prediction of human papillomaviruses
Author
Xiao, Xuan ; Wang, Pu
Author_Institution
Comput. Dept., Jing-De-Zhen Ceramic Inst., Jing-De-Zhen, China
fYear
2011
fDate
9-11 Sept. 2011
Firstpage
52
Lastpage
55
Abstract
Abstract-High-risk types of human papillomaviruses (HPVs) cause cervical cancer,and the second most common tumor in women worldwide, and the HPV E6 protein is one of two viral oncoproteins that is expressed in virtually all HPV-positive cancers. Therefore, how can we identify whether it is a risk type of HPVs by means of E6 properties is very useful and necessary to the diagnosis and the remedy of cervical cancer. Using the pseudo amino acid (PseAA) composition to represent the sample of a protein can incorporate a considerable amount of sequence pattern information so as to improve the prediction quality for the classification of risk type. In this paper, based on the characters of hydrophobicity, hydrophilicity, side-chain mass, we present a novel approach-protein distance matrix image(DMI) to classify HPV risk types from E6 protein sequences.Based on the protein DMI , two geometric moments were extracted from each of the protein sequences concerned are adopted for its PseAA. It was demonstrated thru the jackknife cross-validation test that the overall success rate are 100%. The results showed that bioinformatics based on theoretical approaches can direct and simplify experimental studies.
Keywords
bioinformatics; cancer; feature extraction; hydrophilicity; hydrophobicity; medical image processing; microorganisms; molecular biophysics; proteins; risk analysis; tumours; HPV E6 protein sequences; HPV-positive cancer; PseAA composition; bioinformatics; cervical cancer; cervical cancer diagnosis; geometric moments extraction; human papillomaviruses; hydrophilicity; hydrophobicity; jackknife cross-validation test; protein DMI; protein distance matrix image; pseudoamino acid composition; risk classification; sequence pattern information; side-chain mass; tumor; viral oncoprotein; Amino acids; Cervical cancer; Humans; Matrix converters; Protein sequence; Distance Matrix; E6 protein; Fuzzy K-nearest neighbor; HPV; High Risk; Low Risk;
fLanguage
English
Publisher
ieee
Conference_Titel
Electronics, Communications and Control (ICECC), 2011 International Conference on
Conference_Location
Ningbo
Print_ISBN
978-1-4577-0320-1
Type
conf
DOI
10.1109/ICECC.2011.6067633
Filename
6067633
Link To Document