• DocumentCode
    2193385
  • Title

    A new approach using geometric moments of distance matrix image for risk type prediction of human papillomaviruses

  • Author

    Xiao, Xuan ; Wang, Pu

  • Author_Institution
    Comput. Dept., Jing-De-Zhen Ceramic Inst., Jing-De-Zhen, China
  • fYear
    2011
  • fDate
    9-11 Sept. 2011
  • Firstpage
    52
  • Lastpage
    55
  • Abstract
    Abstract-High-risk types of human papillomaviruses (HPVs) cause cervical cancer,and the second most common tumor in women worldwide, and the HPV E6 protein is one of two viral oncoproteins that is expressed in virtually all HPV-positive cancers. Therefore, how can we identify whether it is a risk type of HPVs by means of E6 properties is very useful and necessary to the diagnosis and the remedy of cervical cancer. Using the pseudo amino acid (PseAA) composition to represent the sample of a protein can incorporate a considerable amount of sequence pattern information so as to improve the prediction quality for the classification of risk type. In this paper, based on the characters of hydrophobicity, hydrophilicity, side-chain mass, we present a novel approach-protein distance matrix image(DMI) to classify HPV risk types from E6 protein sequences.Based on the protein DMI , two geometric moments were extracted from each of the protein sequences concerned are adopted for its PseAA. It was demonstrated thru the jackknife cross-validation test that the overall success rate are 100%. The results showed that bioinformatics based on theoretical approaches can direct and simplify experimental studies.
  • Keywords
    bioinformatics; cancer; feature extraction; hydrophilicity; hydrophobicity; medical image processing; microorganisms; molecular biophysics; proteins; risk analysis; tumours; HPV E6 protein sequences; HPV-positive cancer; PseAA composition; bioinformatics; cervical cancer; cervical cancer diagnosis; geometric moments extraction; human papillomaviruses; hydrophilicity; hydrophobicity; jackknife cross-validation test; protein DMI; protein distance matrix image; pseudoamino acid composition; risk classification; sequence pattern information; side-chain mass; tumor; viral oncoprotein; Amino acids; Cervical cancer; Humans; Matrix converters; Protein sequence; Distance Matrix; E6 protein; Fuzzy K-nearest neighbor; HPV; High Risk; Low Risk;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electronics, Communications and Control (ICECC), 2011 International Conference on
  • Conference_Location
    Ningbo
  • Print_ISBN
    978-1-4577-0320-1
  • Type

    conf

  • DOI
    10.1109/ICECC.2011.6067633
  • Filename
    6067633