• DocumentCode
    1824909
  • Title

    Development and validation of an HIV risk scorecard model

  • Author

    Sibanda, Wilbert ; Pretorius, Philip

  • Author_Institution
    DST/NWU Preclinical Platform, North-West Univ., Potchefstroom, South Africa
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    916
  • Lastpage
    922
  • Abstract
    This research paper covers the development of an HIV risk scorecard using SAS Enterprise MinerTM. The HIV risk scorecard was developed using the 2007 South African annual antenatal HIV and syphilis seroprevalence data. Antenatal data contains various demographic characteristics for each pregnant woman, such as pregnant woman´s age, male sexual partner´s age, race, level of education, gravidity, parity, HIV and syphilis status. The purpose of this research was to use a scorecard to rank the effects of the demographic characteristics on influencing an individual´s risk of acquiring an HIV infection, not the probability of being sick. The project encompassed the selection of the data sample, classing, selection of demographic characteristics, fitting of a regression model, generation of weights-of-evidence (WOE), calculation of information values (IVs), creation and validation of an HIV risk scorecard. The educational level and syphilis status of the pregnant women produced information values below 0.05 and were rejected from inclusion in the final HIV risk scorecard. Based on their respective information values, the following four demographic characteristics of the pregnant women were found to be of medium predictive strength and thus included in the final HIV risk scorecard; age, age of male sexual partner, gravidity and parity. The age of the pregnant woman had the highest information value and Gini coefficient. The HIV risk scorecard showed that the risk of contracting an HIV infection increased gradually up to the age of 30 years for females and 34 years old for their male sexual partners. Thereafter, the risk decreased gradually towards the age of 45.
  • Keywords
    data mining; demography; diseases; education; medical computing; regression analysis; 2007 South African annual antenatal HIV data; HIV risk scorecard model; SAS Enterprise MinerTM; WOE generation; demographic characteristics; educational level; information value calculation; male sexual partner age; pregnant woman; race; regression model; syphilis seroprevalence data; weights-of-evidence generation; Conferences; Human immunodeficiency virus; Insurance; Pregnancy; Social network services; Synthetic aperture sonar; HIV; IV; WOE;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Social Networks Analysis and Mining (ASONAM), 2013 IEEE/ACM International Conference on
  • Conference_Location
    Niagara Falls, ON
  • Type

    conf

  • Filename
    6785809