• Title of article

    Adverse drug reaction prediction using voting ensemble training approach

  • Author/Authors

    Besharatifard ، Milad Department of Mathematics and Computer Science - Computational Biology Research Center (CBRC) - Amirkabir University of Technology (Tehran Polytechnic) , Ghorbanali ، Zahra Department of Mathematics and Computer Science - Computational Biology Research Center (CBRC) - Amirkabir University of Technology (Tehran Polytechnic) , Zare Mirakabad ، Fatemeh Department of Mathematics and Computer Science - Computational Biology Research Center (CBRC) - Amirkabir University of Technology (Tehran Polytechnic)

  • From page
    45
  • To page
    60
  • Abstract
    Identifying and controlling adverse drug reactions (ADRs) is a challenging problem in the pharmacological field. For instance, the drug Rosiglitazone has been associated with adverse reactions that were only recognized after its release. Due to such experiences, pharmacists are now more interested in using computational methods to predict ADRs. The performance of computational methods is contingent upon the defined dataset. In some studies, the known drug-adverse reaction associations are regarded as positive while the unknown drug-adverse reaction associations are regarded as negative data. This consequently creates an unbalanced dataset, which can lead to inaccurate predictions from models and cause the classifiers to be flawed. We propose a framework named Adverse Drug Reaction using the Voting Ensemble Training Approach (ADRP-VETA) for ADR problem to overcome unbalanced dataset challenges. We construct the similarity vector of each drug with other drugs based on chemical structure as a drug feature. Also, the similarity vector of each ADR with other ADRs is computed based on the Unified Medical Language System (UMLS) as adverse reaction feature. With this approach, we can leverage the similarity of the features to more accurately capture the intricate relationships between drugs and adverse reactions. We compare ADRP-VETA to three state-of-the-art models and find that it outperforms them, achieving an AUC-ROC of 91% and an AUC-PR of 89.8%. Furthermore, we assess ADRP-VETA’s ability to predict rare adverse reactions, and find that its AUC-ROC and AUC-PR are 83.3% and 92.2%, respectively. As a case study, we focus on the associations between liver-injury adverse reactions and three drugs.
  • Keywords
    Adverse drug reaction , Machine learning , Random forest , Rare adverse reactions , Unbalanced dataset
  • Journal title
    AUT Journal of Mathematics and Computing
  • Journal title
    AUT Journal of Mathematics and Computing
  • Record number

    2757829