Title of article

Adverse drug reaction prediction using voting ensemble training approach

Author/Authors

Besharatifard ، Milad Department of Mathematics and Computer Science - Computational Biology Research Center (CBRC) - Amirkabir University of Technology (Tehran Polytechnic) , Ghorbanali ، Zahra Department of Mathematics and Computer Science - Computational Biology Research Center (CBRC) - Amirkabir University of Technology (Tehran Polytechnic) , Zare Mirakabad ، Fatemeh Department of Mathematics and Computer Science - Computational Biology Research Center (CBRC) - Amirkabir University of Technology (Tehran Polytechnic)

From page

To page

Abstract

Identifying and controlling adverse drug reactions (ADRs) is a challenging problem in the pharmacological field. For instance, the drug Rosiglitazone has been associated with adverse reactions that were only recognized after its release. Due to such experiences, pharmacists are now more interested in using computational methods to predict ADRs. The performance of computational methods is contingent upon the defined dataset. In some studies, the known drug-adverse reaction associations are regarded as positive while the unknown drug-adverse reaction associations are regarded as negative data. This consequently creates an unbalanced dataset, which can lead to inaccurate predictions from models and cause the classifiers to be flawed. We propose a framework named Adverse Drug Reaction using the Voting Ensemble Training Approach (ADRP-VETA) for ADR problem to overcome unbalanced dataset challenges. We construct the similarity vector of each drug with other drugs based on chemical structure as a drug feature. Also, the similarity vector of each ADR with other ADRs is computed based on the Unified Medical Language System (UMLS) as adverse reaction feature. With this approach, we can leverage the similarity of the features to more accurately capture the intricate relationships between drugs and adverse reactions. We compare ADRP-VETA to three state-of-the-art models and find that it outperforms them, achieving an AUC-ROC of 91% and an AUC-PR of 89.8%. Furthermore, we assess ADRP-VETA’s ability to predict rare adverse reactions, and find that its AUC-ROC and AUC-PR are 83.3% and 92.2%, respectively. As a case study, we focus on the associations between liver-injury adverse reactions and three drugs.

Keywords

Adverse drug reaction , Machine learning , Random forest , Rare adverse reactions , Unbalanced dataset

Journal title

AUT Journal of Mathematics and Computing

Journal title

AUT Journal of Mathematics and Computing

Record number

2757829

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=2757829