Prediction of drug-target interaction based on protein features using undersampling and feature selection techniques with boosting
- PMID: 31734254
- DOI: 10.1016/j.ab.2019.113507
Abstract
Accurate identification of drug-target interactions (DTIs) is a crucial and challenging task in the drug discovery process, with enormous benefit to patients and pharmaceutical companies. Traditional wet-lab experiments for identifying DTIs are expensive, time-consuming, and labor-intensive. Many computational techniques have therefore been established for this purpose, although a huge number of interactions remain undiscovered. Here, we present pdti-EssB, a new computational model for identifying DTIs from protein sequence and drug molecular structure. More specifically, each drug molecule is encoded as a molecular substructure fingerprint. For each protein sequence, different descriptors are used to represent its evolutionary, sequence, and structural information. In addition, our proposed method uses data balancing techniques to handle the class imbalance problem and applies a novel feature eliminator to select the optimal features for accurate prediction. In this paper, four classes of DTI benchmark datasets are used to construct a predictive model with XGBoost. The auROC is used as the evaluation metric to compare the performance of pdti-EssB with recent methods under five-fold cross-validation. Finally, the experimental results indicate that our proposed method outperforms other approaches in predicting DTIs and proposes new drug-target interactions based on prediction probability scores. The pdti-EssB webserver is available online at http://pdtiessb-uestc.com/.
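The evaluation protocol described above (class balancing, then auROC under five-fold cross-validation) can be illustrated with a minimal standard-library sketch. It is not the pdti-EssB implementation: the abstract does not name the balancing algorithm, so plain random undersampling is assumed here, and the function names `random_undersample`, `auroc`, and `stratified_folds` are hypothetical. The classifier itself (XGBoost in the paper) is left out; it would be fitted on each balanced training fold and scored on the untouched test fold.

```python
import random


def random_undersample(X, y, seed=0):
    """Randomly drop majority-class rows until both classes are the same size.

    Assumption: labels are 0 (non-interacting) and 1 (interacting).
    """
    rng = random.Random(seed)
    pos = [i for i, label in enumerate(y) if label == 1]
    neg = [i for i, label in enumerate(y) if label == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    kept = sorted(minority + rng.sample(majority, len(minority)))
    return [X[i] for i in kept], [y[i] for i in kept]


def auroc(y_true, scores):
    """Area under the ROC curve via the rank-sum (Mann-Whitney U) identity.

    Tied scores receive the average of the ranks they span.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank of the tied group
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n_pos = sum(y_true)
    n_neg = len(y_true) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("auROC needs at least one sample of each class")
    rank_sum_pos = sum(r for r, label in zip(ranks, y_true) if label == 1)
    return (rank_sum_pos - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def stratified_folds(y, k=5, seed=0):
    """Yield (train_idx, test_idx) pairs that keep the class ratio per fold."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for label in sorted(set(y)):
        idx = [i for i, v in enumerate(y) if v == label]
        rng.shuffle(idx)
        for position, i in enumerate(idx):
            folds[position % k].append(i)
    for f in range(k):
        test = sorted(folds[f])
        train = sorted(i for g in range(k) if g != f for i in folds[g])
        yield train, test


if __name__ == "__main__":
    # Toy imbalanced data: 10 positives, 40 negatives, one dummy feature each.
    y = [1] * 10 + [0] * 40
    X = [[float(i)] for i in range(len(y))]
    for train_idx, test_idx in stratified_folds(y, k=5):
        # Balance the training fold only; the test fold keeps its real ratio.
        X_bal, y_bal = random_undersample(
            [X[i] for i in train_idx], [y[i] for i in train_idx]
        )
        print(len(train_idx), len(y_bal), sum(y_bal), len(test_idx))
```

Undersampling is applied inside each training fold rather than to the whole dataset, so the held-out fold keeps its original class ratio and the auROC estimate is not computed on artificially balanced data.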
Keywords: Data imbalance; Drug-target interaction; Feature extraction; Feature selection; Molecular substructure fingerprint; XGBoost classifier.
Copyright © 2019 Elsevier Inc. All rights reserved.