Prediction of protein-protein interactions using chaos game representation and wavelet transform via the random forest algorithm
- PMID: 26436504
- DOI: 10.4238/2015.October.2.13
Prediction of protein-protein interactions using chaos game representation and wavelet transform via the random forest algorithm
Abstract
Studying the network of protein-protein interactions (PPIs) will provide valuable insights into the inner workings of cells. It is vitally important to develop an automated, high-throughput tool that efficiently predicts protein-protein interactions. This study proposes a new model for PPI prediction based on the concept of chaos game representation and the wavelet transform, which means that a considerable amount of sequence-order effects can be incorporated into a set of discrete numbers. The advantage of using chaos game representation and the wavelet transform to formulate the protein sequence is that it can more effectively reflect its overall sequence-order characteristics than the conventional correlation factors. Using such a formulation frame to represent the protein sequences means that the random forest algorithm can be used to conduct the prediction. The results for a large-scale independent test dataset show that the proposed model can achieve an excellent performance with an accuracy value of about 0.86 and a geometry mean value of about 0.85. The model is therefore a useful supplementary tool for PPI predictions. The predictor used in this article is freely available at http://www.jci-bioinfo.cn/PPI.
Similar articles
-
Prediction of Protein-Protein Interactions with Physicochemical Descriptors and Wavelet Transform via Random Forests.J Lab Autom. 2016 Jun;21(3):368-77. doi: 10.1177/2211068215581487. Epub 2015 Apr 16. J Lab Autom. 2016. PMID: 25882187
-
iPPI-PseAAC(CGR): Identify protein-protein interactions by incorporating chaos game representation into PseAAC.J Theor Biol. 2019 Jan 7;460:195-203. doi: 10.1016/j.jtbi.2018.10.021. Epub 2018 Oct 9. J Theor Biol. 2019. PMID: 30312687
-
Improved protein-protein interactions prediction via weighted sparse representation model combining continuous wavelet descriptor and PseAA composition.BMC Syst Biol. 2016 Dec 23;10(Suppl 4):120. doi: 10.1186/s12918-016-0360-6. BMC Syst Biol. 2016. PMID: 28155718 Free PMC article.
-
Prediction of Protein-Protein Interactions by Evidence Combining Methods.Int J Mol Sci. 2016 Nov 22;17(11):1946. doi: 10.3390/ijms17111946. Int J Mol Sci. 2016. PMID: 27879651 Free PMC article. Review.
-
Sequence analysis by iterated maps, a review.Brief Bioinform. 2014 May;15(3):369-75. doi: 10.1093/bib/bbt072. Epub 2013 Oct 25. Brief Bioinform. 2014. PMID: 24162172 Free PMC article. Review.
Cited by
-
Protein-protein interaction and non-interaction predictions using gene sequence natural vector.Commun Biol. 2022 Jul 2;5(1):652. doi: 10.1038/s42003-022-03617-0. Commun Biol. 2022. PMID: 35780196 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Molecular Biology Databases