Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences
- PMID: 18390576
- PMCID: PMC2396404
- DOI: 10.1093/nar/gkn159
Abstract
Compared with the number of protein sequences available for different organisms, the number of known protein-protein interactions (PPIs) is still very limited, so many computational methods have been developed to help identify novel PPIs. Methods that use only protein sequence information are more widely applicable than those that depend on additional information or predictions about the proteins. In this article, a sequence-based method is proposed that combines a new feature representation based on auto covariance (AC) with a support vector machine (SVM). AC accounts for the interactions between residues a certain distance apart in the sequence, so the method takes neighbouring effects into account. Applied to PPI data from the yeast Saccharomyces cerevisiae, the method achieved very promising prediction results. When the prediction model was evaluated on an independent data set of 11,474 yeast PPIs, the prediction accuracy was 88.09%. The performance of this method is superior to that of existing sequence-based methods, so it can be a useful supplementary tool for future proteomics studies. The prediction software and all data sets used in this article are freely available at http://www.scucic.cn/Predict_PPI/index.htm.
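To make the idea of auto covariance concrete, the following is a minimal sketch of how a lag-based AC descriptor could be computed from a sequence. It is not the authors' implementation. The abstract does not specify which residue descriptors, how many lags, or how the two proteins of a pair are combined, so the sketch makes these assumptions: a single descriptor (the Kyte-Doolittle hydropathy scale), a small hypothetical `max_lag`, and concatenation of the two proteins' AC vectors. The function names are illustrative only.

```python
from typing import Dict, List

# Assumption: Kyte-Doolittle hydropathy is used as the only residue descriptor.
# The abstract does not name the descriptors used in the published method.
HYDROPATHY: Dict[str, float] = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def auto_covariance(values: List[float], max_lag: int) -> List[float]:
    """Return the auto covariance of `values` for lags 1..max_lag.

    For each lag, the mean-centred value at position i is multiplied by the
    mean-centred value at position i + lag, and the products are averaged
    over the n - lag available positions.
    """
    n = len(values)
    if max_lag >= n:
        raise ValueError("max_lag must be smaller than the sequence length")
    mean = sum(values) / n
    centered = [v - mean for v in values]
    return [
        sum(centered[i] * centered[i + lag] for i in range(n - lag)) / (n - lag)
        for lag in range(1, max_lag + 1)
    ]


def encode_protein(sequence: str, max_lag: int) -> List[float]:
    """Map a protein sequence to a fixed-length AC vector (one value per lag)."""
    values = [HYDROPATHY[residue] for residue in sequence.upper()]
    return auto_covariance(values, max_lag)


def encode_pair(seq_a: str, seq_b: str, max_lag: int = 3) -> List[float]:
    """Assumption: a protein pair is represented by concatenating both AC vectors."""
    return encode_protein(seq_a, max_lag) + encode_protein(seq_b, max_lag)


if __name__ == "__main__":
    # Two short made-up sequences, used only to show the shape of the output.
    features = encode_pair("MKTAYIAKQR", "GAVLIMKTSW", max_lag=3)
    print(len(features))
```

Because the vector length depends only on the number of lags and descriptors, sequences of different lengths map to features of the same size, which is what allows them to be passed to a standard classifier such as an SVM.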
Similar articles
- RVMAB: Using the Relevance Vector Machine Model Combined with Average Blocks to Predict the Interactions of Proteins from Protein Sequences. Int J Mol Sci. 2016 May 18;17(5):757. doi: 10.3390/ijms17050757. PMID: 27213337. Free PMC article.
- Prediction of protein-protein interactions from amino acid sequences using a novel multi-scale continuous and discontinuous feature set. BMC Bioinformatics. 2014;15 Suppl 15(Suppl 15):S9. doi: 10.1186/1471-2105-15-S15-S9. Epub 2014 Dec 3. PMID: 25474679. Free PMC article.
- Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding. BMC Bioinformatics. 2016 Apr 26;17(1):184. doi: 10.1186/s12859-016-1035-4. PMID: 27112932. Free PMC article.
- Application of Machine Learning Approaches for Protein-protein Interactions Prediction. Med Chem. 2017;13(6):506-514. doi: 10.2174/1573406413666170522150940. PMID: 28530547. Review.
- Recent advances in predicting and modeling protein-protein interactions. Trends Biochem Sci. 2023 Jun;48(6):527-538. doi: 10.1016/j.tibs.2023.03.003. Epub 2023 Apr 14. PMID: 37061423. Review.
Cited by
- Graph-based machine learning model for weight prediction in protein-protein networks. BMC Bioinformatics. 2024 Nov 8;25(1):349. doi: 10.1186/s12859-024-05973-6. PMID: 39511478. Free PMC article.
- Decoding Missense Variants by Incorporating Phase Separation via Machine Learning. Nat Commun. 2024 Sep 27;15(1):8279. doi: 10.1038/s41467-024-52580-3. PMID: 39333476. Free PMC article.
- Computational analysis of pathogen-host interactome for fast and low-risk in-silico drug repurposing in emerging viral threats like Mpox. Sci Rep. 2024 Aug 12;14(1):18736. doi: 10.1038/s41598-024-69617-8. PMID: 39134619. Free PMC article.
- PETA: evaluating the impact of protein transfer learning with sub-word tokenization on downstream applications. J Cheminform. 2024 Aug 2;16(1):92. doi: 10.1186/s13321-024-00884-3. PMID: 39095917. Free PMC article.
- MGPPI: multiscale graph neural networks for explainable protein-protein interaction prediction. Front Genet. 2024 Jul 15;15:1440448. doi: 10.3389/fgene.2024.1440448. eCollection 2024. PMID: 39076171. Free PMC article.