Improved protein structure prediction using potentials from deep learning
- PMID: 31942072
- DOI: 10.1038/s41586-019-1923-7
Abstract
Protein structure prediction can be used to determine the three-dimensional shape of a protein from its amino acid sequence [1]. This problem is of fundamental importance as the structure of a protein largely determines its function [2]; however, protein structures can be difficult to determine experimentally. Considerable progress has recently been made by leveraging genetic information. It is possible to infer which amino acid residues are in contact by analysing covariation in homologous sequences, which aids in the prediction of protein structures [3]. Here we show that we can train a neural network to make accurate predictions of the distances between pairs of residues, which convey more information about the structure than contact predictions. Using this information, we construct a potential of mean force [4] that can accurately describe the shape of a protein. We find that the resulting potential can be optimized by a simple gradient descent algorithm to generate structures without complex sampling procedures. The resulting system, named AlphaFold, achieves high accuracy, even for sequences with fewer homologous sequences. In the recent Critical Assessment of Protein Structure Prediction [5] (CASP13), a blind assessment of the state of the field, AlphaFold created high-accuracy structures (with template modelling (TM) scores [6] of 0.7 or higher) for 24 out of 43 free modelling domains, whereas the next best method, which used sampling and contact information, achieved such accuracy for only 14 out of 43 domains. AlphaFold represents a considerable advance in protein-structure prediction. We expect this increased accuracy to enable insights into the function and malfunction of proteins, especially in cases for which no structures for homologous proteins have been experimentally determined [7].
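The idea of optimizing a distance-based potential by gradient descent can be illustrated with a toy example. The sketch below is not the AlphaFold implementation: the abstract does not give the form of the potential, so a simple quadratic penalty on pairwise distances is assumed here, exact distances from a made-up helix-like chain stand in for network predictions, and all function names (`pairwise_distances`, `potential`, `gradient`, `descend`, `toy_chain`) are hypothetical.

```python
import numpy as np


def pairwise_distances(coords):
    """Return the (n, n) matrix of Euclidean distances between rows of coords."""
    diff = coords[:, None, :] - coords[None, :, :]
    # The small constant keeps the square root differentiable at zero distance.
    return np.sqrt((diff ** 2).sum(axis=-1) + 1e-12)


def potential(coords, target):
    """Assumed toy potential: sum over pairs i < j of (d_ij - target_ij)^2."""
    d = pairwise_distances(coords)
    iu = np.triu_indices(len(coords), k=1)
    return float(((d[iu] - target[iu]) ** 2).sum())


def gradient(coords, target):
    """Analytic gradient of `potential` with respect to every coordinate."""
    diff = coords[:, None, :] - coords[None, :, :]
    d = np.sqrt((diff ** 2).sum(axis=-1) + 1e-12)
    err = d - target
    np.fill_diagonal(err, 0.0)  # a point has no restraint with itself
    # dV/dx_i = sum_j 2 * (d_ij - target_ij) * (x_i - x_j) / d_ij
    return 2.0 * ((err / d)[:, :, None] * diff).sum(axis=1)


def descend(coords, target, n_steps=3000, lr=0.005):
    """Plain fixed-step gradient descent; returns a new coordinate array."""
    x = coords.copy()
    for _ in range(n_steps):
        x -= lr * gradient(x, target)
    return x


def toy_chain(n):
    """Hypothetical helix-like chain used only to produce target distances."""
    t = np.arange(n, dtype=float)
    return np.stack([np.cos(t), np.sin(t), 0.5 * t], axis=1)


if __name__ == "__main__":
    target = pairwise_distances(toy_chain(10))
    start = np.random.default_rng(0).normal(size=(10, 3))
    final = descend(start, target)
    print("potential before:", potential(start, target))
    print("potential after: ", potential(final, target))
```

Because pairwise distances are unchanged by rotation, translation and reflection, coordinates recovered this way can match a reference structure only up to those transformations.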
Comment in
- A watershed moment for protein structure prediction. Nature. 2020 Jan;577(7792):627-628. doi: 10.1038/d41586-019-03951-0. PMID: 31988401.
- Deep learning 3D structures. Nat Methods. 2020 Mar;17(3):249. doi: 10.1038/s41592-020-0779-y. PMID: 32132733.
Similar articles
- Applying and improving AlphaFold at CASP14. Proteins. 2021 Dec;89(12):1711-1721. doi: 10.1002/prot.26257. PMID: 34599769.
- Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13. Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25. PMID: 30985027.
- Deep-learning contact-map guided protein structure prediction in CASP13. Proteins. 2019 Dec;87(12):1149-1164. doi: 10.1002/prot.25792. Epub 2019 Aug 14. PMID: 31365149.
- A glance into the evolution of template-free protein structure prediction methodologies. Biochimie. 2020 Aug;175:85-92. doi: 10.1016/j.biochi.2020.04.026. Epub 2020 May 15. PMID: 32417458. Review.
- Protein Structure Prediction: Conventional and Deep Learning Perspectives. Protein J. 2021 Aug;40(4):522-544. doi: 10.1007/s10930-021-10003-y. Epub 2021 May 28. PMID: 34050498. Review.
Cited by
- Beyond AlphaFold2: The Impact of AI for the Further Improvement of Protein Structure Prediction. Methods Mol Biol. 2025;2867:121-139. doi: 10.1007/978-1-0716-4196-5_7. PMID: 39576578. Review.
- Importance of Secondary Structure Data in Large Scale Protein Modeling Using Low-Resolution SURPASS Method. Methods Mol Biol. 2025;2867:55-78. doi: 10.1007/978-1-0716-4196-5_4. PMID: 39576575.
- Improving Protein Secondary Structure Prediction by Deep Language Models and Transformer Networks. Methods Mol Biol. 2025;2867:43-53. doi: 10.1007/978-1-0716-4196-5_3. PMID: 39576574.
- Structure of the human TSC:WIPI3 lysosomal recruitment complex. Sci Adv. 2024 Nov 22;10(47):eadr5807. doi: 10.1126/sciadv.adr5807. Epub 2024 Nov 20. PMID: 39565846.
- ESM-scan-A tool to guide amino acid substitutions. Protein Sci. 2024 Dec;33(12):e5221. doi: 10.1002/pro.5221. PMID: 39565080.
References
- Dill, K. A. & MacCallum, J. L. The protein-folding problem, 50 years on. Science 338, 1042–1046 (2012).
- Schaarschmidt, J., Monastyrskyy, B., Kryshtafovych, A. & Bonvin, A. M. J. J. Assessment of contact predictions in CASP12: co-evolution and deep learning coming of age. Proteins 86, 51–66 (2018).
- Kirkwood, J. Statistical mechanics of fluid mixtures. J. Chem. Phys. 3, 300–313 (1935).
- Kryshtafovych, A., Schwede, T., Topf, M., Fidelis, K. & Moult, J. Critical assessment of methods of protein structure prediction (CASP)—Round XIII. Proteins 87, 1011–1020 (2019).