Complete combinatorial mutational enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity
- PMID: 38989563
- PMCID: PMC11237556
- DOI: 10.1002/pro.5109
Complete combinatorial mutational enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity
Abstract
Understanding how proteins evolve under selective pressure is a longstanding challenge. The immensity of the search space has limited efforts to systematically evaluate the impact of multiple simultaneous mutations, so mutations have typically been assessed individually. However, epistasis, or the way in which mutations interact, prevents accurate prediction of combinatorial mutations based on measurements of individual mutations. Here, we use artificial intelligence to define the entire functional sequence landscape of a protein binding site in silico, and we call this approach Complete Combinatorial Mutational Enumeration (CCME). By leveraging CCME, we are able to construct a comprehensive map of the evolutionary connectivity within this functional sequence landscape. As a proof of concept, we applied CCME to the ACE2 binding site of the SARS-CoV-2 spike protein receptor binding domain. We selected representative variants from across the functional sequence landscape for testing in the laboratory. We identified variants that retained functionality to bind ACE2 despite changing over 40% of evaluated residue positions, and the variants now escape binding and neutralization by monoclonal antibodies. This work represents a crucial initial stride toward achieving precise predictions of pathogen evolution, opening avenues for proactive mitigation.
Keywords: SARS‐CoV‐2; artificial intelligence; evolution; neutralizing antibody; protein binding; protein design; protein structure; structure prediction; virology.
© 2024 The Protein Society.
Conflict of interest statement
MSC, JTB, IM and CDB own stock in AI Proteins, Inc.
Update of
-
Complete Combinatorial Mutational Enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity.Res Sq [Preprint]. 2023 Sep 11:rs.3.rs-2248327. doi: 10.21203/rs.3.rs-2248327/v2. Res Sq. 2023. Update in: Protein Sci. 2024 Aug;33(8):e5109. doi: 10.1002/pro.5109 PMID: 36482980 Free PMC article. Updated. Preprint.
Similar articles
-
Complete Combinatorial Mutational Enumeration of a protein functional site enables sequence-landscape mapping and identifies highly-mutated variants that retain activity.Res Sq [Preprint]. 2023 Sep 11:rs.3.rs-2248327. doi: 10.21203/rs.3.rs-2248327/v2. Res Sq. 2023. Update in: Protein Sci. 2024 Aug;33(8):e5109. doi: 10.1002/pro.5109 PMID: 36482980 Free PMC article. Updated. Preprint.
-
Deep mutational learning predicts ACE2 binding and antibody escape to combinatorial mutations in the SARS-CoV-2 receptor-binding domain.Cell. 2022 Oct 13;185(21):4008-4022.e14. doi: 10.1016/j.cell.2022.08.024. Epub 2022 Aug 31. Cell. 2022. PMID: 36150393 Free PMC article.
-
V367F Mutation in SARS-CoV-2 Spike RBD Emerging during the Early Transmission Phase Enhances Viral Infectivity through Increased Human ACE2 Receptor Binding Affinity.J Virol. 2021 Jul 26;95(16):e0061721. doi: 10.1128/JVI.00617-21. Epub 2021 Jul 26. J Virol. 2021. PMID: 34105996 Free PMC article.
-
Mutations in the SARS-CoV-2 spike receptor binding domain and their delicate balance between ACE2 affinity and antibody evasion.Protein Cell. 2024 May 28;15(6):403-418. doi: 10.1093/procel/pwae007. Protein Cell. 2024. PMID: 38442025 Free PMC article. Review.
-
Structural basis of severe acute respiratory syndrome coronavirus 2 infection.Curr Opin HIV AIDS. 2021 Jan;16(1):74-81. doi: 10.1097/COH.0000000000000658. Curr Opin HIV AIDS. 2021. PMID: 33186231 Review.
References
MeSH terms
Substances
Supplementary concepts
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous