PhAST: Physics-Aware, Scalable, and Task-specific GNNs for Accelerated Catalyst Design

Duval, Alexandre; Schmidt, Victor; Miret, Santiago; Bengio, Yoshua; Hernández-García, Alex; Rolnick, David

Computer Science > Machine Learning

arXiv:2211.12020 (cs)

[Submitted on 22 Nov 2022 (v1), last revised 11 Mar 2024 (this version, v4)]

Title:PhAST: Physics-Aware, Scalable, and Task-specific GNNs for Accelerated Catalyst Design

Authors:Alexandre Duval, Victor Schmidt, Santiago Miret, Yoshua Bengio, Alex Hernández-García, David Rolnick

View PDF HTML (experimental)

Abstract:Mitigating the climate crisis requires a rapid transition towards lower-carbon energy. Catalyst materials play a crucial role in the electrochemical reactions involved in numerous industrial processes key to this transition, such as renewable energy storage and electrofuel synthesis. To reduce the energy spent on such activities, we must quickly discover more efficient catalysts to drive electrochemical reactions. Machine learning (ML) holds the potential to efficiently model materials properties from large amounts of data, accelerating electrocatalyst design. The Open Catalyst Project OC20 dataset was constructed to that end. However, ML models trained on OC20 are still neither scalable nor accurate enough for practical applications. In this paper, we propose task-specific innovations applicable to most architectures, enhancing both computational efficiency and accuracy. This includes improvements in (1) the graph creation step, (2) atom representations, (3) the energy prediction head, and (4) the force prediction head. We describe these contributions, referred to as PhAST, and evaluate them thoroughly on multiple architectures. Overall, PhAST improves energy MAE by 4 to 42$\%$ while dividing compute time by 3 to 8$\times$ depending on the targeted task/model. PhAST also enables CPU training, leading to 40$\times$ speedups in highly parallelized settings. Python package: \url{this https URL}.

Comments:	Journal of Machine Learning Research (JMLR)
Subjects:	Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
Cite as:	arXiv:2211.12020 [cs.LG]
	(or arXiv:2211.12020v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2211.12020

Submission history

From: Alexandre Duval [view email]
[v1] Tue, 22 Nov 2022 05:24:30 UTC (155 KB)
[v2] Tue, 20 Jun 2023 15:53:49 UTC (601 KB)
[v3] Thu, 22 Jun 2023 10:34:42 UTC (601 KB)
[v4] Mon, 11 Mar 2024 15:50:55 UTC (603 KB)

Computer Science > Machine Learning

Title:PhAST: Physics-Aware, Scalable, and Task-specific GNNs for Accelerated Catalyst Design

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:PhAST: Physics-Aware, Scalable, and Task-specific GNNs for Accelerated Catalyst Design

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators