Message Passing Neural Networks for Predicting 1H and 19F Chemical Shifts

Jones, Adam, Ponte, Santiago, Iglesias, Isaac, Tonge, Nicola, Williamson, David, Martos, Vera, Orth, Till, Schoenberger, Torsten, Cobas, Carlos, Huber, Katharina and Kemsley, E. Kate (2025) Message Passing Neural Networks for Predicting 1H and 19F Chemical Shifts. In: Experimental Nuclear Magnetic Resonance Conference 2025, 2025-04-06 - 2025-04-10, Asilomar Conference Center, Pacific Grove, California.

[thumbnail of ENC_Poster-MPNN-final-KK1.pdf]
Preview
PDF (ENC_Poster-MPNN-final-KK1.pdf) - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB) | Preview

Abstract

We are using advanced artificial intelligence techniques to improve NMR chemical shift prediction, spectral assignment, and automated structural validation in NMR spectroscopy. Graph neural networks (GNNs) are well-suited for molecular representation, treating atoms as nodes and bonds as edges, making them the leading framework for predictive modelling. Their primary strength lies in their simultaneous use of both node feature information and the atom connectivities encoded in the graph adjacency matrix. A specialised class of GNN, the Message Passing Neural Network (MPNN), defines a receptive field around each atom, enabling learning at the molecular substructure level. This is particularly valuable for modelling chemical shifts, which are highly dependent on the local electronic environment of the active nuclei. We are training ensembles of MPNNs from large collections of molecular structures (>10,000s) annotated with experimentally observed chemical shifts. Each structure in the training set is represented by the adjacency matrix of its underlying graph and a set of atomic features, which include stereo bond configurations and chiral centre information derived from three-dimensional graph embeddings. Outcomes from models trained on proton (1H) and fluorine (19F) chemical shifts are reported here. The median absolute prediction errors from unseen test data are approximately 0.08 ppm for ¹H and 2.2 ppm for ¹⁹F. In both cases, the error distributions are smooth, symmetric about zero, and can be accurately modelled using a Gaussian kernel density estimation. This approach, in turn, enables a data-driven, probabilistic method for spectral assignment and structural verification.

Item Type: Conference or Workshop Item (Poster)
Faculty \ School: Faculty of Science > School of Chemistry, Pharmacy and Pharmacology
Faculty of Science > School of Computing Sciences
UEA Research Groups: Faculty of Science > Research Groups > Computational Biology
Depositing User: LivePure Connector
Date Deposited: 04 Apr 2025 15:30
Last Modified: 04 Apr 2025 15:30
URI: https://ueaeprints.uea.ac.uk/id/eprint/98967
DOI:

Downloads

Downloads per month over past year

Actions (login required)

View Item View Item