Towards autonomous analysis of chemical exchange saturation transfer experiments using deep neural networks

Gogulan Karunanithy¹,
Tairan Yuwen²,
Lewis E. Kay^3,4,5,6 &
…
D. Flemming Hansen ORCID: orcid.org/0000-0003-0891-220X¹

3907 Accesses
6 Citations
11 Altmetric
Explore all metrics

Abstract

Macromolecules often exchange between functional states on timescales that can be accessed with NMR spectroscopy and many NMR tools have been developed to characterise the kinetics and thermodynamics of the exchange processes, as well as the structure of the conformers that are involved. However, analysis of the NMR data that report on exchanging macromolecules often hinges on complex least-squares fitting procedures as well as human experience and intuition, which, in some cases, limits the widespread use of the methods. The applications of deep neural networks (DNNs) and artificial intelligence have increased significantly in the sciences, and recently, specifically, within the field of biomolecular NMR, where DNNs are now available for tasks such as the reconstruction of sparsely sampled spectra, peak picking, and virtual decoupling. Here we present a DNN for the analysis of chemical exchange saturation transfer (CEST) data reporting on two- or three-site chemical exchange involving sparse state lifetimes of between approximately 3–60 ms, the range most frequently observed via experiment. The work presented here focuses on the ¹H CEST class of methods that are further complicated, in relation to applications to other nuclei, by anti-phase features. The developed DNNs accurately predict the chemical shifts of nuclei in the exchanging species directly from anti-phase ¹H^N CEST profiles, along with an uncertainty associated with the predictions. The performance of the DNN was quantitatively assessed using both synthetic and experimental anti-phase CEST profiles. The assessments show that the DNN accurately determines chemical shifts and their associated uncertainties. The DNNs developed here do not contain any parameters for the end-user to adjust and the method therefore allows for autonomous analysis of complex NMR data that report on conformational exchange.

Introduction

Many functional aspects of a macromolecule can be understood from its time-averaged three-dimensional structure. However, often the functionality of these molecules depends on their ability to exchange between different conformational states. Thus, quantifying the interconversion between these states is an important first step towards understanding how these biomolecules work (Yang et al. 2003; Karplus and Kuriyan 2005; Boehr et al. 2006; Henzler-Wildman and Kern 2007; Faust et al. 2020; Xie et al. 2020; Wurm et al. 2021). When conformational exchange is present, there is often one major populated state, the ground state, and a set of transiently low-populated states that, despite their low populations and short lifetimes, often play crucial roles for function. Several NMR techniques are now available to characterise reaction dynamics and transiently populated states at atomic resolution, including, chemical exchange saturation transfer (CEST) (Ward et al. 2000; Zhou and Zijl 2006; Vallurupalli et al. 2012), dark-state exchange saturation transfer (DEST) (Bertini et al. 1999; Hansen and Led 2006; Fawzi et al. 2011), Carr-Purcell-Meiboom-Gill (CPMG) (Meiboom and Gill 1958; Loria et al. 1999; Tollinger et al. 2001) relaxation dispersion, and relaxation in the rotating frame (R_1ρ, R_2ρ) (Palmer and Massi 2006; Hansen et al. 2009; Chao and Byrd 2016). CEST-based methods, which report on conformational exchange involving sparse states with lifetimes ranging from approximately 3–60 ms, have expanded tremendously over the last decade and have provided invaluable insights into the function of macromolecules (Vallurupalli et al. 2017). However, although several tools are available for the analysis of NMR data reporting on conformational exchange, challenges do exist, particularly when the exchange deviates from a simple two-state model (Neudecker et al. 2006). For ¹H CEST methods reporting on the exchange of amide-protons (Yuwen et al. 2017a) and methyl-protons (Yuwen et al. 2017b) analyses are further complicated by anti-phase features caused by the requirement to eliminate ¹H-¹H cross-relaxation effects, leading to broad lineshapes, with resolution significantly more limited than for ‘typical’ CEST profiles comprised of absorptive-like dips.

Deep learning and deep neural networks (DNNs) have led to huge advances in many fields of science, including computer vision and natural language processing, and the methodology is now a crucial component of many everyday technologies (LeCun et al. 2015). In supervised deep learning, DNNs are trained to map an input to a desired output, and once trained, these networks can perform analyses autonomously. Deep learning is particularly successful at extracting features in complex data (Goodfellow et al. 2016). It has been used for several years within the field of clinical magnetic resonance imaging (MRI) and some of the tools have already been approved by the FDA (Chaudhari et al. 2021) for image enhancement and classification. Within biomolecular NMR there has been a surge in applications of DNNs over the last couple of years, and networks are now available for the reconstruction of sparsely sampled spectra (Hansen 2019; Luo et al. 2020; Qu et al. 2020; Karunanithy and Hansen 2021), peak picking (Klukowski et al. 2018), estimating initial fitting parameters (Beckwith et al. 2021), and virtual decoupling (Karunanithy et al. 2021).

A key hurdle with many machine learning applications is that training robust models requires a large amount of curated training data. The in-depth understanding of the theory behind biomolecular NMR and the ability to simulate even complex NMR experiments means that the required amount of realistic training data can be generated synthetically. Importantly, it has now become clear that DNNs trained on fully synthetic data show robust performance on experimental data (Hansen 2019; Karunanithy and Hansen 2021; Karunanithy et al. 2021), which allows for sophisticated DNNs to be developed for the transformation and analysis of NMR spectra.

Overall, there is enormous potential for the development of deep learning approaches for the general analysis of NMR data and in particular for experiments reporting on conformational exchange. Below we have designed and trained DNNs to extract chemical shifts from the notably complex amide-proton anti-phase CEST experiment. The DNNs were trained solely on synthetically generated CEST profiles and are able to extract accurate chemical shifts of exchanging species as well as their uncertainties, thereby demonstrating that NMR data reporting on conformational exchange can be analysed autonomously using deep neural networks.

Methods

Deep neural network architectures

Figure S2 shows the architecture for the DNN used to transform time-domain anti-phase CEST profiles into time-domain in-phase CEST profiles, DNN_TR. This architecture is built from two modules, a module akin to a block in the FID-Net architecture (Karunanithy and Hansen 2021) and a modified LSTM module (Hansen 2019). The reason for this choice was that the main objective for the DNN is to ‘decouple’ anti-phase CEST profiles, which we have recently shown can be accomplished by the FID-Net architecture (Karunanithy and Hansen 2021). The python code for generating the model architecture in Tensorflow/Keras is provided in Supporting Material and can be downloaded from GitHub. The input to the DNN consists of two vectors of size 2 × 65 = 130. The first vector, cest_AP(t) = c₀ holds the zero-filled real Fourier transform (real and imaginary components) of the antiphase CEST profile and the second vector holds the time-points associated with the first vector, t₀. The output of the network is the in-phase CEST profile, sampled at 128 offsets. The network contained 3,782,423 trainable parameters.

The second DNN, DNN_CS, used to determine chemical shifts and their confidences was built using a densely connected convolutional neural network architecture (Huang et al. 2016), Fig. S4. The input for the network is the output from the first transformation described above, that is, frequency domain data describing the in-phase CEST profile, cest_IP(ω), a vector of 128 real points. In its current form, the network detects a maximum of three chemical shifts as well as their confidences and the output of the network is therefore a 3 × 2 tensor, whose elements comprise three chemical shifts and their confidence values. Overall, the network has 1,591,526 trainable parameters. The python code for generating the model in Tensorflow/Keras is provided in Supporting Material and can also be downloaded from GitHub.

Training the deep neural networks

The first DNN, DNN_TR, was trained on 15 × 10⁶ anti-phase CEST profiles over 1500 epochs, where the range of training data is detailed in Table 1. An epoch refers to a single cycle of training of the neural network with training data. The training data was generated on-the-fly using code written in python and using functions from the Tensorflow and numpy libraries. To obtain smooth simulated CEST profiles, similar to those generated by experiment, previous simulations have used a distribution of B₁ fields or other dephasing methods (Vallurupalli et al. 2012). Here the dephasing was achieved by only retaining the eigenvectors of the Liouvillian corresponding to real eigenvalues in the propagator. Thus, if L is the matrix describing the Liouvillian, under which the spin-system evolves during the CEST period, then the eigenvalues and eigenvectors of L are initially found: L Λ = Λ D, where Λ is a matrix of eigenvectors and D is a diagonal matrix of eigenvalues. The submatrix of D that holds the real eigenvalues is denoted D_re and the matrix holding the eigenvectors corresponding to the real eigenvalues is denoted Λ_re. Propagation of the spin-system is carried out with the propagator, Λ_re exp(− T_exD_re) Λ⁻¹_re. As an example, for a simple Liouvillian, L, represented by a 3 × 3 matrix in the basis set of the three product operators, I_x, I_y, and I_z there is typically only one real eigenvalue. After an eigendecomposition of L, the matrix holding the eigenvectors, Λ, and the diagonal matrix holding the eigenvalues, D, are 3 × 3 matrices. The submatrix Λ_re has dimensions 3 × 1, D_re, is a 1 × 1 matrix, and Λ⁻¹_re is a 1 × 3 matrix. Thus, Λ_re D_re Λ_re⁻¹ produces a 3 × 3 matrix that is the projection of the original Liouvillian onto the space spanned by the real eigensystem and Λ_re exp(− T_exD_re) Λ_re⁻¹ is the propagator corresponding only to the real eigensystem. For the code written with the Tensorflow library functions, where sizes of matrices should remain constant, the dephasing is achieved by multiplying any eigenvalue that has an imaginary part larger than 10^–3 by 10⁹, which means that evolutions caused by non-real eigenvalues are eliminated within nanoseconds.

Table 1 Parameters used to generate training data

Full size table

The anti-phase CEST profiles were then obtained by propagating the Liouvillian over the first INEPT and the CEST element in the anti-phase ¹H^N pulse sequence. For each anti-phase CEST profile an in-phase CEST profile was also generated by setting ¹J_HN = 0 Hz and integrating the Liouvillian over the CEST element (Vallurupalli et al. 2012). The stochastic ADAM (Kingma and Ba 2014) optimiser was employed with standard parameters and an adaptive learning rate calculated as $0.0004 \times \left( {L_{{{\text{freq}}}} + L_{{{\text{uncer}}}} } \right)^{3/4}$ (final learning rate of 10^–6). A batch size of 256 was used throughout the training and random gaussian noise was added with a standard deviation of 0.01 of the maximum value of each anti-phase CEST profile.

After training the DNN_TR network the DNN_CS network was trained. The input data for training the DNN_CS network was obtained from output of the trained DNN_TR network. Random gaussian noise with a standard deviation between 0.001 and 0.04 of the maximum value of each anti-phase CEST profile was added to anti-phase CEST profiles before these were transformed with the DNN_TR network. A total of 1.5 × 10⁷ CEST profiles were used for training, which was done over 110 epochs, with a batch size of 128. The stochastic ADAM (Kingma and Ba 2014) optimiser with standard parameters and a learning rate of 3.3 × 10^–4 was used.

Initial training was carried out using a desktop computer (Intel Core I7-6900 K, 3.2 GHz, 64 GB RAM), equipped with an NVIDIA GeForce GTX 1080 TI GPU graphics card and subsequent training carried out using the CAMP cluster (NVIDIA Tesla V100 GPU). Although the training of the two DNNs has benefitted from access to nodes with GPUs, using the trained DNNs to transform new (experimental) data does not require high-end computational nodes or GPUs. As an example, the full set of ca. 140 ¹H anti-phase CEST profiles from L99A T4 Lysozyme, Fig. 5, can be transformed with both DNN_TR and DNN_CS in less than 2 min on a standard laptop using only the CPU (Intel i7-6700 CPU).

Experimental amide-proton CEST data

A 1.5 mM U-[¹⁵N, ²H] L99A T4L sample produced as described previously (Bouvignies et al. 2011) and dissolved in 50 mM sodium phosphate, 25 mM NaCl, 2 mM EDTA, 2 mM NaN₃, pH 5.5, 90%H₂O/10%D₂O was used to record the anti-phase ¹H^N CEST experiments. L99A T4L anti-phase ¹H^N CEST experiments were performed as described previously (Yuwen et al. 2017a). Briefly, the experiments were measured on a 800 MHz Bruker spectrometer equipped with an x, y, z-gradient cryogenically cooled probe. ¹H^N-CEST measurements were performed with a B₁ field of 30.5 Hz at 282 K using a CEST delay of T_ex = 400 ms. A range of ¹H offsets on a regular grid from 6.5 to 9.5 ppm was used, with step sizes of 30 Hz. An additional reference 2D dataset was obtained by setting the B₁ offset to − 12 kHz.

A 1.35 mM sample of [U-¹⁵N,²H; Ileδ₁-¹³CHD₂; Leu, Val-¹³CHD₂/¹³CHD₂; Met-¹³CHD₂] G48A Fyn SH3 domain was prepared as described previously (Yuwen et al. 2017a). The sample was dissolved in 50 mM sodium phosphate, 0.2 mM EDTA, 0.05% NaN₃, pH 7.0, 90% H₂O/10% D₂O. ¹H^N CEST experiments were measured for the G48A Fyn SH3 domain using a 600 MHz Bruker spectrometer at 285 K (x, y, z-gradient cryogenically cooled probe). The ¹H^N CEST datasets were recorded as described previously (Yuwen et al. 2017a); specifically, a pair of datasets was recorded using B₁ fields of 26.7 Hz and 42.0 Hz. A CEST delay of T_ex = 400 ms was used and B₁ offsets between 5.5 and 10.5 ppm with step sizes of 25 Hz (B₁ = 26.7 Hz) or 40 Hz (B₁ = 42 Hz) were recorded. In addition, a 2D reference dataset was obtained with a B₁ offset of − 12 kHz that is equivalent to setting B₁ = 0 Hz.

Results and discussion

Chemical exchange saturation transfer profiles are normally visualised and analysed as, I(ω_offset)/I₀, where I(ω_offset) is the intensity observed for a given site when a weak radio-frequency pulse (B₁) is applied at a frequency of ω_offset, and I₀ is the corresponding intensity with no B₁ pulse applied. A feature of standard CEST profiles is that they resemble inverted one-dimensional NMR spectra, where the ‘dips’ are centered at the chemical shifts of the exchanging species. Thus, the related CEST profile, max(I/I₀) − I/I₀, resembles a simple NMR spectrum and its real Fourier transform therefore resembles an FID. Analysis of the CEST profiles with DNNs shown below first involved transformation of the data into the time domain, through a real Fourier transform, Fig. 1A and B. It should be noted that for a real Fourier transform, or equivalently a discrete Fourier transform of pure real data (N data points), the output is Hermitian-symmetric and approximately half [N/2 − 1 for even N and (N − 1)/2 for odd N] of the points are therefore redundant, see Supporting Material and Fig. S1.

To show the strength of the developed DNNs for the analysis of CEST data, we consider the amide-proton anti-phase CEST (Yuwen et al. 2017a), whose profiles are complicated relative to those generated by other CEST experiments since the ‘dips’ are anti-phase in nature (i.e., multiplet components from the scalar coupling between one-bond ¹H-X spins are of opposite phase). These CEST profiles are challenging to analyse primarily because the chemical shifts may not be easily accessible directly from the profiles. To facilitate the analysis of amide-proton CEST profiles the overall process is divided into two tasks, each with their own optimal DNN. The first DNN, DNN_TR, transforms each anti-phase CEST profile into a ‘classical’ profile, where the doublet nature of the dips are eliminated, thereby improving resolution, and also upsamples the profile to a fixed number of points in the CEST dimension. The second DNN, DNN_CS, then determines the ¹H chemical shifts for each of the exchanging species and an associated confidence in the shift values.

A deep neural network for the transformation of amide-proton CEST profiles

It was recently shown how each of the hidden layers of a simple DNN can be mapped to specific mathematical transformations (Amey et al. 2021). Such an approach is naturally highly attractive in order to design DNNs for new challenges and to understand their strengths and weaknesses. However, with the large size of recent networks developed to analyse and transform NMR data, our focus here is on employing architectures that have been shown recently to work well for related tasks. We have previously developed DNNs using the FID-Net architecture (Karunanithy and Hansen 2021) to decouple and analyse NMR spectra (Karunanithy and Hansen 2021; Karunanithy et al. 2021) by using FIDs as input. Since amide-proton anti-phase CEST profiles resemble anti-phase one-dimensional NMR spectra, our rationale was that a DNN similar to FID-Net can be trained to transform anti-phase CEST profiles into ‘decoupled’ standard CEST profiles. Thus, the DNN_TR architecture used was built of two modules, a module akin to a block in the FID-Net architecture (Karunanithy and Hansen 2021) and a modified long short-term memory (LSTM) module (Hansen 2019). The architecture is described in detail in Supporting Material, Fig. S2, where the python code for generating the model in Tensorflow/Keras (Chollet 2015; Abadi et al. 2016) is also provided. The theory for spin-evolution during CEST experiments is well-established (Helgstrand et al. 2000; Hansen et al. 2008; Vallurupalli et al. 2012), and synthetic training data can therefore easily be generated by propagating the Liouvillian over the desired element.

The first DNN, referred to as DNN_TR, was trained to transform an input amide-proton anti-phase CEST profile to the hypothetical CEST profile of an isolated ¹H spin, with ¹J_HN = 0 Hz, Fig. 1. Thus, DNN_TR decouples the anti-phase amide proton CEST profile and upsamples it to 128 points. The upsampling to a constant size, in this case 128 real points, makes the prediction of chemical shifts with a second DNN feasible, since DNNs are typically trained with a constant size of the input and output data (see below). A maximum of three exchanging states was assumed and only the forked three-site exchange model was used to generate the data, that is, E₁ ⇌ G ⇌ E₂, where E₁ and E₂ are sparsely populated states. For 75% of the training data the population of E₂ was set to zero. Because of the strong correlation between CEST data reporting on different three-site exchange models, for example, E₁ ⇌ G ⇌ E₂ versus G ⇌ E₁ ⇌ E₂, it is anticipated that DNN_TR will robustly transform anti-phase CEST profiles derived from any three-site exchange process. Briefly, DNN_TR was trained on 15 × 10⁶ CEST profiles, where the range of training data is indicated in Table 1. The loss function was calculated from the mean-squared-error between the transformed in-phase CEST profile and the target function, see Fig. 1D. The network was trained to a normalised mean-squared-error (MSE) of 4 × 10^–4 and a mean-absolute-error (MAE) of 0.01.

The trained DNN_TR network was evaluated separately on synthetic data for two- and three-site exchanging systems. Figure 2 shows the evaluation on 100,000 randomly generated CEST profiles for two- (Fig. 2A) and three-site (Fig. 2B) exchanging systems. Figure S3 shows the performance of the DNN transformation as a function of the strength of the weak field, B₁, the population of the sparse state E, p_E, the overall exchange rate, k_ex (k_ex = k_GE + k_EG, for two-site interconversion) and the number of sampled offsets. The transformation of profiles from anti-phase to in-phase by the DNN_TR network is robust and there is only limited variation in the performance with different parameters used to generate the CEST profiles. Of particular interest is that the transformation is only minimally affected by the number of points sampled in the input profile, Fig. S3D, suggesting that the upsampling is robust.

Having evaluated the DNN_TR network on synthetic data it is important to assess how the DNN performs on experimental anti-phase ¹H^N CEST profiles. Figure 3 shows two examples, where ¹H^N anti-phase CEST profiles for the L99A mutant of T4 lysozyme recorded at 18.8 T have been transformed to in-phase CEST profiles (with the scalar coupling removed). This representation immediately allows estimation of the chemical shifts of ¹H nuclei of the exchanging states, which can be used as initial parameters for a least-squares analysis. However, these experimental CEST profiles are associated with uncertainty and since the ground truth (exact value) is not known a detailed evaluation of the performance is not directly possible.

Determining ¹H chemical shifts in exchanging states using a deep neural network

With the in-phase CEST profiles available it becomes substantially easier to estimate the chemical shifts of the exchanging species. DNNs are particularly adept at locating specific features in data, for example, localising particular elements in an image. Thus, it is expected that a DNN could be trained to determine the position of peaks in one-dimensional NMR spectra and, consequently, trained to determine the chemical shifts of the exchanging species from in-phase CEST profiles or the related profiles, max(I/I₀) − I/I₀. The densely connected convolutional neural network architecture (Huang et al. 2016), which was originally developed for object recognition tasks, was adapted here, Fig. S4, to determine the chemical shifts from CEST profiles. Moreover, our goal was not only to determine the chemical shifts of the interconverting conformers, but to also train the DNN to estimate the uncertainties with which it determined these shifts, thereby providing an output similar to a traditional least-squares fitting procedure.

The output from a DNN is typically a fixed length and a decision about the maximum number of exchanging states therefore has to be made before training the network. Since the time for training the DNN increases rapidly when increasing the maximum number of exchanging states, we chose for this application to only focus on CEST profiles reporting on three or less states, which covers most of the CEST-based studies reported to date. For a maximum of three exchanging states the output from the DNN_CS network is a 3 × 2 matrix whose elements are three chemical shifts, f_ω,pred, and their corresponding confidences, c_pred. When the input CEST profile derives from a two-site exchanging system, the DNN should report one confidence approaching zero and when the input CEST profile is only reporting on one state, two of the confidences should tend to zero.

To facilitate an end-to-end analysis, that is chemical shifts and their uncertainties obtained directly from the experimental anti-phase CEST profiles, the network to determine chemical shifts was trained on outputs from DNN_TR, i.e. in-phase CEST profiles generated from anti-phase profiles. Having the second DNN, referred to as DNN_CS, determine both chemical shifts and their confidences requires special attention to the loss function used for training. Naturally, the DNN_CS network should be trained to optimise the confidence and thus obtain as accurate peak positions as possible, however, it should also be penalised, when the predicted confidence does not match the accuracy of the predicted chemical shifts. A variety of DNN architectures and loss functions have previously been designed to provide measures of the uncertainty with which DNNs make their predictions and transformations, also for the predictions of chemical shifts (Jonas and Kuhn 2019). As detailed below, we have adopted a strategy, where the loss function bears resemblance with the cost function in a least-squares fitting procedure.

The last layer of DNN_CS has sigmoidal activation, Fig. S4, which means that the output values, three values reporting on chemical shifts and three confidences, are between 0 and 1. The predicted chemical shifts in the range (0, 1), referred to as f_ω,pred, are easily converted into the range of offsets obtained in the CEST dimension of the original data using a linear mapping. For example, if the CEST profile is recorded with points between 6.6 ppm and 10.0 ppm, then the linear mapping will be δ ← 3.4 ppm × f_ω,pred + 6.6 ppm. Moreover, a predicted uncertainty, σ_pred, was calculated from the predicted confidence as σ_pred = k (1/c_pred − 1), where k is a constant and σ_pred structured such that it can take values between 0 and infinity. In order to make the predicted uncertainties match actual uncertainties of the prediction, the first part of the loss function was defined in a manner similar to a standard χ², that is:

$$L_{{{\text{freq}}}} = \mathop \sum \limits_{i = 0,1,2} \frac{{\left( {f_{{\omega ,{\text{pred}},i}} - f_{{\omega ,{\text{ true}},i}} } \right)^{2} }}{{\sigma_{{{\text{pred}},i}}^{2} }}$$

(1)

where the sum is over the three states. The constant k was initially set to 1 during training, and subsequently set to $\left( {\max \left( {{}_{{}}^{1} {\text{H}}\ {\text{offsets}}} \right) - \min \left( {{}_{{}}^{1} {\text{H}}\ {\text{offsets}}} \right)} \right)\sqrt {L_{{{\text{freq}}}} }$ to rescale L_freq to have an expectation value of 1 and so that σ_pred reports on the expected uncertainty. The purpose of the loss function in Eq. (1) is to make the predicted chemical shifts approach their true values. However, if L_freq was the only loss function used during training, then training of DNN_CS would simply lead to very low confidences (high uncertainties), which would minimise the function in Eq. (1). A second loss function was therefore added during training:

$$L_{{{\text{uncer}}}} = 10^{ - 4} \mathop \sum \limits_{i = 0,1,2} 1_{i} \sqrt {\sigma_{{{\text{pred}},i}} }$$

(2)

where, 1 = {1,1,1} for three-state exchange input and 1 = {1,1,0} in the case of two-state exchange, thereby allowing large uncertainties, σ_pred, when a state is not present in the input. The loss function in Eq. (2) serves to force DNN_CS to predict high confidences (low uncertainties) where, and only where, the input profiles report on a real state. Briefly, the DNN_CS network was trained on 1.5 × 10⁷ randomly generated CEST profiles, with a final value of L_freq = 7.3 × 10^–5, and L_uncer = 2.8 × 10^–4. For the synthetic CEST data analysed below, the range of ¹H offsets was 3.4 ppm and therefore k = 0.029 ppm. Full details of the network architecture and the training are provided in the Methods and Supporting Information sections.

It is anticipated that with minimal additional training, the DNN_CS network will be able to accurately analyse common ‘in-phase’ CEST profiles such as those often obtained for ¹⁵N and ¹³C, since these CEST profiles strongly resemble the IP-CEST profiles, Fig. 3. However, it should be stressed that the current DNN_CS network has only been fully assessed with ¹H AP-CEST profiles that have been transformed with DNN_TR.

End-to-end one-shot analysis of amide proton CEST

The two DNNs, DNN_TR and DNN_CS, described above can be applied sequentially to provide an end-to-end one-shot analysis of anti-phase CEST profiles:

$${\text{AP-CEST,}}\,{\mathbf{cest}}_{{{\text{AP}}}} \left( \omega \right){ } \xrightarrow{{{ {\text{real}}\,{\text{FT}},\,{\text{DNN}}_{{{\text{TR}}}} ,\,{\text{inverse}}\,{\text{FT}} }}} {\text{IP-CEST,}}\,{\mathbf{cest}}_{{{\text{IP}}}} \left( \omega \right) \xrightarrow{{{ {\text{DNN}}_{{{\text{CS}}}} }}} \left\{ {f_{{\omega ,\,{\text{pred}},\,i}} ,\,\sigma_{{{\text{pred}},\,i}} } \right\}_{i = 0,\,1,\,2}$$

The overall performance of this sequential DNN was first evaluated using synthetically generated data. Specifically, (i) 100,000 anti-phase CEST profiles were generated for a variety of two-site chemical exchange processes and a further 100,000 profiles for three-site exchange. The range of B₁ offsets used was 3.4 ppm for all profiles, and all other input parameters are given in Table 1. (ii) Random gaussian noise with a standard deviation of 0.01 of the maximum value of each anti-phase CEST profile was added to the input anti-phase CEST spectrum. (iii) The DNN_TR network was first used to transform all the CEST profiles from anti-phase to in-phase. (iv) The second network, DNN_CS, was used to determine the chemical shifts of the exchanging states and their associated uncertainties.

Figure 4 shows a summary of the quantitative assessment of the 100,000 CEST profiles corresponding to a two-state chemical exchange process. From Fig. 4 it is clear that the sequential DNN is able to accurately predict the chemical shifts of exchanging states from anti-phase CEST profiles. From the chemical shift predictions made on the 100,000 random CEST profiles the difference between a predicted chemical shift, δ_pred, and a true chemical shift, δ_true, was calculated, which gives an estimate of the performance and the confidence levels of the DNN as a function of c_pred and σ_pred. Importantly, as shown in Fig. 4C and D, the DNN has also successfully been trained to predict the uncertainty associated with the predicted chemical shifts. Specifically, for c_pred ≥ 0.4, the predicted uncertainty, σ_pred, agrees well with the 68.3% confidence level estimated from the analysis of the 100,000 profiles. For c_pred < 0.4, σ_pred is no longer an accurate measure of the uncertainty. Not surprisingly, the ground state chemical shifts, Fig. 4E, are generally predicted with a higher accuracy than the chemical shifts of the low-populated state, Fig. 4F, where lower confidences are obtained for small chemical shift differences between the two states, see Fig S5. The corresponding assessment carried out on 100,000 synthetic anti-phase ¹H^N CEST profiles reporting on a three-site chemical exchange process, E₁ ⇌ G ⇌ E₂, is shown in Supporting Material, Fig. S6. Figure S7 shows the summary of evaluations where random gaussian noise with a standard deviation of 0.01, 0.02, 0.04 of the maximum value of each anti-phase CEST profile was added to the input anti-phase CEST spectrum. The performance of the sequential DNN shown above strictly only holds for the ranges of data that were used for training and for the quantitative assessments, Table 1. However, as shown below, the performance of the DNN is rather robust and if the parameters of the CEST profile to be analysed deviate only slightly from the training parameters one would still expect the analysis to be valid. The ranges of parameters shown in Table 1 cover those obtained in most of CEST-based studies to date and it is therefore expected that most experimental anti-phase CEST profiles can be accurately analysed using the DNNs.

Assessment of the sequential DNN to analyse experimental CEST profiles

Experimental anti-phase ¹H CEST profiles for the L99A mutant of T4 lysozyme were analysed using the sequential and stacked DNN to gain insight into its performance on experimental data. As a validation of the performance of the fully stacked DNN two analyses were performed: in the first all of the 86 B₁ offsets were used to predict chemical shifts, while in the second, half of the offsets (every second point) were removed. Figure 5A shows the example of Gly12, where the predicted chemical shifts and uncertainties using half of the B₁ offsets agrees well with the values obtained using the full dataset. Generally, this holds for all sites, Fig. 5B and the RMSDs obtained are in line with those expected from the predicted uncertainties, σ_pred. The differences in chemical shifts based on analyses of the full and half datasets, for all profiles, as a function of the confidence level are highlighted in Fig. 5C. Finally, it should be noted that the DNN_TR network was only trained on profiles with 50–128 input points. The fact that the stacked DNN is able to accurately predict the chemical shifts from profiles with less data (43 points) than those used for training points to the robustness of the DNN.

To further assess the performance of the stacked DNNs in determining the chemical shifts of the exchanging states, anti-phase CEST profiles were obtained for the G48A mutant of the SH3 domain from Fyn (Yuwen et al. 2017a). At a static magnetic field of 14.1 T, two sets of data were obtained with B₁ fields of 26.7 Hz and 42 Hz. Figure 6A shows that the chemical shifts predicted using the stacked DNNs, independently, on the two different datasets agree well (RMSD of 7 ppb), and Fig. 6B highlights the difference in shifts based on the separate analyses of the two full datasets. Subsequently, the two experimental datasets were analysed simultaneously using a standard least-squares analysis (Yuwen et al. 2017a) with the software package ChemEx (https://github.com/gbouvignies/chemex) and the results were compared with the predictions made by the DNN, Fig. 6C. Again, the agreement between the chemical shifts predicted by the DNN and those obtained by least-squares fitting agree well, with an RMSD of 7 ppb.

Uncertainties obtained from the covariance matrix in a least-squares analysis of CEST profiles are typically around 1 ppb, which is 6 times smaller than the uncertainties obtained from the DNN, indicating that the stacked DNNs have not fully reached the level of accuracy obtained by least-squares fitting. Still, the predictions obtained from the analysis with the stacked DNNs are of an accuracy where they can be used for downstream analyses and are well beyond the level of accuracy by which these shifts can be predicted from a high-resolution structure (Han et al. 2011). Alternatively, the DNN-predicted chemical shifts can serve as excellent starting parameters for a subsequent least-squares analysis. It is also possible that larger or alternative DNN architectures along with longer training periods could improve the performance of the DNN predictions.

Conclusions

A deep neural network was developed and trained to determine amide proton chemical shifts of exchanging states from anti-phase ¹H^N CEST profiles. The approach first leads to the conversion of anti-phase to in-phase ¹H^N CEST profiles, whereafter the chemical shifts are predicted along with their uncertainties. Compared with other analysis tools, the DNN does not require any additional training and there are no user adjustable parameters, which makes the analysis autonomous and suitable for automated processing pipelines. Thus far, the DNN only predicts chemical shifts. If additional parameters are sought, such as exchange rates and populations, the output shift values from the DNN can then serve as excellent starting points for a least-squares fitting procedure. The methodology and DNNs presented here add to the growing applications of deep learning and artificial intelligence for the analysis of NMR data, and provide an example of the autonomous analysis of complex NMR data reporting on macromolecular dynamics and chemical exchange.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request. The training data are also available as open source from Zenodo (Karunanithy et al. 2022) with scripts to analyse these. Moreover, scripts and code for performing the end-to-end one-shot analysis of amide proton CEST on experimental data using DNN_TR and DNN_CS are available on GitHub: https://github.com/gogulan-k/DNN_autoCEST.

References

Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado GS, Davis A, Dean J, Devin M, Ghemawat S, Goodfellow I, Harp A, Irving G, Isard M, Jia Y, Jozefowicz R, Kaiser L, Kudlur M, Levenberg J, Mane D, Monga R, Moore S, Murray D, Olah C, Schuster M, Shlens J, Steiner B, Sutskever I, Talwar K, Tucker P, Vanhoucke V, Vasudevan V, Viegas F, Vinyals O, Warden P, Wattenberg M, Wicke M, Yu Y, Zheng X (2016) TensorFlow: large-scale machine learning on heterogeneous distributed systems. Accessed from https://www.tensorflow.org
Amey JL, Keeley J, Choudhury T, Kuprov I (2021) Neural network interpretation using descrambler groups. Proc Natl Acad Sci USA. https://doi.org/10.1073/pnas.2016917118
Article MathSciNet Google Scholar
Beckwith MA, Erazo-Colon T, Johnson BA (2021) RING NMR dynamics: software for analysis of multiple NMR relaxation experiments. J Biomol NMR 75:9–23
Article Google Scholar
Bertini I et al (1999) High-field NMR studies of oxidized blue copper proteins: the case of spinach plastocyanin. J Am Chem Soc 121:2037–2046
Article Google Scholar
Boehr DD, Dyson HJ, Wright PE (2006) An NMR perspective on enzyme dynamics. Chem Rev 106:3055–3079
Article Google Scholar
Bouvignies G et al (2011) Solution structure of a minor and transiently formed state of a T4 lysozyme mutant. Nature 477:111–117
Article ADS Google Scholar
Chao F-A, Byrd RA (2016) Geometric approximation: a new computational approach to characterize protein dynamics from NMR adiabatic relaxation dispersion experiments. J Am Chem Soc 138:7337–7345
Article Google Scholar
Chaudhari AS et al (2021) Prospective deployment of deep learning in MRI: a framework for important considerations, challenges, and recommendations for best practices. J Magn Reson Imaging 54:357–371
Article Google Scholar
Chollet F (2015) Keras. Accessed from https://keras.io
Faust O et al (2020) HSP40 proteins use class-specific regulation to drive HSP70 functional diversity. Nature 587:489–494
Article ADS Google Scholar
Fawzi NL, Ying J, Ghirlando R, Torchia DA, Clore GM (2011) Atomic-resolution dynamics on the surface of amyloid-β protofibrils probed by solution NMR. Nature 480:268–272
Article ADS Google Scholar
Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Cambridge
MATH Google Scholar
Han B, Liu Y, Ginzinger SW, Wishart DS (2011) SHIFTX2: significantly improved protein chemical shift prediction. J Biomol NMR 50:43–57
Article Google Scholar
Hansen DF (2019) Using deep neural networks to reconstruct non-uniformly sampled NMR spectra. J Biomol NMR 73:577–585
Article Google Scholar
Hansen DF, Led JJ (2006) Determination of the geometric structure of the metal site in a blue copper protein by paramagnetic NMR. Proc Natl Acad Sci USA 103:1738–1743
Article ADS Google Scholar
Hansen DF, Vallurupalli P, Lundstrom P, Neudecker P, Kay LE (2008) Probing chemical shifts of invisible states of proteins with relaxation dispersion NMR spectroscopy: how well can we do? J Am Chem Soc 130:2667–2675
Article Google Scholar
Hansen AL, Nikolova EN, Casiano-Negroni A, Al-Hashimi HM (2009) Extending the range of microsecond-to-millisecond chemical exchange detected in labeled and unlabeled nucleic acids by selective carbon R(1rho) NMR spectroscopy. J Am Chem Soc 131:3818–3819
Article Google Scholar
Helgstrand M, Hard T, Allard P (2000) Simulations of NMR pulse sequences during equilibrium and non-equilibrium chemical exchange. J Biomol NMR 18:49–63
Article Google Scholar
Henzler-Wildman K, Kern D (2007) Dynamic personalities of proteins. Nature 450:964–972
Article ADS Google Scholar
Huang G, Liu Z, van der Maaten L, Weinberger KQ (2016) Densely connected convolutional networks. Accessed from https://arxiv.org/abs/1608.06993
Jonas E, Kuhn S (2019) Rapid prediction of NMR spectral properties with quantified uncertainty. J Cheminform 11:50
Article Google Scholar
Karplus M, Kuriyan J (2005) Molecular dynamics and protein function. Proc Natl Acad Sci USA 102:6679–6685
Article ADS Google Scholar
Karunanithy G, Hansen DF (2021) FID-Net: A versatile deep neural network architecture for NMR spectral reconstruction and virtual decoupling. J Biomol NMR 75:179–191
Article Google Scholar
Karunanithy G, Mackenzie HW, Hansen DF (2021) Virtual homonuclear decoupling in direct detection nuclear magnetic resonance experiments using deep neural networks. J Am Chem Soc 143:16935–16942
Article Google Scholar
Karunanithy G, Yuwen T, Kay LE, Hansen DF (2022) Towards autonomous analysis of chemical exchange saturation transfer experiments using deep neural networks. doi:10.5281/zenodo.6394499
Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. Accessed from https://arxiv.org/abs/1412.6980
Klukowski P et al (2018) NMRNet: a deep learning approach to automated peak picking of protein NMR spectra. Bioinformatics 34:2590–2597
Article Google Scholar
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444
Article ADS Google Scholar
Loria JP, Rance M, Palmer AG (1999) A relaxation-compensated Carr-Purcell-Meiboom-Gill sequence for characterizing chemical exchange by NMR spectroscopy. J Am Chem Soc 121:2331–2332
Article Google Scholar
Luo J, Zeng Q, Wu K, Lin Y (2020) Fast reconstruction of non-uniform sampling multidimensional NMR spectroscopy via a deep neural network. J Magn Reson 317:106772
Article Google Scholar
Meiboom S, Gill D (1958) Modified spin-echo method for measuring nuclear relaxation times. Rev Sci Instrum 29:688–691
Article ADS Google Scholar
Neudecker P, Korzhnev DM, Kay LE (2006) Assessment of the effects of increased relaxation dispersion data on the extraction of 3-site exchange parameters characterizing the unfolding of an SH3 domain. J Biomol NMR 34:129–135
Article Google Scholar
Palmer AG, Massi F (2006) Characterization of the dynamics of biomacromolecules using rotating-frame spin relaxation NMR spectroscopy. Chem Rev 106:1700–1719
Article Google Scholar
Qu X et al (2020) Accelerated nuclear magnetic resonance spectroscopy with deep learning. Angew Chem 132:10383–10386
Article ADS Google Scholar
Tollinger M, Skrynnikov NR, Mulder FAA, Forman-Kay JD, Kay LE (2001) Slow dynamics in folded and unfolded states of an SH3 domain. J Am Chem Soc 123:11341–11352
Article Google Scholar
Vallurupalli P, Bouvignies G, Kay LE (2012) Studying ‘invisible’ excited protein states in slow exchange with a major state conformation. J Am Chem Soc 134:8148–8161
Article Google Scholar
Vallurupalli P, Sekhar A, Yuwen T, Kay LE (2017) Probing conformational dynamics in biomolecules via chemical exchange saturation transfer: a primer. J Biomol NMR 67:243–271
Article Google Scholar
Ward K, Aletras A, Balaban R (2000) A new class of contrast agents for MRI based on proton chemical exchange dependent saturation transfer (CEST). J Magn Reson 143:79–87
Article ADS Google Scholar
Wurm JP et al (2021) Molecular basis for the allosteric activation mechanism of the heterodimeric imidazole glycerol phosphate synthase complex. Nat Commun 12:2748
Article ADS Google Scholar
Xie T, Saleh T, Rossi P, Kalodimos CG (2020) Conformational states dynamically populated by a kinase determine its function. Science 370:eabc754
Article Google Scholar
Yang H et al (2003) Protein conformational dynamics probed by single-molecule electron transfer. Science 302:262–266
Article ADS Google Scholar
Yuwen T, Sekhar A, Kay LE (2017a) Separating dipolar and chemical exchange magnetization transfer processes in 1 H-CEST. Angew Chem Int Ed 56:6122–6125
Article Google Scholar
Yuwen T, Huang R, Kay LE (2017b) Probing slow timescale dynamics in proteins using methyl 1H CEST. J Biomol NMR 68:215–224
Article Google Scholar
Zhou J, van Zijl PCM (2006) Chemical exchange saturation transfer imaging and spectroscopy. Prog Nucl Magn Reson Spectrosc 48:109–136
Article Google Scholar

Download references

Acknowledgements

T.Y. acknowledges post-doctoral support from the Canadian Institutes of Health Research (CIHR). Computational aspects of this work were supported by the Francis Crick Institute (DFH) through provision of access to the Scientific Computing STP and the Crick data Analysis and Management Platform (CAMP). The Francis Crick Institute receives its core funding from Cancer Research UK (FC010233), the UK Medical Research Council (FC010233), and the Wellcome Trust (FC010233). DFH is supported by the Biotechnology and Biological Sciences Research Council UK (BBSRC) (ref: BB/T011831/1). LEK acknowledges support from the CIHR and the Natural Sciences and Engineering Council of Canada. LEK would like to dedicate this paper to the unbelievable goal scored by Connor McDavid of the Edmonton Oilers against the New York Rangers, November 5, 2021.

Author information

Authors and Affiliations

Division of Biosciences, Department of Structural and Molecular Biology, University College London, London, WC1E 6BT, UK
Gogulan Karunanithy & D. Flemming Hansen
Department of Pharmaceutical Analysis and State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University, Beijing, 100191, China
Tairan Yuwen
Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada
Lewis E. Kay
Department of Chemistry, University of Toronto, Toronto, ON, M5S 3H6, Canada
Lewis E. Kay
Department of Biochemistry, University of Toronto, Toronto, ON, M5S 1A8, Canada
Lewis E. Kay
Program in Molecular Medicine, Hospital for Sick Children Research Institute, Toronto, ON, M5G 0A4, Canada
Lewis E. Kay

Authors

Gogulan Karunanithy
View author publications
You can also search for this author in PubMed Google Scholar
Tairan Yuwen
View author publications
You can also search for this author in PubMed Google Scholar
Lewis E. Kay
View author publications
You can also search for this author in PubMed Google Scholar
D. Flemming Hansen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to D. Flemming Hansen.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 1942 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Karunanithy, G., Yuwen, T., Kay, L.E. et al. Towards autonomous analysis of chemical exchange saturation transfer experiments using deep neural networks. J Biomol NMR 76, 75–86 (2022). https://doi.org/10.1007/s10858-022-00395-z

Download citation

Received: 23 December 2021
Accepted: 05 May 2022
Published: 27 May 2022
Issue Date: June 2022
DOI: https://doi.org/10.1007/s10858-022-00395-z

Towards autonomous analysis of chemical exchange saturation transfer experiments using deep neural networks

Abstract

Introduction