Matthew Wallace*a,
James M. Sharpea,
Krzysztof Baj
b,
Michael Ngwubea,
Jenny Thirlwayc,
Patrick L. Kerigan Higgs
c,
G. Richard Stephenson
a,
Jonathan A. Iggo
b,
Thomas E. Storra and
Christopher J. Richards
*a
aSchool of Chemistry, Pharmacy and Pharmacology, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK. E-mail: matthew.wallace@uea.ac.uk; chris.richards@uea.ac.uk
bDepartment of Chemistry, University of Liverpool, Crown Street, Liverpool, L69 7ZD, UK
cIksuda Therapeutics Ltd., The Biosphere, Draymans Way, Newcastle Helix, Newcastle upon Tyne, NE4 5BX, UK
First published on 26th May 2025
We generate pH gradients spanning more than six units in standard NMR tubes and determine all the pKa values of polyprotic compounds in single 20 minutes chemical shift imaging (CSI) NMR experiments. The modest demands of our method in terms of sample quantity and preparation time allow it be performed as part of the routine characterisation and optimisation of organic molecules during synthesis campaigns. As proof of concept, we measure the pKa values of a family of vinylpyridines employed as antibody drug conjugate linkers. Our analysis reveals a strong correlation between the experimental aqueous pKa and the rate of conjugate addition of thiol nucleophiles to the vinyl group, representing a powerful predictive method.
![]() | ||
Fig. 1 (a) Predicted (solid line) and experimental (cross) pH versus position from the base of the NMR tube for determination of the pKa values of tyrosine (OH, NH2: blue), lysine (NH2: black), histidine (NH2, imidazole, COOH: red) and aspartic acid (COOH: green). Photograph of NMR sample with NMR-active region indicated as yellow dashed lines. Sketch of pH range accessed with each buffer system. (b) Example 1H spectra of histidine extracted from CSI image at pH value and vertical position indicated. (c) Plot of 1H chemical shift of α (blue circle), β (green diamond) and H5 proton (red triangle) of histidine versus pH. The solid lines are fits to eqn (1). |
To prepare a pH gradient, crystals of oxalic acid dihydrate were weighed directly into a Wilmad 528-PP 7′′ NMR tube. The nominal masses were within 20% of the stated values. Four 2 mm diameter glass beads were then placed in the tube. Finally, a solution of pH buffers, analyte and indicators was placed on top of the glass beads using a 9′′ NMR Pasteur pipette to a height of 40–45 mm above the absolute base of the NMR tube. The sample was then stood at 22 °C in the autosampler rack of the spectrometer prior to analysis.
CSI datasets were processed in phase-sensitive mode with 32 K points and an exponential line broadening factor of 3 Hz. Each row was phased, baseline corrected, referenced to DSS (0 ppm) and the chemical shifts of indicators and analytes exported to the spreadsheet accompanying this work using automation scripts written in house (section S14†). Only rows 20–120 (128 point dataset) were used in the analysis to avoid off-coil effects. The pH was determined from the 1H chemical shifts of organic indicator molecules and ions (sections S1 and S3†). Fitting of the analyte chemical shifts to eqn (1) was accomplished using the Solver module of Microsoft Excel optimising all variables, with the constraint for histidine that the fitted δH3 was less than 1 ppm higher than the lowest experimental chemical shift measured in the data series.
Buffer | pKa,Ha | pH rangeb |
---|---|---|
a pKa of protonated buffer species.b pH range over which buffer is between 90% and 10% protonated.c Tris(hydroxymethyl-d3)amino-d2-methane.d pH range accessible using oxalic acid/oxalate alone. | ||
NaOH | — | >11 |
Na2CO3 | 10 | 9.2–10.8 |
NaOB(OH)2 | 9 | 8.2–10.1 |
Trisc | 8 | 7.2–9.0 |
K2HPO4 | 7 | 6.1–7.9 |
NaHCO3 | 6 | 5.4–7.3 |
Acetate | 4.7 | 3.8–5.7 |
Oxalate2− | 4 | 2–5d |
This highly tuneable pH range allows us to determine the pKa values of molecules that possess several ionisable groups. Here, we use amino acids bearing ionisable side chains as exemplar systems. To extract the three pKa values of histidine in a single experiment, we require a smooth pH gradient spanning approximately pH 2 to pH 11. We choose a buffer system where the pKa,H values of the buffers are separated by approximately two units or less: NaOH, Na2CO3, NaOB(OH)2, tris, K2HPO4 and acetate-d3. With 3.7 mg of oxalic acid placed at the bottom of the NMR tube and with 10 mM of each buffer component, the concentration of oxalic acid is predicted to vary from 110 mM to 2 mM from the bottom to the top of the NMR active region of the sample (Fig. 1a), 6.5 hours after preparation, giving a pH gradient spanning the required range. We note that this calculated topt can vary considerably between samples due to differences in the identity and concentration of buffers selected, the pH range targeted, and the mass of acid weighed into the sample tube. However, there is a ca. 2-hour window either side of the calculated topt over which acceptable pKa values are obtained (section S4†).
Resolution of the two pKa values of aspartic acid associated with the COOH groups requires a pH gradient spanning pH 1 to pH 7. We choose buffer components with lower pKa,H values (10 mM NaHCO3, acetate-d3 and 20 mM oxalate), 3.4 mg of oxalic acid and leave the sample seven hours before analysis. Similarly, resolution of the pKa values of lysine (NH2) and tyrosine (NH2, Ar–OH) requires pH gradients spanning pH 6 to pH 12. This alkaline pH range is accomplished using more basic buffer components (0.01 M NaOH, 20 mM Na2CO3, tris and NaOB(OH)2), a smaller mass of oxalic acid (1.3 mg), and leaving the sample 8.5 hours before analysis.
When prepared and analysed by CSI, the pH exhibits a smooth variation over the desired pH ranges, as predicted by our spreadsheet (Fig. 1a). We attribute the discrepancy between the predicted and experimental pH values to the coupled diffusion of acid and base along the NMR tube and the ±0.3 mg uncertainty in the mass of oxalic acid used in the experiments (section S6, ESI†).
The 1H NMR chemical shifts of the pH indicator molecules and amino acids were extracted from the CSI datasets using the routines and scripts provided in section S14 of the ESI.† This data was then exported to the spreadsheets accompanying this work and fitted to extract the pH of each row. The pKa values of the amino acids were obtained by fitting the observed chemical shift, δobs, of each observed site on the molecule to eqn (1):
![]() | (1) |
![]() | (2) |
Amino acid | pK1,0 | pK2,0 | pK3,0 | pH rangea |
---|---|---|---|---|
Literature values are provided in parentheses, taken from ref. 16 unless otherwise stated.a pH range accessed in CSI experiment (Fig. 1a).b Ref. 17. Uncertainties were calculated as described in section S5 of the ESI.† | ||||
Histidine | 1.39 ± 0.14 (1.56) | 6.12 ± 0.12 (6.00) | 9.28 ± 0.04 (9.25) | 2.0–11.1 |
I = 0.12 M | Δz2 = 3 | Δz2 = 1 | Δz2 = −1 | |
Lysine | — | 9.12 ± 0.04 (9.09) | 10.90 ± 0.04 (10.90) | 7.7–11.4 |
I = 0.13 M | Δz2 = 1 | Δz2 = −1 | ||
Tyrosine | — | 9.22 ± 0.08 (9.09) | 10.59 ± 0.05 (10.69) | 6.5–11.2 |
I = 0.13 M | Δz2 = −1 | Δz2 = −3 | ||
Aspartic acid | 2.12 ± 0.08 (1.99b) | 3.98 ± 0.03 (3.89b) | — | 1.8–6.3 |
I = 0.10 M | Δz2 = 1 | Δz2 = −1 |
Negligible variation in these pKa values (≤0.05 units) is observed upon repetition of the CSI experiments (section S6†) while acquisition of CSI images ±2 hours around topt causes <0.1 units variation in the fitted pKa values (section S4†). 1H spectra are provided in section S7 of the ESI.† We note that the accuracy and reproducibility of CSI experiments are comparable to potentiometric titration and consume less compound (section S3.3).
We sought to synthesise several 4-vinylpyridine derivatives to which can be linked two cytotoxic payloads as a means to improve the drug-antibody ratio (DAR) of a resulting ADC, with this study providing also the opportunity to investigate the relationship between vinylpyridine pKa and the rate of a model bioconjugation reaction. Synthesis yielded single-arm compounds A–E, dual-armed compounds F–K containing functionality linked additionally to position 6, and also two heteroatom-substituted compounds L and M for additional reactivity comparison (Table 3).
Entry | R1 | R2 | kobs/M−1 min−1 | pKa,0 NMR | pKa,0 Epik | pKa, MolGpKa | pKa, Jaguar |
---|---|---|---|---|---|---|---|
a pKa,H of tertiary amine.b pKa,H of 1,2,3-triazole moiety.c Additional datapoint measured in homogeneous solution of 0.1 M HCl (Fig. S27†).d Reported pKa,H of pyrrolidine moiety.e 16 point CSI experiment recorded due to low solubility. | |||||||
A | Me | (CH2)2CO2Et | 0.80 | 6.18 ± 0.09 | 6.62 | 5.9 | 6.40 |
B | CH2CH(CH2CO2Et)CO2Et | 0.17 | 5.69 ± 0.09 | 6.25 | 5.5 | 5.49 | |
C | CH2N(CH2CO2Et)2 | 0.13 | 5.86 ± 0.07 | 5.50, 3.91a | 4.7, 4.6a | 7.29, 0.95a | |
D | (CH2)2N(CH2CO2Et)2 | 0.39 | 6.69 ± 0.06, 1.4 ± 0.07 | 6.48, 3.95a | 6.0, 5.1a | 7.38, 2.43a | |
E | (CH2)3N(CH2CO2Et)2 | 0.75 | 6.59 ± 0.04, 3.12 ± 0.37 | 6.87, 4.39a | 6.3, 5.5a | 6.75, 3.58a | |
F | (CH2)2CO2Et | (CH2)2CO2Et | 0.17 | 5.36 ± 0.04 | 5.39 | 5.2 | 5.89 |
G | ![]() |
0.0054 | 2.98 ± 0.05 | 3.37, 2.84b | 4.5, 3.0b | 3.27 | |
H | (CH2)NHBoc | 0.083 | 4.74 ± 0.03 | 4.57 | 4.7 | 4.47 | |
I | (CH2)NHBoc | (CH2)NHBoc | 0.087 | 4.11 ± 0.02 | 2.55 | 4.4 | 5.39 |
J | ![]() |
0.0022 | 2.39 ± 0.02 | 1.97, 2.84b | 4.2, 3.2b | 3.86 | |
K | ![]() |
![]() |
n.d. | 0.92 ± 0.15c | 0.69, 2.84b | 4.1, 3.1b | 1.22 |
L | H | ![]() |
0.08428 | 7.60 ± 0.07 | 7.35, 7.27d | 6.7, 5.4d | 7.60 |
M | ![]() |
0.0048 | 2.49 ± 0.49e | 2.42 | 3.1 | 1.62 |
Starting with compound A, we assumed a pKa value of approximately 6. We chose a buffer system comprising 20 mM NaHCO3, 10 mM tris, K2HPO4 and acetate. 1.5–2 mg of oxalic acid was predicted to give a gradient between pH 2 and pH 9 with a topt of 8 hours. The pKa of twelve other variously substituted 4-vinyl pyridines were then determined. For comparison, micro-pKa values were predicted using three computational packages: Jaguar pKa (Schrödinger Release 2024-2: Schrödinger, LLC, New York, NY, 2024),13–15 Epik version 6.812 and MolGpKa11 (Table 3). The values returned by these calculations were used to guide the selection of the buffer systems for the NMR experiments. For example, the pKa of compound J was predicted to be approximately 3 (Table 3). Hence a more acidic buffer system was used relative to A, comprising 20 mM oxalate, 10 mM acetate and NaHCO3 and 3 mg of oxalic acid was used. The resulting pH gradient was predicted to span from pH 1.5 to 7 with a topt of 6 hours. Example predicted and experimental gradients for each compound are provided in section S8 of the ESI.†
For compounds D and E additional pKa values of 1.4 and 3.1 were obtained that can be associated with the tertiary amine functionality based on the pH-dependence of the chemical shifts of different parts of the molecule (section S9, ESI†). A pKa value of 3.3 was determined for the analogous compound PhCH2N(CH2CO2Et)2, confirming the low basicity of the tertiary amine (Fig. S30, ESI†). DFT calculations confirmed a higher basicity of the pyridine in compounds C–E (section S10, ESI†). The pKa values of the amine sites presented in Table 3 (Jaguar pKa) were calculated explicitly with the pyridine in the protonated state and are thus closer to the experimentally observed macroscopic values. However, the pKa values returned by MolGpKa11 and Epik 6.812 consider the ionisation of each site in isolation (micro-pKa) without regard for the effect of the ionisation of neighbouring groups. The pKa values reported by these methods for the amine sites are thus higher than the second (lower) macroscopic pKa observed. That no additional values were obtained for the triazole containing compounds (G, J and K) is consistent with the low basicity of this heterocycle (pKa for 1-methyl-1,2,3-triazole 1.25).27 However, the pKa values of the triazole moiety calculated using Epik and MolGpKa are significantly higher, again highlighting the limitations of these predictors when applied to compounds that have more than one protonation site. The 2-pyrrolidinopyridine component of L may be regarded as a single functional group, giving this compound the highest basicity of the compounds studied.
As a model bioconjugation reaction we employed a reaction of each vinylpyridine with glutathione (GSH, Scheme 1). This cysteine containing tripeptide is a non-volatile crystalline solid making it relatively easy to handle, and it has been used previously as a model for reactivity studies on electrophilic covalent inhibitors of cysteine-containing proteins.28–30 One equivalent of glutathione was added as a solution in D2O (0.22 M) to the vinylpyridine in DMSO-d6 (0.073 M), with pyrazine included as an internal standard for integration. The reaction was monitored using 1H NMR spectroscopy and the rates of this second order addition25,26 are given in Table 3. Synthetic schemes for the preparation of A–M and example kinetic plots are provided in sections S11 and S12 of the ESI,† respectively.
![]() | ||
Scheme 1 Reaction of 4-vinylpyridines with glutathione (GSH) for the determination of rate as a function of R1 and R2. |
A plot of the log of rate versus the pKa value determined by NMR exhibits a strong positive correlation (Fig. 3a) consistent with a lowering of the activation energy barrier and an acceleration of reaction rate when the pyridine nitrogen bears a positive charge, as reported for the reaction of N-acetylcysteineamide with quaternized 2-vinylpyridine.26 Three different chain lengths were used to attach an iminodiacetate unit to the 2-position [(CH2)n, where n = 1–3, entries C–E] and an increase in chain length results in an increase in rate such that with a three-carbon spacer the electronegativity of the nitrogen in E is essentially negated. The other electron-withdrawing functional groups are attached with either a two-carbon (B) or one-carbon spacer (G and H) which significantly diminishes the rate, especially when attached to both positions 2 and 6 (I–K). Similarly, replacement of the methyl group in A with an ester functionality reduces the reactivity (F). The conjugated 2-pyrrolidine nitrogen in L significantly decreases the reactivity relative to A, despite L being the most basic of all the compounds studied as it is now too electron rich to react at a significant rate. The 2-oxygen substituent of M significantly reduces pyridine basicity, shutting down reactivity. Thus, a functionalised alkyl chain is the optimum substituent such that compounds A, D and E are in the ‘Goldilocks zone’ with respect to pyridine basicity and the enhanced electrophilicity of the protonated species.
![]() | ||
Fig. 3 (a) Plot of log![]() |
To confirm that protonation of the pyridines took place under the conditions of the reaction, the 1H chemical shift of the methyl resonance of A was measured in D2O/DMSO-d6 solutions containing 20 mM HCl (A fully protonated, 2.6074 ppm) and in the absence of acidic species (A non-protonated, 2.3548 ppm). Comparison to the chemical shift measured in the presence of GSH (2.3732 ppm) revealed 7 ± 2% of A was protonated in the presence of GSH under the conditions of the reaction.31 A full discussion of this calculation, including uncertainty, is provided in section S12 of the ESI.† We note that due to the acidic nature of GSH, an appreciable percentage protonation (>3%) is anticipated for vinylpyridines with pKa,0 NMR > 5.3, while the pH is expected to be weakly basic (Fig. S38†). An acceleration of the rate of reaction of N-acetylcysteineamide with 2-vinylpyridine has been reported under weakly acidic conditions in H2O (pH 5.7) relative to neutral buffer (pH 7.6).26 The acidity of GSH therefore aids the reaction under our experimental conditions, with a higher degree of protonation and thus a faster rate of reaction observed for the more basic vinylpyridines.
The correlation between the 13C NMR chemical shift of the β-carbon of para-substituted styrenes and the Hammett substituent constant (σp) has previously been noted.32 Accordingly, a correlation is observed between the pKa values of A–M and the 13C NMR chemical shift of the electrophilic β-vinyl carbon (Fig. 3b). This correlation suggests that 13C NMR can be used to predict the pKa values of substituted 4-vinylpyirindes and thus estimate their reactivity towards cysteine bioconjugation.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5an00406c |
This journal is © The Royal Society of Chemistry 2025 |