Kate P.
Nolan
,
Callum A.
Rosser
,
James L.
Wood
,
Josep
Font
,
Athavan
Sresutharsan
,
Joseph
Wang
,
Todd E.
Markham
,
Renae M.
Ryan
and
Rachel
Codd
*
The University of Sydney, School of Medical Sciences, New South Wales 2006, Australia. E-mail: rachel.codd@sydney.edu.au
First published on 25th November 2024
The trihydroxamic acid bacterial siderophore desferrioxamine B (DFOB, 1) produced by the DesABCD biosynthetic cluster coordinates metals beyond Fe(III), which identifies potential to modify this chelator type to broaden metal sequestration and/or delivery applications. Rather than producing discrete chelators by total chemical synthesis from native monomers including N-hydroxy-N-succinyl-cadaverine (HSC, 2), the recombinant siderophore synthetase from Salinispora tropica CNB-440 (StDesD) was used with different substrate combinations to produce biocombinatorial mixtures of hydroxamic acid chelators. The mixtures were screened with Ga(III) or Zr(IV) as surrogates of immunological positron emission tomography (PET) imaging radiometals 68Ga(III) or 89Zr(IV) to inform known or new coordination chemistry. The last-in-line enzyme DesD forms amide bonds between two equivalents of 2 and N-hydroxy-N-acetyl-cadaverine to produce trimeric 1. Although hexadentate 1 is the terminal product evolved for Fe(III) complexation, it was conceived amine-containing 1 might remain a viable DesD substrate for further iteration with 2 to generate higher-order hydroxamic acid multimers. Incubation of StDesD, cofactors ATP and Mg(II), and 1 and 2, generated the octadentate hydroxamic acid DFOB-HSC (3) (previously characterised and named DFO*), decadentate DFOB-(HSC)2 (4), dodecadentate DFOB-(HSC)3 (5) and tetradecadentate DFOB-(HSC)4 (6). The system with StDesD and 2 alone generated a set of linear multimers containing flanking amine and carboxylic acid groups (HSC)x-L (x = 2 (7), x = 3 (8), x = 4 (9), x = 5 (10)) and a subset of the cognate ring-closed macrocycles (HSC)x-MC (x = 3 (12), x = 4 (13), x = 5 (14), with x = 2 (11) not detected). Liquid chromatography-mass spectrometry metal screening experiments detected 1:1 complexes of Ga(III) or Zr(IV) and 1, 3–5, 8–10, and 12–14. Complexes of 2:1 stoichiometry were formed between Ga(III) and the high-denticity, high-cavity-volume chelators 4–6, and 14. A processive intra-cavity assembly mechanism has been posited for this flexible siderophore synthetase in delivering a large set of multimeric chelators.
The biosynthesis of 1 begins with the decarboxylation of L-lysine (DesA) to produce cadaverine (1,5-diaminopentane), which is mono-N-hydroxylated (DesB) to produce N-hydroxy-cadaverine (HC). The HC intermediate is processed (DesC) with succinyl-coenzyme A (Suc-CoA) or acetyl-coenzyme A (Ac-CoA) to produce N-succinyl-HC (HSC, 2) or N-acetyl-HC (HAC, 2a), respectively.6–14 In one proposed onward pathway, the terminal siderophore synthetase DesD condenses 2a with 2 to produce the HAC-HSC heterodimer 2b, which in a second DesD cycle is condensed with a second equivalent of 2 to produce 1 (Scheme 1, orange box).7 DesD requires cofactors ATP and Mg(II) to adenylate 2 primed for nucleophilic attack from the amine group in 2a (Cycle 1) or 2b (Cycle 2).
This work sought to examine the potential of using recombinant DesD from Salinispora tropica CNB-440 (StDesD) in a chemoenzymatic synthesis approach to generate biocombinatorial pools of hydroxamic acid compounds, and to probe these pools with metal ions to assess coordination chemistry. This organism was selected based on the availability of the S. tropica CNB-440 genome15 and the verified production of a wide range of linear and macrocyclic hydroxamic acid siderophores.16,17 The work showed that StDesD could use as substrates a combination of 1 and 2, or 2 alone, in multiple iterative cycles to generate known and new linear and macrocyclic hydroxamic acid compounds with higher-order multiplicities and octa-/deca-/dodeca- and tetradeca-dentate denticities, which displayed different coordination chemistry towards Ga(III) and Zr(IV). These data support StDesD as an elastic machine able to accommodate rubbery substrates to provide a facile pathway to widening the chemical space and potential function of this class of metal chelator.
Although there is debate about the biosynthetic sequence for 1 and its multimers, only one mechanism can be invoked for producing 3 from 1 and 2, namely the condensation between the primary amine group of 1 and the activated adenylated 2 monomer. This mechanism parallels that for the condensation of 2b with 2 to form 1. Ambiguity in the assembly mechanism arises in the case of 4, which could be formed from a reaction between 3 and 2, and/or between 1 and the homodimer of 2 (equivalent to 7). Both sequences towards 4 are reasonable and are not mutually exclusive, with further insights into assembly mechanisms provided in a later section.
The upper boundary for the StDesD-mediated production of 1-based multimers was examined using a higher-sensitivity LC-MS system (Fig. 2), which showed EIC signals and MS/MS fragmentation patterns matching theoretical patterns (Fig. S2†) characteristic of 3, 4, 5 and 6, as a set of high-denticity (octa-, deca-, dodeca-, tetradeca-) siderophores. Assuming similar ionisation properties among these structural analogues, and using peak areas normalised to 3 (100%), the relative concentrations of 4, 5 and 6 were estimated as 29%, 2% and 0.1%, respectively. There was a positive correlation between the number of HC units present in 1, 3, 4, 5 and 6 and the increase in the reverse-phase LC retention time, as a correlate of the increase in compound hydrophobicity.
Scheme 2 Linear (7, 8, 9, 10) and cognate macrocyclic hydroxamic acid multimers (11 (not observed), 12, 13, 14) produced from StDesD-mediated condensation reactions with 2 as sole substrate. |
The LC-MS trace from the StDesD and 2 system showed a major signal at tR 13.4 min attributed to the macrocycle (HSC)3-MC (desferrioxamine E (DFOE)) (12) (Fig. 3f, black; with exogenous addition of authentic 12 to the sample shown in gray), which is the macrocyclic product of linear (HSC)3-L (desferrioxamine G1 (DFOG1)) (8) which eluted at tR 10.2 min (Fig. 3e). The asterisked signals in the LC trace from the StDesD, 1 and 2 system (Fig. 1a) at 10.2 min and 13.4 min corresponded with 8 and 12, respectively. The minimal linear compound (HSC)2-L (7), known as bisucaberin B,23 was detected in the StDesD and 2 system (Fig. 3c), although the corresponding natural product bisucaberin macrocycle (HSC)2-MC (11)24,25 was not (Fig. 3d), which might indicate some strain in the macrocyclic pre-complex that prevented macrocyclisation.
The StDesD synthetase has evolved to produce siderophores optimised for Fe(III) binding,16,17 with hexadentate, macrocyclic 12 (logK = 32.5)26 ideal for this function, in accord with 12 as the major product. The system gave lower intensity signals that corresponded with (HSC)4-L (9) (Fig. 3g) and (HSC)4-MC (13) (Fig. 3h), with this latter macrocycle (known as DFOT1) previously characterised in nature,27 and in both in vitro14 and synthetic studies.28 Signals and MS/MS fragmentation patterns consistent with the linear hydroxamic acid amino-carboxylic acid pentamer (HSC)5-L (10) (Fig. 3i) and the cognate pentameric macrocycle (HSC)5-MC (14) (Fig. 3j) were detected. In the system with 2 as sole substrate under these analytical conditions, linear 10 and macrocyclic 14 appeared to mark the upper boundary of the multimeric assembly capacity of StDesD (Table 1).
No. | Species | Alternative name | RT (min) | EIC [M + H]+ | Ref. |
---|---|---|---|---|---|
a The first report of DFOB-HSC named the compound DFO*. The name DFOB-HSC is used here to maintain consistency among the set of DFOB-(HSC)x multimers. b Reported in ref. 14; named in this work. c N/D not detected. | |||||
1 | DFOB | DFO | 8.9 | 561.4 | 29 |
3 | DFOB-HSCa | DFO* | 10.5 | 761.5 | 30 |
4 | DFOB-(HSC)2 | DFOP1 | 11.5 | 961.6 | This work |
5 | DFOB-(HSC)3 | DFOQ1 | 12.2 | 1161.7 | This work |
6 | DFOB-(HSC)4 | DFOR1 | 12.8 | 1361.8 | This work |
7 | (HSC)2-L | Bisucaberin B | 8.0 | 419.3 | 23 |
8 | (HSC)3-L | DFOG1 | 10.2 | 619.4 | 31 |
9 | (HSC)4-L | DFOS1b | 11.7 | 819.5 | 14 |
10 | (HSC)5-L | DFOU1 | 12.7 | 1019.6 | This work |
11 | (HSC)2-MC | Bisucaberin | N/Dc | 401.2 | 24 |
12 | (HSC)3-MC | DFOE | 13.4 | 601.4 | 32 |
13 | (HSC)4-MC | DFOT1 | 14.5 | 801.5 | 14 and 27 |
14 | (HSC)5-MC | DFOV1 | 15.3 | 1001.6 | This work |
Complex formation was evident at a macroscopic level based on the change in the TIC signal profiles between the free chelator mixtures and those containing Ga(III) or Zr(IV) (Fig. 4). EIC traces were assessed for individual metal-chelator species (Chart 1) with a match between experimental and theoretical36m/z values and isotope patterns supporting the formation of known and new complexes. The structures (Chart 1) depict a single representative isomer, noting the possibility of coordination isomers depending on the combination of participant hydroxamic acid units. Species were detected as intrinsically charged adducts, as single-protonated adducts or for low-concentration species, as double-protonated adducts (Chart 1, Table 2). The coordination spheres of the native complexes would be expected to contain additional ancillary ligands (aqua, hydroxyl) which as labile species would be displaced under the LC-MS acquisition conditions, as shown in previous work on metal-hydroxamic acid speciation.37
No. | Species | Adduct | RT (min) | EIC calc. | EIC obs | No. | Species | Adduct | RT (min) | EIC calc. | EIC obs |
---|---|---|---|---|---|---|---|---|---|---|---|
1-Ga | [Ga(1(3−))] | [M − 3H + Ga + H]+ | 6.1 | 627.3 | 627.3 | 1-Zr | [Zr(1(3−))]+ | [M − 3H + Zr]+ | 4.9 | 647.2 | 647.2 |
3-Ga | [Ga(3(3−))] | [M − 3H + Ga + H]+ | 9.0 | 827.4 | 827.4 | 3-Zr | [Zr(3(4−))] | [M − 4H + Zr + H]+ | 7.9 | 847.4 | 847.4 |
4-Ga | [Ga(4(3−))] | [M − 3H + Ga + H]+ | 10.5 | 1027.5 | 1027.5 | 4-Zr | [Zr(4(4−))] | [M − 4H + Zr + H]+ | 9.6 | 1047.5 | 1047.5 |
5-Ga | [Ga(5(3−))] | [M − 3H + Ga + H]+ | 11.2 | 1227.6 | 1227.6 | 5-Zr | [Zr(5(4−))] | [M − 4H + Zr + 2H]2+ | 10.8 | 624.3 | 624.3 |
8-Ga | [Ga(8(3−))] | [M − 3H + Ga + H]+ | 6.7 | 685.3 | 685.3 | 8-Zr | [Zr(8(3−))]+ | [M − 3H + Zr]+ | 5.8 | 705.2 | 705.2 |
9-Ga | [Ga(9(3−))] | [M − 3H + Ga + H]+ | 9.1 | 885.4 | 885.4 | 9-Zr | [Zr(9(4−))] | [M − 4H + Zr + H]+ | 8.2 | 905.4 | 905.4 |
10-Ga | [Ga(10(3−))] | [M − 3H + Ga + H]+ | 10.4 | 1085.5 | 1085.5 | 10-Zr | [Zr(10(4−))] | [M − 4H + Zr + H]+ | 9.8 | 1105.5 | 1105.5 |
12-Ga | [Ga(12(3−))] | [M − 3H + Ga + H]+ | 9.0 | 667.3 | 667.3 | 12-Zr | [Zr(12(3 −))]+ | [M − 3H + Zr]+ | 8.2 | 687.2 | 687.2 |
13-Ga | [Ga(13(3−))] | [M − 3H + Ga + H]+ | 11.5 | 867.4 | 867.4 | 13-Zr | [Zr(13(4−))] | [M − 4H + Zr + H]+ | 10.8 | 887.3 | 887.3 |
14-Ga | [Ga(14(3−))] | [M − 3H + Ga + H]+ | 12.8 | 1067.5 | 1067.5 | 14-Zr | [Zr(14(4−))] | [M − 4H + Zr + H]+ | 12.4 | 1087.5 | 1087.5 |
4-Ga2 | [Ga2(4(5−))]+ | [M − 5H + 2Ga]+ | 8.1 | 1093.4 | 1093.4 | ||||||
5-Ga2 | [Ga2(5(6−))] | [M − 6H + 2Ga + H]+ | 10.3 | 1293.5 | 1293.5 | ||||||
6-Ga2 | [Ga2(6(6−))] | [M − 6H + 2Ga + 2H]2+ | 11.2 | 747.3 | 747.3 | ||||||
14-Ga2 | [Ga2(14(5−))]+ | [M − 5H + 2Ga]+ | 10.4 | 1133.4 | 1133.4 |
As would be expected, the addition of Ga(III) or Zr(IV) to the co-substrate system generated high intensity signals for 1-Ga or 1-Zr, respectively, due to the presence of unreacted 1 (Fig. S3†). Signals in the co-substrate system correlating with 1:1 complexes 3-Ga, 4-Ga, 5-Ga (Fig. 5a–c) and 3-Zr, 4-Zr, 5-Zr (Fig. 5g–i) were detected, with trends in relative concentration reflecting free ligand concentrations. Complexes with 6 were not detected likely due to its presence in low concentration. In the single-substrate system, 1:1 complexes 8-Ga, 9-Ga, 10-Ga, and 8-Zr, 9-Zr, 10-Zr were detected (Fig. S3†), together with signals for the cognate macrocycles 12-Ga, 13-Ga, 14-Ga (Fig. 5d–f) and 12-Zr, 13-Zr, 14-Zr (Fig. 5j–l). For each set of related chelators (blunt-end linear multimers (1, 3, 4, 5), open-chain linear multimers (8, 9, 10), or macrocycles (12, 13, 14)), the retention time of Ga(III) and Zr(IV) complexes increased as a function of the chelator multiplicity, in accord with the increased number of methylene units in the free chelators. The product profiles of the apo-multimers and metal complexes was robust, with repeat experiments giving reproducible results.
New coordination chemistry was identified with Ga(III) and the high-multiplicity chelators 4, 5, 6, and 14, with denticities and larger cavity sizes that could conceivably enable the formation of 2:1 Ga(III):chelator complexes. Signals were observed that correlated with EIC traces set to report 4-Ga2, 5-Ga2, 6-Ga2, and 14-Ga2, with each giving an isotope pattern that matched the calculated pattern distinct for a complex with two Ga(III) ions, and a compressed isotope pattern for 6-Ga2, which was detected as the double-protonated adduct (Fig. 6). Signals correlating with 2:1 metal:chelator complexes formed between Zr(IV) and this set of chelators were not detectable. This likely reflects one or more factors including analytical detection limits and the different coordination chemistry demands of Ga(III) (hexadentate) and Zr(IV) (octadentate), which could predict a requirement for even higher denticity chelators to form 2:1 Zr(IV):chelator complexes. Decadentate HOPO–O10 has been reported to form 2:1 metal:ligand complexes with La(III) and Tb(III).38
Together, this part of the study shows the use of a chemo-enzymatic approach to generate biocombinatorial pools of metal chelators amenable for screening with a given metal ion to inform known and new coordination chemistry.
In the single-substrate system with 2, the highest-order macrocycle detected was pentameric (HSC)5-MC (14), which is used to open the following discussion and draws upon this class of siderophore synthetase containing an activation site (carboxylic acid group positioned for adenylation) proximal to a condensation site (amine group as the nucleophile for amide bond formation),13 and the X-ray crystal structure of a complex between DesD from S. griseoflavus DSM 40698 (SgDesD) and an adenylated substrate mimic, which supports the presence of an activation site.12
It was considered reasonable that the stepwise assembly of 14 in both assembly mechanisms would begin following the entry of two equivalents of 2 into the active site (Scheme 3a and a′), with one equivalent positioned for adenylate-based carboxylic acid activation and the amine group of the other substrate positioned for condensation to form 7 (Scheme 3b and b′). The circle depicting the active site shows an aspartic acid and arginine residue lining the cavity predicted to stabilise 2 in the activation site, as identified (D497, R303) in the X-ray crystal structure of SgDesD bound to an adenylated 2 mimic.12 Both of these residues are preserved in the StDesD sequence (Fig. S4†).
At the point of the production of 7 (Scheme 3b and b′), the assembly mechanism can diverge. In one sequence (M1-1), the intermediate substrate 7 could re-orient within the active site cavity (Scheme 3b and c) to position its carboxylic acid group ready for activation, with the third equivalent of 2 entering as the nucleophile to generate 8, which would similarly re-orientate (Scheme 3d and e) to continue the stepwise assembly of 9 and 10 (Scheme 3f–i) for the final intramolecular condensation reaction to produce 14 (Scheme 3j). The intramolecular condensation of the discrete linear multimers 8 (Scheme 3e) or 9 (Scheme 3g) would produce 12 and 13, respectively.
This describes a processive assembly mechanism whereby the growing polymer chain is shunted around the active site cavity to position itself for carboxylic acid activation, with equivalents of 2 entering as the nucleophile (Scheme 3a–j). This first mechanism (M1) as applied to the two-substrate system with 1 and 2 (M1-2) produced the blunt-end multimers 3, 4, 5, and 6, which could be generated from the respective carboxylate-adenylated substrates 2 (Scheme 3k), 7 (Scheme 3l), 8 (Scheme 3m) or 9 (Scheme 3n) undergoing condensation with 1 as the nucleophile.
Returning to the production of 7 in the single-substrate system (Scheme 3b,b′), a different sequence (M2-1) could instead ascribe 7 (rather than 2) as the nucleophile. This would require 7 be re-positioned to the condensation site (Scheme 3c′) for reaction with adenylated 2 preserved in the activation site. This alternative logic would generate 8–10 (Scheme 3d′–h′) with 10 ultimately positioned (Scheme 3i′) as in the first sequence (M1-1) for intramolecular condensation to generate 14 (Scheme 3j′). This alternative sequence (M2-1) might imply the requirement for further substrate reorganisation to position the carboxylic acid group of 8 (Scheme 3d′) or 9 (Scheme 3f′) in the activation site for intramolecular condensation to generate 12 and 13, respectively.
In the two-substrate system in both sequences (M1-2, M2-2), blunt-end 3 as generated from 1 and 2, involves a unique condensation reaction (described earlier) between adenyl-activated 2 and 1 as the nucleophile (Scheme 3k and k′). Maintaining 3 as the nucleophile would require its re-organisation to enable condensation with the incoming equivalent of adenylated 2 (Scheme 3o and p), with this continued logic generating 4, 5 and 6 (Scheme 3q–t).
There was a difference in the multiplicity limit of the single- and two-substrate systems. The pentameric macrocycle 14 was the highest-order multimer detected in the single-substrate system with 2, with heptameric linear 6 detected in the system using 1 and 2. This might suggest capacity for longer-chain linear substrates to flex beyond the active site cavity at the point of entry and/or at the exit point for the chain-extended products.
These two sequences are united in proposing the processive movement of the growing polymer chain within the active-site cavity but differ in the positional assignment of co-substrates for activation or condensation. M1-1 assigns units of 2 as the diffusible nucleophile and the growing 2 multimer chain undergoing re-positioning for carboxylic acid activation, while M2-1 assigns the growing 2 multimer chain as the nucleophile and units of 2 entering to undergo carboxylic acid activation. Elements of these pathways could coalesce, with one example 9 being formed from the condensation between 7 positioned for carboxylate activation (Scheme 3c) with 7 positioned as the nucleophile (Scheme 3c′). It may be that the overall sequences or parts thereof are not mutually exclusive and operate to variable extents in parallel. The overarching proposition is of a processive mechanism allowing for the energetically preferred diffusion of relatively low-molecular-weight compounds (ATP, AMP, PPi, 2) and the growing multimer chains maintained within or extending beyond the active site (when exceeding a volume capacity limit) with products expelled along the way.
The wide product profile observed experimentally (3–10, 12–14) suggests the StDesD active site has sufficient elasticity to build and accommodate flexible (rubbery) multimers of variable length as intermediates or final products.
The integrity of chelator function was established by incubating each biocombinatorial pool with excess Ga(III) or Zr(IV) which formed known/expected 1:1 complexes, and in the case of high-denticity chelators, some new 2:1 Ga(III):chelator complexes (Scheme 4). Applications of 2:1 68Ga(III):chelator complexes could be useful as positron emission tomography (PET) imaging partners of targeted radiopharmaceutical agents, where the molar activity of the radiolabelled compound could be increased enabling a reduction in mass dose, which would reduce the toxicity of the agent itself and the risk of receptor blocking effects.
Scheme 4 Biocombinatorial pools of hydroxamic acid chelators (3–14) in cartoon form produced by StDesD-mediated synthesis using as substrates 1 and 2 (left path; compounds contain a terminal amine group) or 2 (right path; linear compounds contain flanking amine and carboxylic acid groups) and coordination complexes characterised in situ upon incubation of each pool with Ga(III) or Zr(IV). The structure of StDesD was produced by AlphaFold.39 |
The array of multimeric chelators generated from simple substrates prompted consideration of assembly mechanisms which led to the posit of a processive intra-cavity assembly mechanism, with the growing multimer chain being shunted around the enzyme cavity to conserve the structural and functional integrity of the activation and condensation sites. Ongoing work using structural and predictive biology, site-directed mutagenesis, and molecular dynamics calculations is underway to further interrogate this mechanism. It could be that acidic and basic amino acid residues are systematically patterned around the active site cavity to accommodate the intra-cavity repositioning of 2 multimers.
The work highlights the general scope in using recombinant biosynthetic enzymes of natural products together with synthetically tractable substrates as a facile chemoenzymatic approach to generate biocombinatorial pools of structurally diverse analogues to screen for function.
Ac-CoA | Acetyl-coenzyme A |
AMP | Adenosine monophosphate |
ATP | Adenosine triphosphate |
BIS | Bisucaberin |
DFOB | Desferrioxamine B |
EIC | Extracted ion chromatogram |
HAC | N-Hydroxy-N-acetylcadaverine |
HC | N-Hydroxycadaverine |
HSC | N-Hydroxy-N-succinylcadaverine |
L | Linear |
LC-MS | Liquid chromatography-mass spectrometry |
MC | Macrocycle |
PET | Positron emission tomography |
PPi | Pyrophosphate |
SpDesD | Streptomyces pilosus DesD |
StDesD | Salinispora tropica CNB-440 DesD |
Suc-CoA | Succinyl-coenzyme A |
TIC | Total ion current |
t R | Retention time |
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4sc04888a |
‡ Hydroxamic acid-based chelators and metal complexes under the acidic conditions of the LC-MS system are routinely detected as positively charged adducts. Studies on similar systems using LC-MS in positive and negative ion detection modes22 showed signals were significantly weaker in negative ion mode than positive ion mode and did not reveal core species beyond those detected by positive ion mode. |
§ The co-substrate system generated product set 3–6 (formed from 1 and 2) and product set 7–10, 12–14 (formed from 2), with the latter set also formed in the single-substrate system. For simplicity, the major discussion of the co-substrate system focussed on the system-exclusive products 3–6. |
This journal is © The Royal Society of Chemistry 2025 |