Davide Mancinotti, Karen Michiko Frick and Fernando Geu-Flores*
Section for Plant Biochemistry and Copenhagen Plant Science Centre, Department of Plant and Environmental Sciences, Faculty of Science, University of Copenhagen, Denmark. E-mail: feg@plen.ku.dk; Tel: +45-60571982
First published on 18th March 2022
Covering: up to 2022
Quinolizidine alkaloids (QAs) are a class of alkaloids that accumulate in a variety of leguminous plants and have applications in the agricultural, pharmaceutical and chemical industries. QAs are notoriously present in cultivated lupins (Lupinus spp.) where they complicate the use of the valuable, high-protein beans due to their toxic properties and bitter taste. Compared to many other alkaloid classes, the biosynthesis of QAs is poorly understood, with only the first two pathway enzymes having been discovered so far. In this article, we review the different biosynthetic hypotheses that have been put forth in the literature (1988–2009) and highlight one particular hypothesis (1988) that agrees with the often ignored precursor feeding studies (1964–1994). Our focus is on the biosynthesis of the simple tetracyclic QA (−)-sparteine, from which many of the QAs found in lupins derive. We examine every pathway step on the way to (−)-sparteine and discuss plausible mechanisms, altogether proposing the involvement of 6–9 enzymes. Together with the new resources for gene discovery developed for lupins in the past few years, this review will contribute to the full elucidation of the QA pathway, including the identification and characterization of the missing pathway enzymes.
In the cultivated lupins, QAs complicate the end-use of the valuable high-protein grain, as they are unpalatable and can cause acute anticholinergic poisoning in humans and animals.7,8 Accordingly, one of the major breeding aims for lupins is to reduce seed QAs to consistently low levels. This aim has only been met with some success,4 as the available low-QA cultivars can still unpredictably exceed the industry threshold for utilization as food or feed (0.01% and 0.02% dry weight, respectively).9 In addition, low-QA cultivars display a higher susceptibility to herbivores,10,11 which is consistent with a proposed role in plant defense.12
While QAs are a source of concern among lupin farmers and breeders, some QAs exhibit pharmacological activities of interest for the medical field. For example, (−)-sparteine [(−)-8] has both antiarrhythmic13 and anticonvulsant properties,14 and (+)-matrine (6) has proven activity against several types of cancer, including breast15 and ovarian cancers.16 In addition, (−)-cytisine (12) is effective in aiding smoking cessation and has been successfully commercialized for that purpose.17,18 Apart from its pharmaceutical applications, (−)-sparteine [(−)-8] and its (+) version [(+)-8] have found use in the field of chemical synthesis, where they are highly valued as chiral ligands in asymmetric synthesis protocols.19,20
Despite the importance of QAs in agriculture and their potential applications in medicine and chemistry, very little is known about how QAs are biosynthesized. Several biosynthetic pathway hypotheses have been put forth during the last few decades; however, many of them are not in accordance with the precursor feeding experiments carried out in the 1970s and 80s using isotopically labelled compounds. Here, we review these foundational feeding experiments and subsequently describe one biosynthetic hypothesis that fits these often-ignored constraints. Our focus is on the biosynthesis of the tetracyclic QA core represented by (−)-sparteine [(−)-8] from which many QAs are thought to be derived. In our description of the likely QA pathway, we highlight both stereochemical and mechanistic considerations. Used in combination with the advanced genomic and transcriptomic resources recently developed for lupins, this review will contribute to the full elucidation of the QA pathway, including the identification and characterization of the missing pathway enzymes.
The simplest classification of QAs relates to the number of joined 6-membered rings present in their structures. In this classification, QAs can be bicyclic [e.g. (−)-lupinine (4) and (+)-epilupinine (5)], tricyclic [e.g. (−)-angustifoline (11) and (−)-cytisine (12)], or tetracyclic [e.g. (−)-sparteine [(−)-8] and (+)-matrine (6)]. The tetracyclic QAs can be further divided into two main classes: the sparteine-like QAs [e.g. (−)-sparteine [(−)-8] and (+)-lupanine (9)] and the matrine-like QAs [e.g. (+)-matrine (6) and (+)-matrine N-oxide (7)] (Fig. 1B). Although most QAs are aliphatic, some tri- and tetracyclic QAs contain a pyridone ring [e.g. (−)-anagyrine (10) and (−)-cytisine (12)], and these are sometimes classified separately (purple box in Fig. 1B). In addition, there are a number of irregular QAs with divergent structural features, such as (−)-camoensidine (13) (Fig. 1C).
With the exception of some irregular QAs, the backbones of all legume QAs are exclusively derived from the amino acid L-lysine (2), which donates two C5 units in the case of the bicyclic QAs and three C5 units in the case of the tetracyclic QAs.25,26 The tricyclic QAs are derived from the sparteine-like tetracyclic QAs by oxidative ring-cleavage reactions (Fig. 1B).27,28 Notably, the bipiperidine alkaloid ammodendrine (3) co-occurs with QAs in a variety of QA-containing species21 and is therefore likely to be an early by-product of the biosynthesis of QAs (Fig. 1B).
Typically, genistoid legumes accumulate mixtures of QAs belonging to several different sub-classes. For example, lupins accumulate bicyclic, tricyclic and sparteine-like QAs, while Sophora species accumulate tricyclic, sparteine-like, and matrine-like QAs.1 The accumulation of QAs has been most extensively studied in lupins. Within these, each species features a characteristic QA profile,3 with the exact composition varying according to tissue type,3,29 accession,30 developmental stage,31 growth conditions,30 and time of the day.29 The most common QAs in lupins are those derived from (−)-sparteine [(−)-8] such as (+)-lupanine (9) and (−)-angustifoline (11) (Fig. 1B). This review focuses on the biosynthetic steps that are likely to be required for converting the amino acid precursor L-lysine (2) into (−)-sparteine [(−)-8] (blue box in Fig. 1B).
Despite the number of possible stereoisomers, most QA-containing species accumulate only one or two stereoisomeric forms of any given QA, with one of the forms being predominant. In addition, the predominant forms of different QAs in a given species tend to share a particular backbone, suggesting that they belong to a biosynthetic series. This is particularly well documented among lupins, where the most commonly encountered tetracyclic QAs belong to the (−)-sparteine series (6R,7S,9S,11S backbone).3 Exceptions include varieties of L. argenteus, where the (−)-α-isosparteine backbone (6R,7S,9S,11R) predominates,3,32 and varieties of L. sericeus and L. pusillus, where the (−)-β-isosparteine backbone (6R,7R,9R,11R) predominates.3,33,34 The enantiomeric purity of a given QA tends to be high but varies between species. For example, in L. angustifolius and L. polyphyllus, only (+)-lupanine (9) has been reported,35–37 whereas both (+) and (−) enantiomers have been reported in L. albus, albeit with an excess of the (+) form.35
The fact that specific QA backbones predominate in individual plant species suggests that QA backbone formation occurs under stereoselective control. Thus, any proposed biosynthetic hypothesis must account for the selective formation and subsequent preservation of the four aforementioned stereocenters.
Radioactive tracer studies carried out in the 1960s using L. luteus and L. angustifolius provided the first evidence that the carbon skeleton and nitrogen atoms of the sparteine-like QAs are derived exclusively from L-lysine (2).25,38 These early results were confirmed and refined in the 1980s using stably labelled (non-radioactive) precursors and nuclear magnetic resonance (NMR) spectroscopy.39 The exact mode of incorporation of L-lysine (2) is well understood. Two key experiments were the feeding of DL-[2-14C]lysine† to L. angustifolius25 and the feeding of DL-[6-14C]lysine to the same species.26 In these experiments, the single label from either of the two precursors became distributed over the same six carbon atoms of (+)-lupanine (9): C2, C6, C10, C17, C11 and C15 (Fig. 3A). This observation implies that C2 and C6 of L-lysine (2) must become equivalent at some point during biosynthesis. Furthermore, it suggests that (+)-lupanine (9) is made from three discrete five-carbon (C5) units derived from L-lysine (2): two C5 units that constitute the outer rings (rings A and D) and a third C5 unit that accounts for the carbon atoms in between these rings (including the methylene bridge between rings B and C) (Fig. 3B). Taken together, these studies suggest that the skeleton of the sparteine-like QAs originates from three L-lysine (2) molecules via a symmetrical C5 intermediate (i.e. with C2v symmetry).
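The label-scrambling argument can be illustrated with a short Python sketch. It only restates the reasoning above: the unit names and the `labelled_positions` helper are hypothetical, and the non-symmetric branch is an arbitrary counter-case included for contrast, not a proposed mechanism.

```python
# Terminal carbons of the three C5 units of (+)-lupanine, as listed in the text.
# The dictionary keys are illustrative names, not established nomenclature.
C5_UNIT_ENDS = {
    "ring_A_unit": ("C2", "C6"),
    "middle_unit": ("C10", "C17"),
    "ring_D_unit": ("C15", "C11"),
}


def labelled_positions(lysine_label, symmetric_intermediate=True):
    """Return the lupanine carbons expected to carry label from lysine C2 or C6."""
    if lysine_label not in ("C2", "C6"):
        raise ValueError("expected 'C2' or 'C6'")
    labelled = set()
    for first_end, second_end in C5_UNIT_ENDS.values():
        if symmetric_intermediate:
            # In a symmetrical C5 intermediate the two ends are equivalent,
            # so the label is spread over both terminal carbons of each unit.
            labelled.update((first_end, second_end))
        else:
            # Hypothetical counter-case: each lysine end maps onto one unit end
            # only. Which end is chosen here is an arbitrary assumption.
            labelled.add(first_end if lysine_label == "C2" else second_end)
    return labelled
```

With a symmetrical intermediate, both precursors give the same six positions, as observed; in the hypothetical non-symmetric case each precursor would label only three.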
Fig. 3 Patterns of incorporation of isotopically labelled precursors into sparteine-like QAs and into the bicyclic QA lupinine (4). (A) Incorporation of DL-[2-14C]lysine25 and DL-[6-14C]lysine26 into (+)-lupanine (9) in L. angustifolius. (B) Model specifying the incorporation of three C5 units derived from L-lysine (2) into (+)-lupanine (9) (based on feeding experiments shown in (A)). (C) Incorporation of (1-15N, 1-13C)cadaverine into (+)-lupanine (9) in L. angustifolius.27,39 The grey bonds in bold indicate how the original 13C–15N bond from labelled cadaverine (16) is incorporated intact into labelled (+)-lupanine (9). (D) Incorporation of (R)-(1-2H)cadaverine and (S)-(1-2H)cadaverine (depicted as a single molecule) into (+)-lupanine (9) in L. angustifolius and into (−)-sparteine [(−)-8] in L. luteus.39 (E) Incorporation of [2-14C]Δ1-piperideine and [6-14C]Δ1-piperideine (depicted as a single molecule) into (+)-lupanine (9) in L. angustifolius.42 (F) Incorporation of (3,3-2H2)cadaverine into (+)-lupanine (9) in L. angustifolius.43 (G) Incorporation of (2,2,4,4-2H4)cadaverine† into (−)-sparteine [(−)-8] in L. luteus.44 (H) Incorporation of (R)-(2-2H)cadaverine and (S)-(2-2H)cadaverine (depicted as a single molecule) into (−)-lupinine (4) in L. luteus.44
The most obvious candidate for this C5 symmetric intermediate is cadaverine (16), a diamine that can be formed directly from L-lysine (2) by decarboxylation. Indeed, feeding of (1-15N, 1-13C)cadaverine to L. angustifolius caused the labelling of (+)-lupanine (9) at the same six carbons as when feeding DL-[2-14C]lysine or DL-[6-14C]lysine (Fig. 3C).27,39 This experiment also showed that both nitrogen atoms in the tetracyclic QAs derive from cadaverine (16) and therefore, ultimately, from L-lysine (2). The three L-lysine/cadaverine units that make up the tetracyclic QAs possess a total of six nitrogen atoms. Since only two of these are retained in the final QAs, four deamination events must occur. Important clues about these deamination events emerged when observing that the 13C–15N bond of labelled cadaverine was incorporated intact into (+)-lupanine (9) only at positions C2–N1 and C15–N16 (grey bonds in bold in Fig. 3C).27,39 This implies that the cadaverine unit that gives rise to ring A is deaminated at a position that later becomes C6 in (+)-lupanine (9). Likewise, the cadaverine unit giving rise to ring D must be deaminated at a position that later becomes C11. Finally, it also implies that the middle cadaverine unit must lose both of its nitrogen atoms by undergoing deamination at the positions that later become C10 and C17.
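The nitrogen bookkeeping in this paragraph can be written out as a small Python sketch. The data structure and the `nitrogen_balance` helper are hypothetical names chosen for illustration; the assignments themselves are taken from the intact C2–N1 and C15–N16 bonds described above.

```python
# Fate of the nitrogen attached to each terminal carbon of the three
# cadaverine units: the name of the retained nitrogen, or None if that
# end is deaminated. Keys are illustrative names only.
NITROGEN_FATE = {
    "ring_A_unit": {"C2": "N1", "C6": None},
    "middle_unit": {"C10": None, "C17": None},
    "ring_D_unit": {"C15": "N16", "C11": None},
}


def nitrogen_balance(fates):
    """Return (total nitrogens, retained nitrogens, deaminated carbons)."""
    total = sum(len(ends) for ends in fates.values())
    retained = [n for ends in fates.values() for n in ends.values() if n is not None]
    deaminated = [c for ends in fates.values() for c, n in ends.items() if n is None]
    return total, retained, deaminated


total, retained, deaminated = nitrogen_balance(NITROGEN_FATE)
```

Six nitrogen atoms enter, two are retained, and the four remaining ends (C6, C10, C11 and C17) are the deaminated positions.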
The deamination of terminal, linear amines such as cadaverine (16) is typically oxidative, converting the amine into the corresponding aldehyde with concomitant loss of one of the two hydrogens at the alpha carbon (Cα).40 The stereochemistry of hydrogen loss in the four deaminations in QA biosynthesis has been revealed by studying the incorporation of monodeuterated cadaverine precursors labelled at Cα.28,39,41 It was consistently observed in L. angustifolius and L. luteus that the deuterium from (R)-(1-2H)cadaverine enters the sparteine-like QA backbone at C6, C11 and C17, but not at C10 (Fig. 3D).39 This implies that the pro-S hydrogen is the one that is specifically lost from the Cα atoms later to become C6, C11, and C17. In addition, the deuterium from (S)-(1-2H)cadaverine was shown to be incorporated at position C10 (Fig. 3D).39 This confirms that the fourth deamination, which affects only the central C5 unit, proceeds through a different route involving the loss of the pro-R hydrogen from the Cα later to become C10.
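The stereochemical outcome of the four deaminations can be summarized in a short Python sketch. It is a bookkeeping aid only, under the assumption that the deuterium in (R)-(1-2H)cadaverine occupies the pro-R position at Cα and that in (S)-(1-2H)cadaverine the pro-S position; the function name is hypothetical, and the non-deaminated carbons C2 and C15 are deliberately not modelled.

```python
# Hydrogen lost from Calpha at each deaminated position, as inferred in the text.
HYDROGEN_LOST = {
    "C6": "pro-S",
    "C11": "pro-S",
    "C17": "pro-S",
    "C10": "pro-R",
}


def deuterium_retained_at(cadaverine_configuration):
    """Deaminated carbons that keep the label from (R)- or (S)-(1-2H)cadaverine."""
    if cadaverine_configuration not in ("R", "S"):
        raise ValueError("expected 'R' or 'S'")
    labelled_hydrogen = "pro-" + cadaverine_configuration
    # The label survives wherever the *other* hydrogen is the one removed.
    return {
        carbon
        for carbon, lost in HYDROGEN_LOST.items()
        if lost != labelled_hydrogen
    }
```

The (R)-labelled precursor is predicted to label C6, C11 and C17, and the (S)-labelled precursor only C10, which matches the pattern in Fig. 3D.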
The fact that C6, C11, and C17 derive from three different cadaverine molecules that are deaminated with the same stereoselective mechanism suggests a direct deamination of cadaverine (16) into 5-aminopentanal (26) as the sole mode of entry of cadaverine (16) into the pathway. 5-Aminopentanal (26) cyclizes spontaneously via intramolecular Schiff-base formation to yield Δ1-piperideine (17). When L. angustifolius was fed with cyclic Δ1-piperideine labelled at either of the two different carbon atoms adjacent to the nitrogen atom, the label was incorporated into all three C5 units of the sparteine-like QAs (Fig. 3E).42 Specifically, the aldimine carbon (deaminated carbon; asterisk in Fig. 3E) became C6, C11, and C17, and the amine carbon (bold dot in Fig. 3E) became C2, C10 and C15, as expected (compare with Fig. 3D). This regiospecific incorporation also implies that Δ1-piperideine (17) cannot be converted into any compound with C2v symmetry later during biosynthesis.
The experiments described above suggest that the sparteine-like QAs are derived from three units of Δ1-piperideine (17) (or a close, equally asymmetric derivative). One interesting insight into the coupling of these C5 units emerged from observing the relative extent of labelling of the different units in the final QAs. Consistently, the incorporation of label from [2-14C]Δ1-piperideine, [6-14C]Δ1-piperideine, and DL-[6-14C]lysine was more pronounced in the C5 unit corresponding to ring D than in the other two C5 units.26 This indicates that this C5 unit is subject to a lower degree of endogenous dilution before being incorporated in the QA backbone, which suggests that it is added last during biosynthesis. In these experiments, the incorporation of label was equal for the other two C5 units, indicating that the dimerization of Δ1-piperideine (17) is a plausible first step in the coupling of C5 units. The non-enzymatic dimerization of Δ1-piperideine (17) occurs readily at slightly basic pH values (discussed in detail in Section 6).
Finally, a few extra constraints have been revealed by experiments using deuterated cadaverine (16) molecules labelled at positions different than Cα. When cadaverine (16) was doubly deuterated at the γ position and was fed to L. angustifolius, both deuterium atoms were found at positions C4, C8, and C13 of (+)-lupanine (9), as expected (Fig. 3F).43 This means that no direct chemical modifications occur at the carbons that will become C4, C8, and C13 during the entire biosynthesis. By contrast, when cadaverine (16) was fully deuterated at the β positions and was fed to L. luteus, the deuterium atoms were found at C3, C5, C12, and C14 of (−)-sparteine [(−)-8], but not at the bridgehead carbons C7 or C9 (Fig. 3G).44 This implies that both hydrogen atoms at the positions that will become the bridgehead carbons must be lost during biosynthesis. The implications of this on the proposed biosynthetic hypotheses will be discussed in Section 6. One last experiment with cadaverine molecules labelled at the β position is worth mentioning. In this experiment, (R)- and (S)-(2-2H)cadaverine were fed separately to L. luteus, and the incorporation into the bicyclic QA (−)-lupinine (4) was assessed (Fig. 3H). Interestingly, only the deuterium from (R)-(2-2H)cadaverine was retained at C1 of (−)-lupinine (4) (Fig. 3H).44 Assuming that lupinine is a side product of the pathway towards the sparteine-like QAs and that these pathways diverge after the formation of the quinolizidine core (1), the selective loss of the pro-S hydrogen at the position later to become the bridgehead carbon C7 in e.g. (−)-sparteine [(−)-8] [equivalent to C1 in (−)-lupinine (4)] must occur before the pathways diverge. Mechanistic implications of this will be discussed in Section 6.
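The results for cadaverine labelled at the β and γ positions can be collected in a small Python lookup. This is a sketch that only tabulates the observations reported above; the dictionary layout and the `observed_deuterium` helper are hypothetical.

```python
# QA carbons derived from each cadaverine position class, following the
# numbering used in the text for the sparteine-like backbone.
CARBONS_FROM = {
    "beta": ["C3", "C5", "C7", "C9", "C12", "C14"],
    "gamma": ["C4", "C8", "C13"],
}

# Positions at which the deuterium label was reported to be absent.
LABEL_LOST_AT = {
    "beta": {"C7", "C9"},  # bridgehead carbons
    "gamma": set(),
}


def observed_deuterium(position_class):
    """Carbons at which deuterium from the given label class was found."""
    lost = LABEL_LOST_AT[position_class]
    return [c for c in CARBONS_FROM[position_class] if c not in lost]
```

The only β-derived carbons that lose their label are the two bridgeheads, which is the constraint any proposed mechanism has to explain.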
The four remaining hypotheses postulate that the first common tetracyclic intermediate is the hypothetical di-iminium cation (20), which can yield sparteine by sequential reduction (Fig. 4). One of these hypotheses was inspired by the 17-oxosparteine model and proposes the single-enzyme conversion of cadaverine (16) into the di-iminium cation (20) via a series of enzyme-bound intermediates (Fig. 4, Saito & Murakoshi, 1995).46 However, the proposed mechanism (not shown) conflicts with the observed labelling patterns of QAs derived from [2-14C]- and [6-14C]Δ1-piperideines, which reveal that the cadaverine carbon that is initially oxidized ends up at positions 6, 11, and 17 of sparteine (Fig. 3E). By contrast, this model predicts that the initially oxidized cadaverine carbon ends up at positions 6, 11, and 10.
The three remaining hypotheses postulate the oxidation of cadaverine (16) to Δ1-piperideine (17) and agree on the stereoselective dimerization of Δ1-piperideine (17) into tetrahydroanabasine (18) as the next step (Fig. 4). One hypothesis from 2009 proposed that the third C5 unit is supplied to the pathway as fully reduced piperidine (23) rather than Δ1-piperideine (17) (Fig. 4, Dewick, 2009).47 However, feeding studies have shown that the third Δ1-piperideine (17) molecule must be incorporated in a regiospecific manner that is not compatible with prior conversion into a symmetrical intermediate such as piperidine (23).42
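The way these two hypotheses fail against the Δ1-piperideine feeding data can be expressed as a simple set comparison in Python. This is a sketch under stated assumptions: the prediction for the 1995 model is the one quoted above, whereas the prediction for the 2009 model is our own reading of what a symmetrical piperidine would imply for the third C5 unit (label expected at both C11 and C15). All names are hypothetical.

```python
# Positions observed for the oxidized (aldimine) carbon of Delta1-piperideine.
OBSERVED = {"C6", "C11", "C17"}

# Predicted positions of the initially oxidized carbon under each hypothesis.
PREDICTED = {
    # Stated in the text: positions 6, 11 and 10.
    "Saito & Murakoshi, 1995": {"C6", "C11", "C10"},
    # Assumption: a symmetrical third unit would scramble label over both
    # of its ends (C11 and C15) instead of labelling C11 only.
    "Dewick, 2009": {"C6", "C17", "C11", "C15"},
}


def mismatches(model):
    """Return (positions predicted but not observed, observed but not predicted)."""
    predicted = PREDICTED[model]
    return predicted - OBSERVED, OBSERVED - predicted


def is_consistent(model):
    return PREDICTED[model] == OBSERVED
```

Under these assumptions neither model reproduces the observed pattern: the first places label at C10 instead of C17, and the second adds unobserved label at C15.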
The two remaining models, both proposed by Golebiewski & Spenser in 1988,26 are the only ones generally consistent with the body of evidence from feeding studies presented in the previous section. The first of these two models suggests that tetrahydroanabasine (18) reacts with Δ1-piperideine (17) to form isotripiperideine (22), which is then modified to yield the di-iminium cation (20) through an unconventional mechanism that eliminates ammonia as the last step (Fig. 4, Golebiewski & Spenser, 1988-I).26,42 The second of these two models postulates the conversion of tetrahydroanabasine (18) to a bicyclic quinolizideine intermediate (19) via hydrolysis, oxidation, and intramolecular Schiff base formation (Fig. 4, Golebiewski & Spenser, 1988-II). This quinolizideine intermediate (19) is then coupled to Δ1-piperideine (17) in a reaction that is analogous to the earlier dimerization of Δ1-piperideine (17). The tetracyclic di-iminium cation (20) is then produced spontaneously by intramolecular Schiff base formation.26
Although both of these models are in general accordance with the precursor feeding studies, the authors favored the one involving the bicyclic quinolizideine intermediate (19) (Fig. 4, Golebiewski & Spenser, 1988-II). As noted by them, this model offers a more satisfactory explanation for the differential incorporation of radioactive precursors into the three C5 units of (+)-lupanine (9) as defined in Fig. 3B. Indeed, a significantly higher incorporation into the third C5 unit (leading to ring D) was observed when feeding L. angustifolius with 14C-labelled DL-lysine or Δ1-piperideine (17) and subjecting the resulting (+)-lupanine (9) to controlled chemical degradation.26 Such distribution of label is better explained by a model in which the third Δ1-piperideine (17) unit is incorporated at a much later biosynthetic step than the dimerization of Δ1-piperideine (17), thus suffering less endogenous dilution en route to the (+)-lupanine (9) product.
Like ODC, LDC is a pyridoxal phosphate (PLP)-dependent enzyme that can be inhibited by α-difluoromethylornithine.48 Thus, a mechanism similar to that of ODC can be postulated, namely a conventional PLP-dependent decarboxylation with retention of configuration (Fig. 5). The retention of configuration implies that the quinonoid intermediate (24) is protonated at the Re face so that the new proton ends up occupying the same position as the original carboxylate group (Fig. 5).49 Notably, feeding studies using deuterated versions of L-lysine (2) and cadaverine (16) are consistent with the retention of configuration. In particular, the label in (R)- or (S)-(1-2H)cadaverine leads to different patterns of labelling of the tetracyclic QAs (Fig. 3D),39 and the label in L-(2-2H)lysine gives the same labelling pattern as (S)-(1-2H)cadaverine.39 Accordingly, it is very likely that the LDCs cloned from L. angustifolius, S. flavescens, and E. koreensis operate with retention of configuration, although this remains to be shown experimentally.
Recently, a CAO that is tightly co-regulated with LDC was cloned from L. angustifolius. This CAO was shown to possess the three highly conserved L-His residues that chelate the catalytically important Cu2+ ion in canonical CAOs. The enzyme also featured the highly conserved L-Tyr residue that is auto-catalytically converted to a crucial topaquinone residue (25) prior to entering the catalytic cycle. When heterologously expressed in E. coli, this CAO was able to oxidize cadaverine (16) with an unusually high affinity,52 thus supporting a proposed role in QA biosynthesis.
In the oxidation of cadaverine (16) to 5-aminopentanal (26), the carbon atom that is being oxidized loses one of its two protons. According to the feeding experiments described in Section 4, the lost proton must be the pro-S proton, specifically. The evidence can be summarized in two parts as follows: (1) the 14C label at the aldimine carbon of Δ1-piperideine (17) results in the specific labelling of C6, C11, and C17 of (+)-lupanine (9), indicating that these three carbons correspond to the oxidized carbon of 5-aminopentanal (26) (Fig. 3E);42 (2) a single 2H label at Cα of cadaverine (16) gets incorporated into carbons C6, C11, and C17 only if the label is placed at the pro-R position (Fig. 3D).39 The specific abstraction of the pro-S proton from Cα of cadaverine (16) is represented in Fig. 6 in the context of a plausible oxidation mechanism catalyzed by CAO.
The two newly formed stereocenters in tetrahydroanabasine (18), C2 and C3′, correspond to carbons C6 and C7 of (−)-sparteine [(−)-8], respectively (Fig. 2A), with C6 tracing back to the imine carbon of Δ1-piperideine (17) that receives the nucleophilic attack during dimerization (Fig. 7A). It is very plausible that the final stereochemistry at C6 of (−)-sparteine [(−)-8] is established at this early step, given that the proton at this position is retained all the way from labelled cadaverine (pro-R proton at Cα) to (−)-sparteine [(−)-8] in feeding experiments (Fig. 3D).39 If so, the configuration of C2 in the tetrahydroanabasine intermediate (18) should be restricted to R. By contrast, the proton at C7 in (−)-sparteine [(−)-8] is not derived from cadaverine (16) (Fig. 3G),44 and this allows for a potential change in configuration of C3′ in tetrahydroanabasine (18) upon proton loss during its conversion to (−)-sparteine [(−)-8]. However, in the case of the bicyclic QA (−)-lupinine (4), which often co-occurs with (−)-sparteine [(−)-8] in lupins, the corresponding proton [pro-R proton at Cβ in cadaverine (16)] is retained at position C1, which is in the R configuration (Fig. 3H).44 Assuming that the pathways toward the bicyclic and tetracyclic QAs diverge after the dimerization step, this provides a crucial insight into the stereochemistry of the dimerization product at this second stereocenter. Altogether, the feeding experiments strongly suggest that only the (2R,3′R) form of tetrahydroanabasine (18) is used for QA biosynthesis in lupins.‡
We envision two possible ways for the exclusive use of (2R,3′R)-tetrahydroanabasine (18) in QA biosynthesis in lupins. First, if the (2R,3′R) and (2S,3′S) forms are in equilibrium with the piperideine monomers (17 and 27), then a stereospecific enzyme metabolizing the (2R,3′R) form exclusively would be enough to ensure a complete conversion into the next pathway intermediate (Fig. 7B). However, we favor a different possibility in which the dimerization is actually an enzymatic process that only produces the (2R,3′R) form. The involvement of an enzyme could explain the often ignored fact that only the pro-R proton at Cβ of cadaverine (16) appears at C1 of (−)-lupinine (4) in feeding experiments (Fig. 3H).44 Indeed, we postulate that an enzyme removes the pro-S proton from Cβ during the first part of the reaction, namely, during the tautomerization to Δ2-piperideine (27) (Fig. 7C). It is not unusual for a spontaneous isomerization to be catalyzed by an enzyme, as enzymes can increase the speed of spontaneous reactions that might otherwise occur at sub-optimal rates in an organism.55 Subsequent coupling of Δ2-piperideine (27) to an unreacted Δ1-piperideine molecule (17) by the same or by a different enzyme would complete the dimerization.§ In order to yield the (2R,3′R) form (18), the coupling enzyme must coordinate an attack from the Si face of Δ2-piperideine (27) to the Si face of Δ1-piperideine (17) (Fig. 7D).¶
Fig. 8 Postulated formation of the quinolizideine intermediate (19) and branching towards the bicyclic QAs. (A) (2R,3′R)-Tetrahydroanabasine (18) undergoes hydrolysis, oxidative deamination, and formation of a new Schiff base to give the quinolizideine intermediate (19). The oxidative deamination occurs with concomitant loss of the pro-R proton and must be enzymatic.39 By contrast, the hydrolysis and the formation of a new Schiff base may be spontaneous under physiological conditions. (B) Proposed conversion of the quinolizideine intermediate (19) to (−)-lupinine (4). The stereochemistry of hydride donation is indicated (inferred from precursor feeding studies).56 The sequence shown indicates a reduction of the imine carbon followed by a reduction of the aldehyde carbon; however, the precise order remains unknown.
With respect to the hydrolysis and the formation of a new Schiff base, both of these processes are likely part of chemical equilibria and may occur spontaneously under physiological conditions. Whether enzymes are used in vivo to increase the rate of these equilibrations remains unknown.
The quinolizideine intermediate (19) is likely to be the last common intermediate between the tetracyclic and the bicyclic QAs. From here, two consecutive, enzyme-catalyzed reductions would afford the bicyclic QA (−)-lupinine (4) (Fig. 8B). The stereochemistry of these reductions has been determined by feeding L. luteus with (R)-(1-2H)cadaverine and (S)-(1-2H)cadaverine and analyzing the resulting (−)-lupinine (4) via 2H-NMR (not shown in Fig. 3).56 The results show that the iminium carbon is reduced from the Si face, whereas the aldehyde carbon is reduced from the Re face. The precise order of these reactions remains to be investigated.
The coupling reaction is similar to the dimerization of Δ1-piperideine (17) described in sub-Section 6.3. By analogy, it has been proposed that the quinolizideine intermediate (19) undergoes base-catalyzed tautomerization to give an enamine that then attacks a protonated Δ1-piperideine (17) molecule (Fig. 9A).26 In the case of the dimerization of Δ1-piperideine (17) (the first coupling event), we noted earlier that the deprotonation at C3 necessary to form Δ2-piperideine (27) occurred stereoselectively, thus strongly arguing for the involvement of an enzyme (Fig. 7C). Selective proton abstraction could be asserted given that the unreacted proton was retained in (−)-lupinine (4) (hydrogen at C1 in Fig. 3H).44 In the case of this second coupling event, however, both of the analogous hydrogens are lost on the way to (−)-sparteine [(−)-8] (hydrogens at C9 in Fig. 3G),44 thus making it impossible to formulate a similar argument. However, given the high likelihood of enzyme involvement in the dimerization of Δ1-piperideine, we postulate that this second coupling event is also enzymatic.
In order to give the right stereochemistry to the product, the coupling enzyme must facilitate an attack from the Si face of the tautomerized quinolizideine intermediate to the Re face of protonated Δ1-piperideine (17) (Fig. 9B). This creates the two stereocenters later to become C9 and C11 in (−)-sparteine [(−)-8] (Fig. 2C).|| To create the (−)-α-isosparteine backbone instead, a backbone that predominates in L. argenteus,3,32 the attack should be performed on the Si face of protonated Δ1-piperideine (Fig. 9C), thus ensuring the opposite configuration in the new stereocenter later to become C11 in (−)-α-isosparteine (14) (Fig. 2C).
One last interesting result from the feeding studies with cadaverine labelled at Cβ is the loss of the label at the bridgehead carbons C7 and C9 in (−)-sparteine [(−)-8] (Fig. 3G). These two carbons are located next to the iminium carbons of the di-iminium cation (20), and it is tempting to speculate that this position has to do with the loss of label. In particular, it is conceivable that each of the iminium groups is in equilibrium with the corresponding enamine form (Fig. 10B). A rapid establishment of these equilibria prior to reduction could explain the observed loss of the hydrogen atom at these positions. Whether this mechanism or an alternative one is at play remains to be shown experimentally.
Fig. 11 Summary of the proposed QA pathway from L-Lys (2) to (−)-sparteine [(−)-8]. Stereoselective losses/gains of hydrogen are indicated. The configuration of stereocenters is specified only when first produced as well as in the final product. Apparent stereochemical discrepancies with the final (−)-sparteine [(−)-8] product are only due to differences in Cahn–Ingold–Prelog priorities (see footnotes** and ‡). We postulate the involvement of up to 9 enzymes as shown by the encircled numbers. Only the first two enzymes are known: LDC and CAO, respectively. Enzymes 3 and 4 might be the same enzyme or different enzymes acting as part of a tightly bound complex. The same applies to enzymes 6 and 7. Up to two enzymes (enzymes 8 and 9) may be involved in the double reduction of the di-iminium cation intermediate (20) to give (−)-sparteine [(−)-8]. (A) Conversion from L-Lys (2) to (2R,3′R)-tetrahydroanabasine (18). (B) Conversion from (2R,3′R)-tetrahydroanabasine (18) to the quinolizideine intermediate (19). (C) Conversion from the quinolizideine intermediate (19) to (−)-sparteine [(−)-8].
Recent years have seen the development of a multiplicity of resources for gene discovery in QA-producing species, most notably in lupins. Combined with the curated understanding of the QA pathway presented in this review, the prospects for the full elucidation of the QA pathway are very good. The available resources include high-quality genome drafts for L. angustifolius61–63 and L. albus,64,65 a pan-genome for L. albus,66 and transcriptomic data for a range of lupin species and tissues, as compiled by Kamphuis et al.67 Additionally, transcriptomic data have been generated for Sophora species,68,69 and these are of interest to uncover the pathway towards the matrine-like tetracyclic QAs. However, equivalent resources do not exist for species that primarily accumulate α-pyridone QAs, such as Laburnum anagyroides, which produces the smoking cessation agent (−)-cytisine (12). The fact that the α-pyridone QAs are derived from (+)-sparteine [(+)-8] and not (−)-sparteine [(−)-8] (Fig. 1) increases the interest to target this species for sequencing and QA pathway elucidation.
The sequencing resources mentioned above can be used in different ways for the discovery of QA pathway genes. One powerful way to identify candidates involves the critical inspection of genes that are co-regulated with already known QA genes. Indeed, most plant specialized metabolite pathways are regulated tightly at the transcriptional level, showing distinct expression patterns across organs, tissues, or cell types. This is how the second QA pathway gene CAO was identified in L. angustifolius.52 Another strategy involves mining the published genome drafts for potential QA biosynthetic gene clusters (BGCs) using software such as plantiSMASH.70 While BGCs are less prevalent in plants than in bacteria or fungi, several examples of plant specialized metabolite BGCs have been identified in the past decade.71
Notably, QA pathway genes may also be discovered by uncovering the causative mutations for low-QA phenotypes. In lupin breeding, several different loci are known to control QA levels in L. angustifolius, L. albus, L. luteus and L. mutabilis; however, the identity of the underlying genes remains unknown.72 Candidate genes have been proposed for the iucundus73–75 and pauper76,77 loci, but further work is needed to pinpoint single genes and establish causality. Similarly, mutagenized populations in a high-QA background may yield several low-QA mutant phenotypes, and the causative mutations can be identified through forward genetics approaches. Currently, a mutagenized population in a high-QA background exists only for L. mutabilis.78 It would be of interest to generate analogous mutant populations for the two species with the most sequencing resources, L. angustifolius and L. albus, and subsequently screen for low-QA phenotypes. For both the breeding loci and any new loci obtained via mutagenesis, the underlying genes will likely code for enzymes, regulators, or transporters involved directly or indirectly in the QA pathway.
Once identified, candidate QA pathway genes may be tested and characterized in vitro or in heterologous hosts such as yeast or Nicotiana benthamiana. In addition, there are methods for in planta gene characterization in lupin species. Most notably, gene downregulation in L. angustifolius is now possible using a recently published virus-induced gene silencing (VIGS) method.79 Downregulation of LDC using this method resulted in a marked decrease in QA accumulation in leaves. Stable transformation protocols are also available for L. angustifolius, L. luteus, and L. mutabilis;80–82 however, transformation efficiencies are low. Thus, new and more efficient protocols must be developed before stable transformation can become a viable approach for gene characterization in lupins. An increased focus on such protocols will also enable the use of gene editing technology (e.g. CRISPR/Cas9) in the process of lupin crop improvement.
The full elucidation of the QA pathway will represent an important milestone in the field of alkaloid biosynthesis. It will also contribute to the development of improved lupin crops, as it will allow the creation of new low-QA varieties with potentially more stable phenotypes. Finally, it will allow the cost-effective production of industrially or medicinally important QAs such as (−)-sparteine [(−)-8] or (+)-matrine (6) via synthetic biology.
Footnotes
† To name the isotopically modified compounds used in precursor feeding studies, we here use square brackets surrounding the nuclide symbols to indicate partial labeling and round brackets for isotopically substituted compounds (full or near full labeling).
‡ For readers who have noticed that the equivalence of the (6R,7S,9S,11S) and the (6S,7S,9S,11R) forms of (−)-sparteine [(−)-8] (Fig. 2C and Section 3) virtually enables an alternative biosynthetic route via (2S,3′S)-tetrahydroanabasine, we would like to point out that this alternative route is not supported by the precursor feeding experiments that show preferential incorporation of label into ring D of the (6R,7S,9S,11S) form specifically (Section 4). Such an alternative route would lead to preferential incorporation into ring D of the (6S,7S,9S,11R) form, but this ring corresponds to ring A of the (6R,7S,9S,11S) form, as visualized via the 180°-rotation shown in Fig. 2C.
§ Support for a single-enzyme hypothesis might appear to come from the fact that the two C5 units coupled here are incorporated into the sparteine-like QAs to an equivalent extent.26 This absence of a differential extent of labelling, however, may be explained either by the action of a single enzyme catalyzing both proposed steps [without releasing the intermediate Δ2-piperideine (27) into solution] or by two tightly bound enzymes acting as part of a so-called metabolon.
¶ For readers who have noticed that the stereocenter at C3′ in the proposed (2R,3′R)-tetrahydroanabasine intermediate (18) is in the R configuration whereas the corresponding stereocenter at C7 in (−)-sparteine [(−)-8] is in the S configuration, we would like to note that this difference corresponds to a difference in Cahn–Ingold–Prelog priorities rather than a true difference in stereochemistry.
|| Note that the two newly formed stereocenters depicted in Fig. 9B must be in the (R,S) configuration. The fact that (−)-sparteine [(−)-8] possesses an (S,S) configuration at the corresponding C9 and C11 stereocenters is due to a difference in Cahn–Ingold–Prelog priorities between (−)-sparteine [(−)-8] and the di-iminium cation intermediate (20) at the first stereocenter and does not indicate a true difference in stereochemistry.
** In this case, (2,2,4,4-2H4)cadaverine was mixed with an unspecified molar amount of radiolabelled [1,5-14C2]cadaverine tracer before being fed to L. luteus.
This journal is © The Royal Society of Chemistry 2022