Matthew A.
Davis‡
a,
Vivian Yaci
Yu‡
a,
Beverly
Fu
b,
Miao
Wen
b,
Edward J.
Koleski
b,
Joshua
Silverman
c,
Charles A.
Berdan
b,
Daniel K.
Nomura
abd and
Michelle C. Y.
Chang
*abe
aDepartment of Molecular & Cellular Biology, University of California, Berkeley, CA 94720-3200, USA. E-mail: mcchang@berkeley.edu
bDepartment of Chemistry, University of California, Berkeley, CA 94720-1460, USA
cCalysta, 1900 Alameda de las Pulgas Suite 200, San Mateo, CA 94404, USA
dDepartment of Nutritional Sciences & Toxicology, University of California, Berkeley, CA 94720-3104, USA
eDepartment of Chemical & Biomolecular Engineering, University of California, Berkeley, CA 94720-1462, USA
First published on 9th October 2023
Living organisms carry out a wide range of remarkable functions, including the synthesis of thousands of simple and complex chemical structures for cellular growth and maintenance. The manipulation of this reaction network has allowed for the genetic engineering of cells for targeted chemical synthesis, but it remains challenging to alter the program underlying their fundamental chemical behavior. By taking advantage of the unique ability of living systems to use evolution to find solutions to complex problems, we have achieved yields of up to ∼95% for three C4 commodity chemicals, n-butanol, 1,3-butanediol, and 4-hydroxy-2-butanone. Genomic sequencing of the evolved strains identified pcnB and rpoBC as two gene loci that are able to alter carbon flow by remodeling the transcriptional landscape of the cell, highlighting the potential of synthetic pathways as a tool to identify metabolic control points.
A major challenge in the development of cell-based chemical synthesis is that the reaction network used to produce target compounds is also used to carry out basic cell functions. These reactions are thus subject to many levels of regulation in order to maintain the necessary coordination between parts of the metabolic network.1–3 In particular, key hubs of the metabolic map, such as the central carbon pathways of glycolysis and the tricarboxylic acid (TCA) cycle, form many connections with the rest of the network and are difficult to manipulate as their behavior is affected by multiple inputs and outputs.4 As a result, the construction of high-yielding pathways can be difficult to achieve as evolution drives the cell to direct carbon flux to cell growth and biomass in competition with engineered biosynthesis.
Since these central carbon pathways are closely tied to cell state, they are correspondingly subject to homeostatic mechanisms to ensure robustness to change. Therefore, many simultaneous alterations are needed to rationally engineer carbon flow to insufficiently active nodes.5–8 However, an advantage that living systems provide is that evolution can be used to solve this multi-dimensional problem if product titers can be tied to cell growth.9,10 In this work, we demonstrate the design and evolution of synthetic pathways to selectively produce three industrially-relevant C4 compounds: 1,3-butanediol (butylene glycol, BDO), 4-hydroxy-2-butanone (HB), and n-butanol (Fig. 1A). These three compounds are used for various purposes, ranging from pharmaceutical precursors (BDO and HB) to a drop-in gasoline replacement (n-butanol).11–13 In particular, BDO can be used as a humectant or solvent for a variety of different high-value products, as well as a co-monomer for production of various polymers. These three compounds can also be further dehydrated to produce the C4 monomers 1,3-butadiene (from BDO),14 methyl vinyl ketone (from HB),15 and 1-butene (from n-butanol).16 Using a genetic selection, the yields of these pathways were improved from 11–20% to near quantitative yields. Genome sequencing of the evolved strains showed that two gene loci, pcnB and rpoBC, were mutated in the most successful daughter cells. Subsequent characterization demonstrated that mutations at these two loci are sufficient to capture the majority of the evolved phenotype and likely operate by large-scale shifts in the transcriptome. Taken together, these results highlight the possibility of synthetic pathways to be used not only for scalable chemical production but also as a platform for discovery and study of cellular function.
Like ethanol and lactate, the C4 alcohol, n-butanol, can serve to balance glucose fermentation because its biosynthesis recycles the four NADH produced per molecule of glucose. However, a major challenge for production of longer-chain target compounds is that they typically require building blocks from pathways downstream of glycolysis and whose intracellular concentrations are regulated at many levels. One of the most important of these building blocks is acetyl coenzyme A (CoA), which is a two-carbon intermediate that serves as a central point of many metabolic decision points.20,21 Acetyl-CoA synthesis and usage are tightly controlled with flux dropping under anaerobic conditions as both biosynthesis and cell growth are greatly reduced during fermentative growth (Fig. 1B). Indeed, n-butanol titers are greatly lowered when our first-generation Escherichia coli production strain was cultured anaerobically (Fig. 1C).11 In order to reduce carbon flow to competing native pathways, the major fermentation pathways were knocked out of E. coli DH1 to generate DH1 ΔldhA ΔadhE ΔfrdBC ΔpoxB ΔackA-pta (DH1Δ5),10,22 a selection strain that would require production of n-butanol for growth under anaerobic conditions (Fig. 2A, ESI Table S1 and Fig. S1†).13 In order to provide a means to increase flux to acetyl-CoA, the pyruvate dehydrogenase complex (PDHc, aceEF-lpd) was overexpressed for the oxidative decarboxylation of pyruvate to produce acetyl-CoA and NADH to stoichiometrically balance n-butanol production from glucose.
The fermentation knockout strain DH1Δ5 was found competent to grow under anaerobic conditions when the synthetic n-butanol pathway consisting of phaA, hbd, crt, ter, and adhE2 was expressed (Fig. 1). However, increased ethanol production, rather than n-butanol, was observed (Fig. S2†). The most likely source of ethanol was the promiscuity of AdhE2, which is a bifunctional aldehyde-alcohol dehydrogenase that produces both n-butanol and ethanol from the respective two-step reduction of butyryl-CoA and acetyl-CoA in its native host (Fig. 2A).23,24 In this case, direct reduction of two equivalents of acetyl-CoA to ethanol would also regenerate the necessary four NAD+ per glucose and creates a short circuit in the pathway to circumvent n-butanol production (Fig. 2A). Biochemical analysis of different AdhE2 constructs was carried out in order to assess the selectivity of each domain for the C4 and C2 substrates (Fig. S3†). Although there is a ten-fold preference for butyryl-CoA over acetyl-CoA, the higher kcat/KM for the C4 substrate arises directly from a 10-fold lower KM (10 ± 1 μM) with no change in kcat within error. Given that the KM for acetyl-CoA (100 ± 10 μM) is well within the expected physiological range (0.5–1.0 mM), it is likely that AdhE2 is capable of producing both n-butanol and ethanol at competitive rates under intracellular conditions. Although the mechanism of substrate channeling between the aldehyde dehydrogenase (ALDH) and alcohol dehydrogenase (ADH) domains is not yet fully understand, the high KMs (4.0–4.5 mM) measured for the aldehyde intermediate imply that substrate selection is controlled by the ALDH domain. Thus, we set out to identify enzymes that could efficiently carry out the reduction of butyryl-CoA while excluding acetyl-CoA.
Given the modest gains using this approach, we turned our attention to screening wild-type ALDH sequences falling within the C4-selective branch since it seemed likely that the sequence information derived mostly from monofunctional ALDHs did not accurately predict the selectivity of their bifunctional counterparts. The C4 branch of the tree was widely sampled to incorporate the full diversity of this branch in a small number of sequences comprising 15 bifunctional AdhE2 homologs and 18 monofunctional ALDHs (Table S1†). We found that all bifunctional enzymes except one yielded lower n-butanol:ethanol ratios compared to AdhE2 (Fig. S5†), possibly because a large majority of sequenced AdhE2 homologs are thought to be involved in ethanol generation and likely display a natural preference for acetyl-CoA. In contrast, 15 out of 16 monofunctional ALDHs produced more n-butanol than ethanol, possibly because the natural substrate range is larger with this enzyme family (Fig. S6†). Out of these monofunctional ALDHs, ALDH46 was selected as the final candidate for butyryl-CoA reduction. We next sought to improve overall n-butanol production to the levels observed using AdhE2 (Fig. 2B). We reasoned that the bottleneck was the reduction of butyraldehyde to n-butanol due to the absence of the ADH domain. In our biochemical studies of AdhE2, we characterized a truncation mutant consisting of solely the ADH domain of AdhE2 (ADHAdhE2) that surprisingly showed an order of magnitude decrease in KM for butyraldehyde to 300 ± 50 μM. We thus supplemented our pathway containing ALDH46 with the ADHAdhE2 domain, which more than doubled n-butanol titers. Increasing the promoter strength for expression of ALDH46-ADHAdhE2 then improved n-butanol titers in the two-protein system beyond that observed in the original bifunctional AdhE2-dependent pathway, with no ethanol production above background (Fig. 2B).
We hypothesized that the differences in n-butanol compared to BDO production might arise from limitations in ADH activity for reduction of 3-hydroxybutyraldehyde. As such, we generated an ADH sequence similarity network with the goal of identifying a subfamily with the desired substrate selectivity within the larger superfamily (Fig. S8†).31,32 A list of candidates within the subfamilies defined by the known C4-selective ADHs (bdhA, bdhB, dhaT, and yqhD) was then generated and screened by co-expression with ALDH46, which showed no stereochemical preference for reduction of 3-hydroxybutyryl-CoA. While several hits were found, it was interesting to note that these ADHs appeared to all be highly specific for the (R)-isomer.
During this analysis, we identified HB as a side-product that appears to arise from the reduction of an earlier pathway intermediate, acetoacetyl-CoA (Fig. 1A). HB is also an interesting product as its dehydration produces methyl vinyl ketone, a reagent used in the production of fine chemicals.33 Additionally, methyl vinyl ketone is a monomer unit used polymer synthesis.34 We therefore set out to characterize the selectivity of ALDH-ADH pairs by examining partitioning between BDO and HB (Fig. S9†). This screen indicated HB production is highly specific to the ALDH7-ADH2 pair, providing an even distribution of products at high titer (3.4 ± 0.1 g L−1). On the other end, the ALDH3-ADH22 pair was found to capture a large fraction of the C4 product pool as BDO (81%), producing 2.9 ± 0.1 g L−1 of total products under screening conditions.
A selective pathway for production of HB over BDO was engineered by simply removing the PhaB ketoreductase, forming a truncated pathway by eliminating production of 3-hydroxybutyryl-CoA required for BDO formation (Fig. 1A). With this change, the PhaA-ALDH7-ADH2 pathway generated 2.0 ± 0.2 g L−1 HB (Fig. 2C). To selectively produce BDO over HB, we pursued an approach to redirect HB to BDO production by adding a secondary alcohol dehydrogenase (sADH). Specifically, we set out to find a sADH that would catalyze the reduction of HB, resulting from promiscuous ALDH activity on acetoacetyl-CoA, directly to BDO (Fig. S10†).
A number of sADHs have been reported to reduce 4-hydroxy-2-butanone or similar substrates.35 Several of these sADHs were co-expressed with the ALDH7-ADH2 pair, which consistently produced an even mixture of butanediol and hydroxybutanone. Several of the sADHs enabled a shift in the product profile, producing high levels of BDO (>2 g L−1) and minimal amounts of HB (<250 mg L−1) (Fig. S10†). With these sADHs in hand, we could now control the product profile between HB, BDO, or a mixture of the two (Fig. 2C).
Given the dependence of growth rate on pathway titer, we then tested our ability to enrich cell cultures for high producing variants. To do so, cultures of the low-production strain were seeded with either 0.1% or 1% of the medium-production inoculated strain. Throughout the course of extended anaerobic growth, we observed a significant lag phase dependent on the seeding level (Fig. S11†). In this simulated selection, we tracked n-butanol production as well as the abundance of the two different strains using qPCR. In agreement with the growth curves, the abundance of the low-production strain was largely static while the abundance of the medium-production strain was enriched >40-fold as n-butanol production initiated (Fig. 3B).
In order to select for variants with improved n-butanol productivity under anaerobic conditions, we turned to adaptive evolution after efforts using synthetic mutagenesis methods such as chemical mutagens or UV irradiation appeared to find only local minima in the evolutionary trajectory (Fig. S12†). In this approach, the natural mutation frequency is utilized, which requires longer evolution times but selects for more advantageous mutations and minimizes the occurrence of neutral mutations.36,37 Since every evolutionary trajectory has the potential to yield different results, we evolved two different host strains, DH1Δ5 and BW25113Δ5, using media ranging in richness from M9, 10% (v/v) LB in M9, and LB, by diluting the culture every 24 h over the course of 4–70 days (Fig. S13†). Using this approach, we obtained evolved strains capable of producing n-butanol at levels up to 75% theoretical yield, representing a six-fold improvement over the 12% theoretical yield achieved in the parent strain (Fig. 3C). Although the redox balance is not stoichiometric as it is with n-butanol, we were also able to evolve BDO and HB production in DH1Δ5 from 9% to 81% and from 4% to 68% theoretical yield, respectively, in TB (Fig. 3D and S14†). Isolation of pathway plasmids from the evolved strains and transformation into a clean background showed no improvement in product titers, indicating that the relevant mutations were generated on the chromosome (Fig. S14†). Cultures of the evolved strains in rich media in shake-flasks with an oleyl alcohol overlay further yielded titers of up to 47 ± 6 g L−1 and >95% yield due to increased n-butanol solubility (Fig. S15†). Taken together, the evolved strains demonstrate robust production of a range of C4 products from acetyl-CoA under anaerobic conditions.
The discovery that genes involved in RNA metabolism appear to drive metabolic network evolution led us to the hypothesis that the phenotypic changes were largely being controlled by alterations in the global transcriptional program. This model is consistent with pathway enzymatic activity measurements in cell lysates, which showed no significant increase between parent and evolved strains at the end of a production growth (Fig. S16†). This result suggests that yield increases were not derived from overexpression of heterologous pathway genes. To further characterize this phenomenon, we performed an RNA-Seq experiment on the evolved BDO strain with the largest improvement in production titer (DH1Δ5.2406) containing point mutations in pcnB and rpoC. We found 126 differentially-expressed genes (β value > 2) between the parental and evolved strain falling into a broad range of categories, including energy production and conversion, amino acid transport and metabolism, cell envelope biogenesis, and carbohydrate transport and metabolism (Fig. S17 and S18†). This large shift in the transcriptome indicates that alterations in acetyl-CoA and central carbon homeostasis may require changes at many metabolic nodes. This data is supported by metabolomics experiment that suggest that acetyl-CoA levels are higher in the evolved strains (Fig. S19†). Genes upregulated in the evolved strain were enriched for the transport GO term, likely to support metabolite uptake and export of BDO (Fig. S20†). Downregulated genes were enriched for the translation, cellular α-amino acid metabolism, nucleotide metabolism, and response to stress GO terms, representing a return to normal carbon homeostasis in the evolved strain (Fig. S20†).
In order to validate the impact of the pcnB and rpoC mutations, the two mutations observed in this BDO strain (pcnB R149L/rpoC M466L) were introduced into a clean genetic background. These experiments show that these mutations in rpoC and pcnB are synergistic, as both are required to achieve a substantive increase in BDO titer compared to the parent (Fig. 4C). Indeed, the double mutant demonstrates a 2.75-fold increase in BDO titers (parent, 2.1 ± 0.1 g L−1; double mutant, 5.8 ± 0.2 g L−1), which recapitulates 73% of the improvement observed in the fully evolved strain (8.1 ± 0.1 g L−1). We were also interested in the generality of these mutations and thus tested their ability to increase the yields of other synthetic pathways. When the n-butanol pathway is introduced into the double mutant, we observe a 3.2-fold increase in product titer from 2.3 ± 0.6 to 7.3 ± 1.1 g L−1 (Fig. 4C). Altogether, these data show mutations in only two genes, pcnB and rpoC, are capable of driving a large shift in central carbon metabolism that can be generalized to related pathways utilizing the acetyl-CoA building block.
In our system, we found that use of natural adaptive evolution allowed us to rapidly reach high production strains compared to the use of mutagens that increase the mutation rate but appeared to only find local minima in the evolutionary trajectory. Genome-level characterization of these strains revealed that mutations in two gene loci, pcnB and rpoBC, were sufficient to enable the needed shifts in carbon flow. Interestingly, these mutations have been previously observed in studies of E. coli evolution for both growth in M9 (rpoBC)41 and production of n-butanol (pcnB).13 Physiological studies suggest that this effect may rely on remodeling the transcriptome by influencing RNA metabolism along with rne. Interestingly, a wide range of mutations were identified within these three genes, some of which have been found to be important for activity in biochemical studies.40 This finding suggests that these four genes could be used for diversity generation at the phenotypic level by inducing pleiotropic changes in the transcriptional landscape. Furthermore, mutations found in the evolved BDO strain could be translated to significant increases in n-butanol yields, indicating that these strains could be relevant to the production of other acetyl-CoA-derived products such as fatty acids, polyketides, and isoprenoids.
In conclusion, living systems offer a unique advantage for chemical synthesis to increase product yields through evolution. In particular, central carbon metabolism plays an essential role in cell fitness and thus represents a key regulator and reporter of cellular state.42 These pathways are subject to tight homeostasis with multiple mechanisms to ensure robustness and reduce sensitivity to change.43 In this regard, engineered pathways provide an interesting platform where product titer can be treated as a synthetic phenotype or marker for quantitative assessment of genetic traits that lead to large shifts in the regulatory and metabolic network.44 By using evolution to solve difficult design challenges, we can take advantage of synthetic pathways to identify new strategies to alter behaviours that are hard-wired into the systems-level organization of the host.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3sc02773b |
‡ Matthew A. Davis and Vivian Yaci contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2023 |