Michael Muzika†[a], Natali H. Muskat†[a], Shani Sarid[a], Oshrit Ben-David[a], Ryan A. Mehl[b] and Eyal Arbely*[a,c]
[a] Department of Chemistry and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer-Sheva, 8410501, Israel. E-mail: arbely@bgu.ac.il; Fax: +972-(0)8-6428449; Tel: +972-(0)8-6428739
[b] Department of Biochemistry and Biophysics, Oregon State University, Corvallis, Oregon 97331, USA
[c] Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva, 8410501, Israel
First published on 17th July 2018
Genetic code expansion technology enables the site-specific incorporation of dozens of non-canonical amino acids (NCAAs) into proteins expressed in live cells. The NCAAs can introduce various chemical functionalities into proteins, ranging from natural post-translational modifications, to spectroscopic probes and chemical handles for bioorthogonal reactions. These chemical groups provide powerful tools for structural, biochemical, and biophysical studies, which may require significant quantities of recombinantly expressed proteins. NCAAs are usually encoded by an in-frame stop codon, such as the TAG (amber) stop codon, which leads to the expression of C-terminally truncated proteins. In addition, the incubation medium should be supplemented with the NCAA at a final concentration of 1–10 mM, which may be challenging when the availability of the NCAA is limited. Hence, bacterial expression of proteins carrying NCAAs can benefit from improvement in protein yield per given amount of added NCAA. Here, we demonstrate the applicability of an optimized chemically-defined lactose-based autoinduction (AI) medium to the expression of proteins carrying a NCAA, using the archaeal pyrrolysyl-tRNA synthetase/tRNA pair from the Methanosarcina genus. Per given amount of added NCAA, the use of AI medium improved protein expression levels by up to 3-fold, compared to IPTG induction, without an increase in misincorporation of canonical amino acids in response to the in-frame stop codon. The suggested medium composition can be used with various Escherichia coli variants transformed with different expression vectors and incubated at different temperatures.
Expression of recombinant proteins in bacteria is fundamental to biochemical, biophysical and structural studies. Genetic encoding of NCAAs is of particular importance to such studies, as it enables the site-specific modification of proteins with ‘tailor-made’ functional groups. Unless the host organism was engineered to synthesize the NCAA,15 expression of modified proteins requires the addition of the NCAA to the growth medium at 1–10 mM concentration range. However, in many cases the availability of the NCAA is limited. In addition, encoding the NCAA with an in-frame stop codon leads to the expression of C-terminally truncated proteins, which can significantly reduce overall protein yield. Hence, expression of recombinant proteins carrying a NCAA can benefit from methodologies that improve protein expression levels per given amount of NCAA added to the medium. That said, over the years several advances have been made on that front. For example, in Escherichia coli (E. coli) the use of genome engineering to replace genomic TAG codons with other stop codons, along with knockout of bacterial release factor 1 (RF-1), have significantly improved amber suppression efficiency and protein expression levels.16–18
One way to improve recombinant protein expression yield in bacteria is to use media that support the growth of high cell-density cultures, such as chemically-defined auto-induction (AI) media.19,20 Protein expression in E. coli cultured in AI medium is based on diauxic bacterial growth: during the first phase, culture growth is supported by utilization of preferred carbon substrates such as glucose; in the second phase, at low glucose concentrations, other carbon sources such as glycerol and lactose (or arabinose) are used, and the latter also serves as the inducer for lac (or ara, respectively) operon-controlled protein expression. The presence of glucose in AI media also prevents the uptake of lactose and represses expression of proteins controlled by the lac operon. Following glucose depletion, glycerol can serve as an effective carbon and energy source. However, glycerol-based metabolism may reduce the pH of culture media to a level that can stop culture growth. In contrast, metabolism of amino acids and organic acids with relatively high pKa (such as succinate) can reduce medium acidification.20 Thus, bacteria cultured in such media may reach high cell density, and a fine balance between medium components and different carbon sources can support the growth of protein-expressing bacteria that undergo ‘auto-induction’ at a certain culture density, when glucose depletion allows lactose-induction of protein expression.19,20 An added advantage of AI media is that protein expression becomes easier and more reproducible, as there is no need to monitor the culture OD600.
AI media can be divided into two classes: chemically-defined and non-defined AI media. The former enables fine-tuning of amino acid composition and growth conditions for high-density cultures, as well as expression of proteins labeled with selenomethionine.20,21 Complete control over amino acid composition can be important for expression of proteins with NCAAs, as it eliminates potential misincorporation of canonical amino acids by promiscuous synthetases.22–26 That said, current evolved synthetases display high fidelity (ability to discriminate against canonical amino acids) in the presence of the NCAA. Low fidelity is often observed when proteins are expressed in the absence of the NCAA, particularly when permissive synthetases (capable of recognising more than one NCAA) are used. As the evolution of an aaRS is dependent on selection conditions,25 the ability to eliminate specific canonical amino acids from the selection medium (as in chemically-defined AI media) may enable the isolation of efficient aaRSs with high fidelity, as long as the same amino acids are eliminated from the expression media. Importantly, protein expression in chemically-defined AI media usually provides superior yields. Therefore, it may improve protein yield per given amount of NCAA in particular, and per medium volume in general, when compared to regular media. Indeed, proteins carrying NCAAs have been expressed in E. coli incubated in AI media,17,27,28 and an arabinose-based chemically-defined AI medium was optimized for NCAA incorporation by the Methanocaldococcus jannaschii tyrosyl-tRNA synthetase/tRNATyr pair.29 However, the applicability of chemically-defined lactose-based AI media for protein expression using the PylRS/PylT pair has never been demonstrated.
One of the most frequently used bacterial expression systems is based on T7 RNA polymerase expression from an inducible promoter, such as the lacUV5.30 Induction is usually realized by the addition of isopropyl β-D-1-thiogalactopyranoside (IPTG), although it has been shown that lactose can also be used as an inducer.31 Many commercially available E. coli strains support the expression of proteins using lactose-inducible promoter based systems; e.g., BL21(DE3), B834(DE3), Origami™, Lemo21™, and Rosetta™. Moreover, two RF-1 knockout BL21(DE3) strains, B-95.ΔA and B-95.ΔAΔfabR, have been created to allow superior expression levels of proteins with NCAAs at 37 °C and low temperatures, respectively.17 Hence, the use of lactose-based chemically-defined AI media for NCAA incorporation using the PylRS/tRNAPylCUA pair can improve protein expression levels in this array of bacterial strains. Here we describe an optimized chemically-defined AI medium composition for high protein expression levels per given amount of supplemented NCAA, with no negative effect on the fidelity of the aaRS/tRNAPylCUA pair. We also demonstrate the applicability of the suggested AI medium to different NCAAs, expressed proteins, expression plasmids, incubation temperatures, and E. coli strains (including an RF-1 knockout strain). As an example for AI medium lacking specific amino acids, we eliminated lysine and glutamine, without negatively affecting protein expression levels. Overall, the suggested chemically-defined lactose-based AI medium improved protein yield per given amount of NCAA by up to 3-fold, when cultures were incubated for 24 h at 37 °C.
Component | Stock concentration | Dilution | Final concentration |
---|---|---|---|
Glycerol [a] | 10% (w/v) | 1:20 | 0.5% (w/v) |
Glucose [a] | 37.5% (w/v) | 1:500 | 0.075% (w/v) |
α-Lactose monohydrate [a,b] | 20% (w/v) | 1:400 | 0.05% (w/v) |
MgSO4 | 1 M | 1:500 | 2 mM |
Monosodium succinate (pH = 6.8) | 17.5% (w/v) | 1:40 | 0.438% (w/v) |
Na2HPO4 | 0.5 M | 1:20 | 25 mM |
KH2PO4 | 0.5 M | 1:20 | 25 mM |
NH4Cl | 1 M | 1:20 | 50 mM |
Na2SO4 | 0.1 M | 1:20 | 5 mM |
Trace metals [c] | Variable | 1:5000 | Variable |
Amino acids [d] | 5 mg mL−1 (each) | 1:25 | 0.2 mg mL−1 (each) |

[a] Concentration of carbon sources was adjusted as described in the Results section.
[b] Hereafter simply referred to as lactose.
[c] See Table 2 for list of trace metals.
[d] Stock solution of amino acids was prepared as described in the Methods section.
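The dilution factors listed above translate directly into pipetting volumes. The following is a minimal sketch, assuming that "1:N" means one part stock in N parts of final medium and that the remaining volume is made up with sterile water (the make-up liquid is an assumption, not stated in the table). The names `DILUTIONS`, `stock_volumes_ml` and `water_volume_ml` are hypothetical, and the NCAA and any antibiotics are not included.

```python
# Dilution factors transcribed from the medium composition table
# (1:N is read here as 1 part stock per N parts of final medium).
DILUTIONS = {
    "Glycerol, 10% (w/v)": 20,
    "Glucose, 37.5% (w/v)": 500,
    "alpha-Lactose monohydrate, 20% (w/v)": 400,
    "MgSO4, 1 M": 500,
    "Monosodium succinate, 17.5% (w/v)": 40,
    "Na2HPO4, 0.5 M": 20,
    "KH2PO4, 0.5 M": 20,
    "NH4Cl, 1 M": 20,
    "Na2SO4, 0.1 M": 20,
    "Trace metals": 5000,
    "Amino acids, 5 mg/mL each": 25,
}


def stock_volumes_ml(final_volume_ml):
    """Return the volume (mL) of each stock needed for the given final volume."""
    return {name: final_volume_ml / factor for name, factor in DILUTIONS.items()}


def water_volume_ml(final_volume_ml):
    """Volume (mL) left to fill with water (assumption: water is the make-up liquid)."""
    return final_volume_ml - sum(stock_volumes_ml(final_volume_ml).values())


if __name__ == "__main__":
    for name, vol in stock_volumes_ml(1000).items():
        print(f"{name}: {vol:.2f} mL")
    print(f"Water to 1 L: {water_volume_ml(1000):.2f} mL")
```

For a 1 L culture this gives, for example, 2 mL of the glucose stock and 2.5 mL of the lactose stock; the volume of the NCAA stock would have to be subtracted from the water volume separately.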
Salt | Stock concentration | Final concentration |
---|---|---|
FeCl3 | 50 mM | 10 μM |
CaCl2 | 20 mM | 4 μM |
MnCl2 | 10 mM | 2 μM |
ZnSO4 | 10 mM | 2 μM |
CoCl2 | 2 mM | 0.4 μM |
CuCl2 | 2 mM | 0.4 μM |
NiCl2 | 2 mM | 0.4 μM |
Na2MoO4 | 2 mM | 0.4 μM |
Na2SeO3 | 2 mM | 0.4 μM |
H3BO3 | 2 mM | 0.4 μM |

Trace metal stock solution was filter-sterilized and used at 1:5000 dilution.
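As a quick consistency check, the final trace metal concentrations follow from the stock concentrations and the 1:5000 dilution. The snippet below is only an illustrative sketch; the function name `final_um` is hypothetical.

```python
# Stock concentrations (mM) transcribed from the trace metals table.
TRACE_METALS_MM = {
    "FeCl3": 50, "CaCl2": 20, "MnCl2": 10, "ZnSO4": 10, "CoCl2": 2,
    "CuCl2": 2, "NiCl2": 2, "Na2MoO4": 2, "Na2SeO3": 2, "H3BO3": 2,
}
DILUTION = 5000  # stock is used at 1:5000


def final_um(stock_mm, dilution=DILUTION):
    """Convert a stock concentration in mM to the final concentration in micromolar."""
    return stock_mm * 1000 / dilution


if __name__ == "__main__":
    for salt, stock in TRACE_METALS_MM.items():
        print(f"{salt}: {final_um(stock):g} uM")
```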
Fig. 1 Expression vectors and NCAA structure. (A) Two plasmid systems were used for bacterial expression of proteins with a site-specifically incorporated NCAA. In plasmid system A, the aaRS is cloned on a pBK vector (plasmid a), while the protein of interest with an in-frame stop codon is cloned on a specialized pCDF vector along with the pylT gene for the transcription of tRNAPylCUA (plasmid b). In plasmid system B, the NCAA-specific pylRS variant and pylT are cloned on the same plasmid (pDule vector,15 plasmid c) and the protein of interest is expressed using a pCDF vector (plasmid d). (B) Chemical structures of Nε-[(tert-butoxy)carbonyl]-L-lysine (1) and Nε-acetyl-L-lysine (2).
Fig. 2 Effect of carbon source composition on protein expression in AI medium. Expression levels of sfGFP150BocLys in BL21(DE3) cells transformed with plasmid system A (Fig. 1A) and incubated in chemically-defined AI media supplemented with 1 (1 mM) and indicated concentrations of carbon sources. Average values are presented ± SD, n = 3. (A) Fluorescence intensity as a function of glycerol concentration. [glucose] = 0.05% (w/v), [lactose] = 0.05% (w/v). (B) Fluorescence intensity as a function of glucose concentration. [glycerol] = 0.5% (w/v), [lactose] = 0.05% (w/v). (C) Fluorescence intensity as a function of lactose concentration. [glycerol] = 0.5% (w/v), [glucose] = 0.05% (w/v).
We first tested the effect of glycerol concentration by keeping glucose and lactose concentrations at 0.05% (w/v) (Fig. 2A). Glycerol concentration within the range of 0.3–1.2% (w/v) had no statistically significant effect on culture density and expression levels of full-length sfGFP150BocLys. We therefore decided to follow the original protocol suggested by Studier F. W. and kept glycerol concentration at 0.5% (w/v).20 Next, we examined the effect of glucose concentration within the range of 0.025–0.5% (w/v), while keeping glycerol and lactose concentrations at 0.5% (w/v) and 0.05% (w/v), respectively (Fig. 2B). High glucose concentration of 0.5% (w/v) inhibited culture growth and protein expression. Glucose concentration of 0.3% (w/v) enabled culture growth to higher density, but overall protein yield was lower, compared to 0.05% glucose. Relatively high protein yield was obtained between 0.05% and 0.15% (w/v) glucose concentration. Finally, we measured the effect of lactose concentration on culture density and expression levels of sfGFP150BocLys (Fig. 2C). Within the range of 0.05–1.00% (w/v) lactose, lowest culture density and sfGFP150BocLys expression were measured at 0.70% and 1.00% (w/v) lactose. Culture density was similar within the range of 0.05% and 0.45% (w/v) lactose, while protein expression was slightly higher within the lower range of lactose concentrations.
Fig. 3 Fine-tuning of glucose and lactose concentrations. (A) E. coli BL21(DE3) cells transformed with plasmid system A (Fig. 1A) were incubated in chemically-defined AI media supplemented with 1 (1 mM), 0.5% (w/v) glycerol, and indicated concentrations of glucose and lactose. The highest expression level of sfGFP150BocLys was obtained when transformed bacteria were incubated in chemically-defined AI medium supplemented with 0.075% (w/v) glucose and 0.05% (w/v) lactose. Average values are presented ± SD, n = 3. (B) The final concentration as well as the ratio between glucose and lactose were verified using a small-scale expression media array. Chemically-defined AI media were supplemented with 1 (1 mM), 0.5% (w/v) glycerol and 0.01–0.09% (w/v) of glucose and lactose. The chosen optimal condition for expression of proteins with genetically encoded NCAA [0.5% (w/v) glycerol, 0.075% (w/v) glucose, 0.05% (w/v) lactose] is marked with a red circle.
Fig. 4 Effect of amino acid composition and expression time on protein expression in chemically-defined lactose-based AI medium. (A) Transformed E. coli BL21(DE3) cells expressing sfGFP-150TAG and wild-type pyrrolysyl-tRNA synthetase/tRNAPylCUA (using plasmid system A, Fig. 1A) were incubated in the fine-tuned chemically-defined lactose-based AI medium, or medium without lysine, without glutamine, or without lysine and glutamine. The effect of lysine and/or glutamine exclusion on protein expression levels was monitored by measuring sfGFP150BocLys expression in the presence (+) or absence (−) of 1 (1 mM). Average values for biological replicates are presented ± SD, n = 3. (B) Total mass of sfGFP150BocLys expressed in E. coli BL21(DE3) cells incubated in 2×TY medium (IPTG induction), or fine-tuned chemically-defined lactose-based AI medium without lysine and glutamine; both media were supplemented with 1 mM of 1. Expected mass: 27941.5 Da. (C) sfGFP150BocLys fluorescence and OD600 measured as a function of time for E. coli BL21(DE3) cells transformed as described in A and incubated in fine-tuned chemically-defined lactose-based AI medium. Average values for biological replicates are presented ± SD, n = 3.
To further ensure the fidelity of PylRS expressed in E. coli cultured in chemically-defined lactose-based AI medium, we verified the incorporation of 1 into expressed sfGFP by ESI-MS. As seen in Fig. 4B, the total mass of sfGFP150BocLys expressed in 2×TY (top, 27943.2 Da) or chemically-defined lactose-based AI medium (bottom, 27943.4 Da) was within the error range of the expected mass of 27941.5 Da. Therefore, the fidelity of wild-type pyrrolysyl-tRNA synthetase was similar when sfGFP150BocLys was expressed in 2×TY or chemically-defined lactose-based AI medium. Interestingly, we noticed that expression of sfGFP in chemically-defined lactose-based AI medium increased the extent of hydrolytic cleavage of the N-terminal methionine (27811.8 Da, expected mass: 27808.4 Da). It should be noted that divalent cations such as Fe²⁺, Mn²⁺, and Co²⁺ are cofactors of methionyl aminopeptidase,41,42 and that the AI medium is supplemented with several divalent cations. Finally, we followed protein expression levels and culture density as a function of time using BL21(DE3) incubated in the fine-tuned lysine- and glutamine-free chemically-defined lactose-based AI medium. As depicted in Fig. 4C, expression levels of sfGFP reached a plateau after approximately 24 h of incubation at 37 °C. We therefore conclude that lysine and glutamine can be omitted from the chemically-defined medium without negative effects on protein expression levels.
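The comparison between measured and expected masses can be made explicit with a small calculation. The sketch below uses only the masses quoted in the text; the function name `mass_deviation` is hypothetical, and no acceptance threshold is implied because the instrument's error range is not stated here.

```python
def mass_deviation(observed_da, expected_da):
    """Return the deviation of an observed mass from the expected mass in Da and in ppm."""
    delta_da = observed_da - expected_da
    ppm = delta_da / expected_da * 1e6
    return delta_da, ppm


# Masses (Da) quoted in the text for sfGFP150BocLys and its Met-cleaved form.
MEASUREMENTS = {
    "2xTY, full length": (27943.2, 27941.5),
    "AI medium, full length": (27943.4, 27941.5),
    "AI medium, N-terminal Met cleaved": (27811.8, 27808.4),
}

if __name__ == "__main__":
    for label, (observed, expected) in MEASUREMENTS.items():
        delta, ppm = mass_deviation(observed, expected)
        print(f"{label}: {delta:+.1f} Da ({ppm:+.0f} ppm)")
```

With these values, the two full-length measurements differ from each other by only 0.2 Da, which is consistent with the statement that incorporation of 1 was similar in both media.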
Fig. 5 aaRS fidelity and protein expression levels using evolved aaRSs and different E. coli strains or AI medium compositions. (A) To monitor the level of possible amino acid misincorporation by the amber suppression machinery, BL21(DE3) cells were transformed with plasmid system A and incubated in chemically-defined lactose-based AI media in the absence of 1 (−). Protein expression was quantified by measuring sfGFP150BocLys fluorescence in live bacteria. For comparison, sfGFP150BocLys fluorescence was measured in bacteria incubated in the presence of 1 (+). The aaRSs used in this study (names and mutations relative to wild-type PylRS): 1 – M. barkeri wild-type synthetase; 2 – M. mazei wild-type synthetase; 3 – AcKRS3 (M. barkeri L266M, L270I, Y271F, L274A, C313F);34 4 – AcKRS1 (M. barkeri L266V, L270I, Y271F, L274A, C313F);35 5 – BCNRS (M. barkeri Y271M, L274G, C313A);36 6 – PCKRS (M. barkeri M241F, A267S, Y271C, L274M);37 7 – ThzKRS (M. barkeri A267S, C313V, M315F, D344G);38 8 – δSHKRS (M. barkeri Y349W);39 9 – ONBYRS (M. barkeri L270F, L274M, N311G, C313G).40 (B) Expression of sfGFP150BocLys in BL21(DE3), Rosetta(DE3), or Lemo21(DE3) E. coli strains transformed as described in A and incubated for 24 h in chemically-defined lactose-based AI media with (+) or without (−) 1. Protein expression levels were quantified by GFP fluorescence measurement. (C) Fluorescence of sfGFP150AcLys expressed in BL21(DE3) incubated for 24 h in AI media suggested in the current study, Studier F. W.,20 or Fox B. G. and Blommel P. G.,19 and supplemented with 2. Average values for biological replicates are presented ± SD, n = 3.
The E. coli BL21(DE3) strain is commonly used for the expression of recombinant proteins. That said, other commercially available E. coli BL21-based strains are often used for the expression of various proteins. Examples include the Rosetta™(DE3) strain (chloramphenicol resistant, Novagen), which supplies additional tRNAs for rare codons, and the tunable T7 expression strain Lemo21™(DE3) (chloramphenicol resistant, NEB). As seen in Fig. 5B, NCAA-dependent expression of sfGFP150BocLys was observed in these bacterial strains. However, protein expression levels in Lemo21(DE3) and especially Rosetta(DE3) were lower than expression levels in BL21(DE3). While expression levels in Lemo21(DE3) and Rosetta(DE3) may improve by optimizing medium composition, we noted that expression levels of proteins carrying an NCAA are usually lower in these strains, compared to BL21(DE3), even when bacteria are incubated in 2×TY medium.
We also compared the expression levels of sfGFP150AcLys in E. coli BL21(DE3) incubated in different lactose-based AI media supplemented with Nε-acetyl lysine (2, Fig. 5C). Protein expression levels in bacteria incubated in the AI medium suggested in the current study were higher than those measured in bacteria incubated in AI media suggested by Studier F. W.,20 or Fox B. G. and Blommel P. G.19 (none of the media were supplemented with vitamins). Hence, the suggested AI medium offers improved protein expression levels relative to lactose-based AI media compositions that were not optimized for NCAA incorporation.
Fig. 6 Improved protein expression in chemically-defined lactose-based AI medium. To compare protein expression levels, total protein extracts normalized by culture volume were analysed by Western blot and proteins were visualized using an antibody against the C-terminal 6×His-tag. Representative membranes are shown for each set of experiments. Average values for biological replicates are presented ± SD, n ≥ 3. Statistical analysis was performed using Student's t-test (two-tailed, unpaired). *P < 0.05, **P < 0.01, ***P < 0.001. (A) sfGFP-150TAG expressed in E. coli BL21(DE3) cells transformed with plasmid system A (Fig. 1A) and incubated with NCAA 1 (left) or 2 (right). Expression was induced in chemically-defined lactose-based AI medium or by 1 mM IPTG in 2×TY medium at 37 °C. (B) sfGFP-150TAG expressed in E. coli B-95.ΔAΔfabR cells incubated with NCAA 1. Expression was induced in lactose-based AI medium or by 1 mM IPTG in 2×TY medium at 22 or 37 °C. (C) Expression of Lys685-acetylated STAT3 and Lys120-acetylated p53 variants utilizing plasmid system B (Fig. 1A, plasmids c and d), in the E. coli BL21(DE3) strain incubated in AI medium supplemented with NCAA 2. (D) Western blot analysis of Lys685-acetylated STAT3 expressed using plasmid system A, or plasmid system B.
We also compared protein expression levels in bacteria incubated in AI media and IPTG-induced bacteria incubated in 2×TY, using the RF-1 knockout strain B-95.ΔAΔfabR.17 Although protein expression levels in this strain are usually higher compared to BL21(DE3), higher density cultures can further improve protein yield per given amount of added NCAA. Indeed, when B-95.ΔAΔfabR cells were incubated at 37 °C, protein expression in chemically-defined AI medium was significantly higher (Fig. 6B). However, when cells were incubated at 22 °C for 48 h, no improvement in protein expression levels was observed. That said, at 22 °C, the expression level in the absence of the NCAA was lower in AI medium compared to 2×TY. Therefore, chemically-defined lactose-based AI medium without lysine and glutamine can improve protein yield in the B-95.ΔAΔfabR RF-1 knockout strain and reduce possible amino acid misincorporation.
The expression tests presented above were performed using plasmid system A and sfGFP-150TAG as a model protein. To demonstrate the broad applicability of the chemically-defined AI medium, we used plasmid system B (Fig. 1A) to express two proteins, site-specifically acetylated at biologically relevant positions: the signal transducer and activator of transcription 3 (STAT3) site-specifically acetylated at position Lys685, and the DNA binding domain of the tumour suppressor protein p53, site-specifically acetylated at position Lys120.43–47 Acetylation of these lysine residues was shown to affect the transcriptional activity of STAT3 and p53, and as such, recombinant expression of the site-specifically acetylated proteins is important for in vitro acetylation-dependent functional and structural studies. In plasmid system B the genes required for amber suppression (the aaRS and pylT) are encoded on one plasmid (based on the pDule backbone), whereas in plasmid system A, used so far, the gene of interest is encoded on a specialized plasmid carrying the pylT gene. Using plasmid system B we were able to express Lys685-acetylated STAT3 and Lys120-acetylated p53 in BL21(DE3) incubated in chemically-defined lactose-based AI medium (Fig. 6C). While protein expression may be improved by optimizing the composition of the chemically-defined AI medium, the data show that our suggested medium supports the expression of different acetylated proteins using the pDule expression vector. That said, expression levels of acetylated proteins using plasmid system A were approximately 5-fold higher, compared to plasmid system B (Fig. 6D). The difference in expression levels between these two plasmid systems is expected because the ratio of pyrrolysyl-tRNA synthetase/tRNA is higher with system A, resulting in higher suppression efficiency.48
Taken together, we demonstrated improved expression level of proteins with genetically encoded NCAAs in modified chemically-defined lactose-based AI medium, using the pyrrolysyl-tRNA synthetase/tRNAPylCUA pair. The suggested medium improved protein expression yield by up to 3-fold without measurable effects on the fidelity of wild type PylRS and evolved aaRSs. Our data show that the medium can be used for the expression of different proteins using different E. coli strains, following an ‘inoculate-and-forget’ protocol. The composition of the suggested medium supports the expression of proteins using plasmid system A (pBK vector) or B (pDule vector). The advantage of plasmid system B is that it allows convenient use of existing standard expression vectors (e.g., pET vectors) bearing only the target protein, and therefore saves additional cloning steps. However, in our hands, protein expression levels using plasmid system A were higher, compared to plasmid system B.
Protein expression levels using medium without lysine and glutamine were similar to those measured in media supplemented with these two amino acids. Lysine and glutamine were chosen as an example, based on their structural similarity to pyrrolysine and its non-canonical derivatives. It may be interesting to check if aaRS evolution performed in chemically defined media lacking specific canonical amino acids can provide efficient and permissive aaRS without compromised fidelity.
Due to the ability of the chemically-defined medium to support higher culture density, using this medium significantly improved protein yield per given amount of NCAA, even when an RF-1 knockout strain was used. Considering the scarce availability of many NCAAs, protein expression using AI medium offers an attractive alternative to ‘standard’ growth media and IPTG induction protocols.
Footnote
† These authors contributed equally to this work.
This journal is © The Royal Society of Chemistry 2018