Open Access Article
This Open Access Article is licensed under a
Creative Commons Attribution 3.0 Unported Licence

A GFP-strategy for efficient recombinant protein overexpression and purification in Mycobacterium smegmatis

Anjana Radhakrishnan, Christopher M. Furze, Mohd Syed Ahangar and Elizabeth Fullam*
School of Life Sciences, University of Warwick, Coventry, CV4 7AL, UK. E-mail: e.fullam@warwick.ac.uk; Tel: +44 (0)2476 574239

Received 23rd July 2018 , Accepted 14th September 2018

First published on 25th September 2018


Abstract

One of the major obstacles to obtaining a complete structural and functional understanding of proteins encoded by the Mycobacterium tuberculosis (Mtb) pathogen is due to significant difficulties in producing recombinant mycobacterial proteins. Recent advances that have utilised the closely related Mycobacterium smegmatis species as a native host have been effective. Here we have developed a method for the rapid screening of both protein production and purification strategies of mycobacterial proteins in whole M. smegmatis cells following green fluorescent protein (GFP) fluorescence as an indicator. We have adapted the inducible T7-promoter based pYUB1062 shuttle vector by the addition of a tobacco etch virus (TEV) cleavable C-terminal GFP enabling the target protein to be produced as a GFP-fusion with a poly-histidine tag for affinity purification. We illustrate the advantages of a fluorescent monitoring approach with the production and purification of the mycobacterial N-acetylglucosamine-6-phosphate deacetylase (NagA)-GFP fusion protein. The GFP system described here will accelerate the production of mycobacterial proteins that can be used to understand the molecular mechanisms of Mtb proteins and facilitate drug discovery efforts.


Introduction

Tuberculosis (TB), caused by Mycobacterium tuberculosis (Mtb), is now the leading cause of human mortality from an infectious agent. In 2016, 1.7 million deaths and 10.4 million new cases of TB were reported by the WHO.1 Whilst TB can be treated, the current drug-regimen is complicated and of long-duration requiring a combination of the four first-line drugs over a period of 6–9 months.2 Recently, there has been an increase in the emergence of numerous drug-resistant strains which further complicates the regimen, prolongs treatment and is more expensive to administer. Extensively-drug resistant (XDR-TB) strains of TB have now been reported in 123 countries and there are often no therapeutic agents to successfully treat these TB cases.1 Therefore, an increased knowledge of biochemical pathways employed by the Mtb pathogen is urgently required in order to develop new anti-tubercular agents with novel modes of action to reduce the global health threat from TB.

An important prerequisite towards obtaining a complete biochemical understanding of the Mtb pathogen for improved diagnostics and therapeutics is the ability to produce recombinant proteins for structural and functional studies. However, the production of high quality recombinant Mtb proteins in sufficient quantity has proved to be particularly challenging and has hampered progress in this area.3 Escherichia coli is one of the most commonly used bacterial host expression systems for the overexpression of recombinant proteins but often results in the production of insoluble inclusion bodies which is a major bottleneck for structural and functional studies.4,5 A number of studies have shown that the ability to produce soluble proteins from Mtb in E. coli that are correctly folded and active is particularly challenging3,6 and this has been partially attributed to the different G + C content between E. coli and Mtb which have G + C contents of 51%7 and of 66%8 respectively, although other factors may also contribute.

Given the limitations of E. coli as a host expression system for mycobacterial proteins a number of different systems have been explored for Mtb protein production. These include the use of Gram-negative Pseudomonas putida,9 Gram-positive Streptomyces lividans10 and Rhodococcus jostii,11 the yeast Saccharomyces cerevisiae12 as well as the baculovirus expression system in insect cells,13 although these alternative host-systems are not routinely used. It is generally considered that intrinsic difficulties in producing soluble protein can be overcome through the use of a host expression system that is more closely related to the target protein and therefore Mycobacterium smegmatis, which is a faster-growing mycobacterium and often used as a non-pathogenic model of Mtb, has been successfully utilised as a host system for the expression of a number of recombinant Mtb and mycobacterial proteins.3,6 To date, 55 structures of Mtb proteins expressed in M. smegmatis are now deposited in the protein data bank indicating the importance of this host-expression system for recombinant mycobacterial protein production.14 The advantages of using M. smegmatis as an expression system are numerous. The constitutive production of mycobacterial chaperones is likely to assist in the correct folding of the Mtb recombinant protein and is an approach that has been used successfully as a strategy in E. coli whereby mycobacterial chaperones are co-expressed with the target Mtb protein to obtain soluble proteins.15,16 Furthermore, the availability of mycobacterial specific metabolites, ligands and/or binding protein/s may lead to the production of correctly folded active protein.17 An example is the successful production of the Mtb F420-binding protein that could only be produced in an M. smegmatis host expression system and is believed to be due to the absence of the required F420 cofactor in E. coli.18 Furthermore, we have recently shown that the expression system can play a key role in the selection of the incorporated cofactor. Production of the mycobacterial N-acetylglucosamine-6-phosphate deacetylase (NagA) enzyme results in the incorporation of different metal ions depending on the host expression system with more stable, active protein produced in the M. smegmatis host.19

The vectors and strains that are currently available for the overexpression of proteins in M. smegmatis are not as varied or developed as the tools that are well established for E. coli. Recent progress in this area, however, has led to the adaption of the T7 promoter-based vector system, a commonly used system for overexpression of proteins in E. coli,20 for use in M. smegmatis by the introduction of the RNA polymerase from the T7 bacteriophage and the generation of the M. smegmatis mc24517 host strain.21 An alternative host is the M. smegmatis groEL1ΔC strain that has a mutated C-terminal GroEL1 chaperone.22 This strain has also been developed to enable efficient purification of poly-histidine-tagged mycobacterial proteins by mutating the histidine rich region of the GroEL1 chaperone to reduce its co-purification along with the target-protein during immobilised metal affinity chromatography (IMAC) and has been used for the successful expression of mycobacterial proteins.22,23 The two main vector systems for recombinant protein expression in M. smegmatis are an acetamidase promoter based system and include the pSD24 and pMyNT25 series of expression vectors and the T7 promoter-based vectors that include the shuttle vectors pYUB1049 and pYUB1062,21 a Gateway cloning system pDESTsmg6 and the pYUBDUET26 vector for the co-expression of proteins. Although the M. smegmatis expression host has improved the production of recombinant mycobacterial proteins it does not always result in correctly folded proteins. Therefore, we wanted to develop a mycobacterial expression vector using GFP as an indicator to rapidly monitor protein production in M. smegmatis. This is an approach that has been used successfully in E. coli where a C-terminal GFP tag has been used as a reporter for the production of correctly folded globular and membrane proteins.27–29 A direct correlation is observed between the GFP fluorescence and the production of correctly folded protein that is fused upstream and the reported output sensitive to protein misfolding and aggregation, with no detectable fluorescence observed when the protein is expressed in inclusion bodies.

In this study we demonstrate a widely applicable, efficient protocol using GFP as a tool for monitoring the overexpression of mycobacterial proteins in M. smegmatis. We have adapted the inducible T7-promoter based pYUB1062 vector and incorporated a cleavable C-terminal GFP-His6 reporter tag to produce the target protein as a GFP-fusion. We show that protein production can be efficiently monitored by GFP fluorescence in situ in whole M. smegmatis cells. The GFP-fusion-tag does not hinder protein production and can be used to monitor purification strategies by simple procedures that include in-gel fluorescence. Taken together, our findings demonstrate that GFP is suitable for widespread use as an excellent tool for the rapid production of mycobacterial proteins in an M. smegmatis host.

Materials and methods

Materials and reagents

All chemicals and reagents were purchased from Sigma-Aldrich, unless specified. PCR and restriction enzymes were obtained from New England Biolabs.

Bacterial strains and media

E. coli strain Top10 (Invitrogen) was used for the cloning of expression constructs. Transformations were selected using Luria Bertani (LB) medium containing the appropriate antibiotics. M. smegmatis mc24517 strain (a gift from Professor W. R. Jacobs) was used for protein expression and was maintained in LB medium supplemented with 0.2% glycerol and 0.05% Tween-80 with the addition of the appropriate antibiotics.

Construction of pYUB1062-GFP

The DNA fragment encoding the tobaccoetch virus (TEV) protease cleavage site and GFP fragment from the plasmid pWaldoD was amplified by PCR using the primers listed in Table 1. The PCR product was digested with the HindIII restriction enzyme and ligated into the HindIII site of the digested pYUB1062 vector (a gift from Professor W. R. Jacobs). The correct orientation of the TEV-GFP insert was confirmed by DNA sequencing. Subsequently, targeted site-directed mutagenesis was performed to maintain a single HindIII site within the multiple-cloning site using the primers listed in Table 1 and remove the second HindIII, incorporated during the first cloning step, with Phusion Polymerase and the PCR cycle (98 °C, 30 s; 25 cycles of 98 °C, 10 s; 60 °C, 30 s; 72 °C, 4 min; followed by 5 min at 72 °C), followed by digestion with DpnI. Plasmid sequences were verified by DNA sequencing (GATC) and removal of the additional HindIII site confirmed. This resulted in the formation of the resulting expression vector designated pYUB1062-GFP.
Table 1 Oligonucleotides used in this study. Restriction recognition sites are in italics. Codon encoding the amino acid mutation is indicated in bold type
Name Use Sequence (5′–3′)
GFP_pYUB1062_F Clone GFP into pYUB1062 AAAAAAAAGCTTCCGAAAACCTGTACTTCCAGGGTC
GFP_pYUB1062_R Clone GFP into pYUB1062 AAAAAAAAGCTTTTTGTAGAGCTCATCCATGC
pYUB_GFP_SDM_F Mutate HindIII site TGAGCTCTACAAAAAGGGTGCGGCCGCACTCGAGC
pYUB_GFP_SDM_R Mutate HindIII site GCTCGAGTGCGGCCGCACCCTTTTTGTAGAGCTCA
NagA_pYUBGFP_F Clone nagA into pYUB1062-GFP CACCAACATATGCTGCTGACCGCCGACACCGTG
NagA_pYUBGFP_R Clone nagA into pYUB1062-GFP TATAAAAAGCTTCCACCGTGTGCGCCGCGCCG


Cloning of nagA

The full-length N-acetylglucosamine-6-phosphate deacetylase (nagA) gene from Mycobacterium smegmatis was amplified by polymerase chain reaction from the MSnagA_pYUB1062 vector.19 The primer sequences used are listed in Table 1. The PCR products were digested with NdeI and HindIII and ligated into the pYUB1062-GFP vector digested with the same restriction enzymes, resulting in the construct nagA_pYUB1062-GFP. Plasmid sequences were verified by DNA sequencing (GATC) and used for protein expression.

Growth curves

Growth of M. smegmatis in the presence/absence of pYUB1062, pYUB1062-GFP and nagA_pYUB1062-GFP were carried out in a 96-well microtiter plate (Falcon: black with clear bottom). The optical density at 600 nm (OD600) and the GFP fluorescence at λex 485 nm λem 535 nm were monitored simultaneously every 20 min for 100 hours at 37 °C (orbital shaking 430 rpm, Tecan Infinite F200, gain 35). Cultures were grown in LB medium supplemented with 0.05% Tween-80, 0.2% glycerol with addition of the appropriate antibiotics. Induction of protein was at OD600 of 0.6 by the addition of acetamide (0–0.4%). All experiments were carried out in triplicate. The curves were fitted to the data points using the Lowess fit in GraphPad Prism V7.

Protein expression of GFP and NagA-GFP in Mycobacterium smegmatis

M. smegmatis mc24517 electrocompetent cells were transformed with either the pYUB1062-GFP or the nagA_pYUB1062-GFP construct and grown at 37 °C to an OD600 of 0.6 in LB medium supplemented with 0.05% Tween-80, 0.2% glycerol, 25 μg mL−1 kanamycin and 100 μg mL−1 hygromycin. Protein production was induced with 0.2% acetamide and the cultures were grown at 37 °C for an additional 20 hours with shaking (180 rpm). Prior to harvesting the cells, 1 mL of the culture was removed, centrifuged (10 min, 4 °C), the pellet resuspended in 100 μL PBS and the fluorescence monitored at λex 485 nm λem 535 nm (Tecan Infinite F200, gain 35). The cells were then centrifuged (5000g, 30 min 4 °C) and the pellets frozen at −80 °C until further use.

Protein purification

The cells were resuspended in lysis buffer (20 mM Tris, 300 mM NaCl, 10% glycerol pH 8.0 (buffer A)) supplemented with 0.1% Triton-X 100, Complete Protease Inhibitor Cocktail (Pierce), 5 mM MgCl2, 2 mg DNase and 20 mg lysozyme. The resuspended pellet was incubated at 37 °C for 10 min and the pellet passed through a Cell Disruptor at 25 kpsi at 4 °C (Constant Systems). Following centrifugation (27[thin space (1/6-em)]000g, 40 min, 4 °C) the supernatant was filtered (0.45 μm pore size) before loading onto a pre-equilibrated immobilised metal affinity chromatography (IMAC) resin.

GFP protein purification

The GFP protein was purified by loading onto a pre-equilibrated HisPur Ni2+-affinity resin (Thermo Scientific). The column was washed with buffer A (5 column volumes) and the recombinant GFP protein was eluted from the Ni2+-resin with increasing concentrations of imidazole. Fractions containing the purified GFP protein as determined by SDS-PAGE were dialysed at 4 °C for 12 hours against buffer A and the concentration of the protein determined.

NagA protein purification

The lysate containing the NagA-GFP-His6 protein (∼69 kDa) was loaded onto a pre-equilibrated HisPur Co2+-affinity resin (Thermo Scientific). The column was washed with buffer A (5 column volumes) and the recombinant NagA-GFP-His6 protein was eluted from the Co2+-resin with increasing concentrations of imidazole. Fractions containing the NagA-GFP-His6 protein were dialysed at 4 °C for 12 hours against 20 mM Tris–HCl, 100 mM NaCl, 10% glycerol pH 8.0 (buffer B) at 4 °C for 12 hours and applied to a HiTrap Q-column (1 mL, GE Healthcare Life Sciences) pre-equilibrated with buffer B and eluted with NaCl (0.1–1 M). Fractions containing NagA-GFP-His6 were pooled and purified further using size exclusion chromatography. Gel filtration experiments were carried out on a Superdex 200 16/60 column (GE Healthcare) using 20 mM Bis-Tris, 300 mM NaCl, 10% glycerol pH 8.0 (buffer C). To remove the C-terminal GFP-His6-tag, fractions containing purified NagA-GFP-His6, as determined by SDS-PAGE, were pooled and incubated for 16 hours at 4 °C in the presence of the histidine-tagged super TEV protease (1[thin space (1/6-em)]:[thin space (1/6-em)]30 ratio) and loaded onto a pre-equilibrated Ni2+-affinity resin. The column was washed with buffer C (5 column volumes) and the cleaved NagA protein (∼40 kDa) was eluted from the Ni2+ resin. The NagA protein was pooled, 0.03% DDM and 1 mM DTT were added and the protein concentrated to 5–10 mg mL−1 (Vivaspin 2, GE Healthcare) and stored at −80 °C.

In-gel fluorescence readings

Samples for SDS-PAGE were mixed in a 1[thin space (1/6-em)]:[thin space (1/6-em)]1 ratio with loading buffer (100 mM Tris, pH 6.8, 4% w/v SDS, 0.2% w/v bromophenol blue, 10% v/v β-mercaptoethanol and 20% v/v glycerol) and samples (20 μL) loaded directly onto the gel (Any kD™ Mini-PROTEAN TGX Precast protein gels, Bio-Rad) without heating. Following SDS-PAGE the in-gel fluorescence was imaged immediately using λex 460 nm and the detection filter 510DF10 GFP, with an exposure time of 0.25 s (ImageQuant LAS 4000). The exposure time was selected to ensure that the brightest bands were not saturated. The gels were then stained with InstantBlue™ Coomassie Protein stain.

Circular dichroism (CD) analysis

Purified proteins (0.3 mg mL−1) were dialysed in 20 mM Tris, 100 mM NaCl, 10% glycerol pH 8.0 buffer, transferred into a 1 mm path length quartz cuvette and analysed on Jasco J-810 DC spectrometer from 198–260 nm. Spectra were acquired in triplicate and averaged after subtraction of the buffer background.

NagA assay

The activity of both NagA and NagA-GFP-His6 enzymes was measured at 37 °C in an end point assay by following the production of the fluorescent product formed with fluorescamine and primary amines at λex 340 nm λem 460 nm. The reaction was carried out in a 96-well microtiter plate in 20 mM Bis–Tris, 300 mM NaCl, 10% glycerol pH 7.0 in a total reaction volume of 50 μL. The reaction was initiated by the addition of N-acetylglucosamine-6-phosphate (GlcNAc6P) (Carbosynth) and terminated by the addition of 50 μL 0.4 M borate buffer pH 10, 40 μL 5 mM fluorescamine (Carbosynth) and 50 μL DMF and the production of fluorescence was monitored (Tecan Infinite M200, constant gain). The production of free amine was quantified with a glucosamine standard. The kinetic parameters Km and Vmax were determined by fitting the velocity versus substrate concentration to the Michaelis-Menten equation using non-linear regression analysis (GraphPad Prism, V7). All measurements were performed in triplicate.

Results and discussion

Construction of the M. smegmatis pYUB1062-GFP expression vector

In order to construct a fluorescent indicator system to monitor recombinant protein expression in M. smegmatis mc24517 we selected the pYUB1062 shuttle vector,21 which can replicate in both E. coli and M. smegmatis, for incorporation of a C-terminal fused GFP-His6-tag to the target protein. The pYUB1062 vector contains a T7 promoter for inducible protein expression with either acetamide or IPTG, and a multiple-cloning site introduced from the pET30a plasmid and allows for either an N- and/or C-terminal His6-tag/s for purification depending on the selected cloning strategy (Fig. 1). The pWaldo vector, which contains a C-terminal GFP-His8 tag that can be cleaved by the tobacco etch virus (TEV) protease (Fig. 1(a)), has been used extensively to monitor the production of both cytosolic and membrane proteins in E. coli.27–29 In order to convert the pYUB1062 vector (Fig. 1(b)) we amplified the TEV-GFP DNA fragment from the pWaldo vector and inserted this fragment into the HindIII site of the pYUB1062 vector that following translation, is in-frame with the translational start-site (Fig. 1(c)) and encodes for a C-terminal His6 affinity-tag. The introduction of this fragment resulted in the generation of an additional HindIII site into the backbone of the vector (Fig. 1(c)), which was subsequently removed by using targeted site-directed mutagenesis. The resultant plasmid is shown in Fig. 1(d) and is designated pYUB1062-GFP.
image file: c8ra06237d-f1.tif
Fig. 1 Construction of pYUB1062-GFP vector. (a) Schematic of the pWaldoD vector. (b) Schematic of the pYUB1062 vector. (c) Insertion of the TEV-DNA fragment into the pYUB1062 vector. The second HindIII site that is introduced is highlighted in red dashed lines. (d) The final arrangement of the pYUB1062-GFP vector.

Production and purification of GFP from M. smegmatis mc24517

To test whether we were able to express and monitor the production of GFP in situ we first overexpressed GFP from the pYUB1062-GFP vector in M. smegmatis mc24517. M. smegmatis mc24517 strains, in the presence or absence of pYUB1062 or the newly constructed pYUB1062-GFP, were grown to an OD600 of 0.6 in LB expression media followed by induction with acetamide (0–0.4%). The growth rate and the fluorescence output following induction were measured simultaneously in whole cells (Fig. 2). The induction time point is defined as t = 0. Compared to the non-induced control, the addition of acetamide did not influence the growth of the different M. smegmatis strains over the 100 hours time period for all concentrations of acetamide tested (Fig. 2). Importantly, we were able to monitor the production of GFP in the pYUB1062-GFP M. smegmatis mc24517 expression strain in situ (Fig. 2(f)). The GFP fluorescence increased continually until it reached a maximum level (corresponding to 13 μg GFP, 100 μL culture), as detected by fluorescence analysis, at approximately 20–30 hours after induction (Fig. 2(e) and (f)). The high level of recombinant GFP expression was not detrimental to the growth of M. smegmatis which is important in the development of a mycobacterial reporter recombinant protein expression system (Fig. 2(e)). We did, however, observe that GFP was expressed at the t = 0 induction time point and that the highest level of GFP produced were in the absence of acetamide (Fig. 2(f)), indicating that the promoter is not tightly regulated and that background expression occurs. In E. coli the T7/lac promoter is well known to result in leaky expression due to the negative control of the lac promoter.4,30
image file: c8ra06237d-f2.tif
Fig. 2 Growth rates and green fluorescent protein (GFP) fluorescence of M. smegmatis mc24517 strains. (a) Growth curve of M. smegmatis mc24517. (b) GFP fluorescence of M. smegmatis mc24517. (c) Growth curve of M. smegmatis mc24517 transformed with pYUB1062. (d) GFP fluorescence of M. smegmatis mc24517 transformed with pYUB1062. (e) Growth curve of M. smegmatis mc24517 transformed with pYUB1062-GFP. (f) GFP fluorescence of M. smegmatis mc24517 transformed with pYUB1062-GFP. Concentrations of acetamide induction: red – 0%, green – 0.1%, blue – 0.2%, black – 0.4%. The dashed black line represents a media only control.

To demonstrate that the GFP protein produced from this expression system can be purified and the purification steps monitored, we produced GFP in 1 L culture volume. We used the conditions optimised from our test conditions and grew M. smegmatis mc24517 transformed with pYUB1062-GFP to an OD600 of 0.6 and then induced the culture with 0.2% acetamide for 20 hours. At this final 20 hour time point we measured the GFP fluorescence in whole cells which corresponded to the production of 100 mg of GFP. The M. smegmatis cell pellets were then lysed by cell disruption and the His6-tagged GFP protein purified by Ni2+-affinity-chromatography (IMAC) and the eluted fractions were monitored by in-gel fluorescence and Coomassie blue staining (Fig. 3). Following purification, we isolated ∼50 mg of purified GFP protein, indicating that there is a good correlation between the GFP that is detected in the whole M. smegmatis cells and the amount of protein purified. It is likely that incomplete cell lysis and loss of protein during the purification steps resulted in the slightly lower yield of purified GFP protein than expected. It was of interest to note that the yield of purified GFP obtained from the M. smegmatis expression host is comparable to the yields of GFP that we have obtained previously from E. coli. This indicates that the GFP-reporter for protein overexpression in an M. smegmatis expression system is robust and comparable to the fluorescent based systems that are routinely used for protein over-expression in E. coli28,29 and S. cerevisiae hosts.31


image file: c8ra06237d-f3.tif
Fig. 3 Expression and purification of GFP in M. smegmatis mc24517. (a) SDS-PAGE analysis of the overexpression and purification of GFP-His6. (b) In-gel fluorescence of the overexpression and purification of GFP-His6. M: molecular weight markers in kDa, S: soluble lysate, FT: column flow-through, W: buffer wash-step, numbers 20–1000 refer to the imidazole concentration in the elution buffer (units mM).

Production of NagA-GFP-His6 in M. smegmatis mc24517

To assess the use of the pYUB1062-GFP reporter strain for the production of mycobacterial proteins for structural and functional studies we chose to produce the mycobacterial N-acetylglucosamine-6-phosphate (NagA) enzyme as a GFP-fusion. We have recently produced NagA as His6-tag fused protein in M. smegmatis mc24517 as soluble protein19 and therefore wanted to determine whether the GFP-fusion impacted on the expression yield and/or activity of this protein. To determine the optimal expression conditions for the NagA-GFP-His6 fusion we measured the OD600 and the GFP fluorescence for 24 hours in microtiter plates following induction at an OD600 of 0.6 with acetamide (0–0.4%) and observed that the fluorescent signal continued to increase indicating that the NagA-GFP-His6 protein continued to be produced over this time period (ESI, Fig. S1). It was interesting to note that the results from this primary screen indicated that the optimal conditions to produce the NagA-GFP-His6 protein were identical to those used previously to produce His6-tagged NagA in 1 L culture volume.19 Following cell lysis, the NagA-GFP-His6 protein was purified by Co2+-affinity, anion exchange and size-exclusion chromatography (Fig. 4). The eluted fractions from each purification step were monitored by analysis of the fluorescence output using a plate-reader and by in-gel fluorescence following SDS-PAGE. The addition of the C-terminal GFP-tag to the NagA protein enabled the purification strategy to be optimised and monitored more rapidly compared to our previous studies with NagA as a His6 fusion.19 The quality of the protein can be assessed through a comparison of the intensity of the NagA-GFP-His6 fusion compared to GFP alone and has been used routinely to monitor the integrity of membrane proteins expressed as GFP proteins in E. coli.27–29,31–34 Our in-gel fluorescence results for the whole cell lysate indicated a prominent signal at ∼70 kDa with only low levels of GFP detected at ∼27 kDa, indicating that the NagA-GFP-His6 fusion protein was intact and minimal degradation had occurred (Fig. 4(a)). The size-exclusion profile indicated that the NagA-GFP-His6 fusion protein forms a dimer in solution (ESI, Fig. S2) which was also observed for the NagA-His6 fusion19 and is characteristic to NagA proteins from other bacterial species.35,36 This is important and indicates that the oligomeric complex is not altered by the addition of the C-terminal GFP-tag which has also been found for proteins expressed as GFP fusions in both E. coli28,32–34 and S. cerevisiae systems.31 To obtain NagA, the purified NagA-GFP-His6 fusion was digested with TEV-His6 protease. The digest went to completion and NagA was obtained by removal of the digested GFP-His6-tag and the TEV-His6 protease by Ni2+-chromatography (ESI, Fig. S3). Correct folding of the NagA-GFP-His6 fusion and NagA following cleavage of the GFP-His6-tag was confirmed by circular-dichroism (ESI, Fig. S4). The amount of purified NagA protein that we produced following expression as a GFP fusion (∼1 mg L−1 culture) was comparable to that obtained as a His6 fusion19 (∼2 mg L−1 culture) indicating, importantly, that the GFP reporter was not detrimental to the production and purification of the NagA protein.
image file: c8ra06237d-f4.tif
Fig. 4 Purification of NagA-GFP-His6 expressed in M. smegmatis mc24517. (a) Elution of NagA-GFP-His6 protein from a Co2+ column. M: molecular weight markers in kDa, WC: cell lysate, S: soluble lysate, FT: column flow through, NaCl: 1 M NaCl wash, numbers 0 to 1000 refer to the imidazole concentration in the buffer (units mM). (b) QHP anion chromatography of NagA-GFP-His6 following the Co2+ purification step. L: protein loaded onto the column after the Co2+ purification step, FT: column flow through, numbers 100–1000 indicate the NaCl concentrations in the elution buffer (units of mM). (c) Size exclusion chromatography of NagA-GFP-His6. (i) SDS-PAGE stained with Coomassie blue, (ii) corresponding in-gel fluorescence analysis and (iii) GFP fluorescence readings following the purification steps of NagA-GFP-His6.

Activity of NagA-GFP-His6

To confirm that production of protein fused with a C-terminal GFP reporter tag is not detrimental to activity of the protein we examined the catalytic deacetylation of N-acetylglucosamine-6-phosphate (GlcNAc6P) substrate by both the NagA-GFP-His6 fusion and the cleaved NagA enzymes using a fluorescence assay.19 The measurements indicated that both NagA-GFP-His6 and NagA were able to deacetylate GlcNAc6P with Km values of 6.7 mM and 6.2 mM and kcat values of 96.4 s−1 and 111 s−1 respectively (Fig. 5 and Table 2). Importantly, the kinetic values that were obtained are comparable to the values previously obtained for the NagA-His6 fused protein also expressed in M. smegmatis.19 This indicates that a C-terminal GFP fusion is not detrimental to the correct folding or catalytic activity of recombinant proteins expressed in M. smegmatis.
image file: c8ra06237d-f5.tif
Fig. 5 Kinetics of NagA-GFP-His6 before and after cleavage of the GFP-tag. (a) Kinetics of NagA-GFP-His6, (b) kinetics of NagA after cleavage of the GFP-His6-tag. Each assay was carried out in triplicate and expressed as ± standard deviation.
Table 2 Steady-state kinetic parameters of NagA
Enzyme Fusion-tag GFP-tag Km (mM) kcat (s−1) kcat/Km × 103 M− 1s−1 Reference
NagA His6 Not cleaved 3.2 ± 0.4 91.1 ± 5.5 28.8 ± 3.8 19
NagA GFP-His6 Not cleaved 6.7 ± 1.2 96.4 ± 7.3 14.4 ± 2.8 This study
NagA GFP-His6 Cleaved 6.2 ± 0.6 111.0 ± 4.0 17.9 ± 1.8 This study


Conclusions

In this study we have developed a new method for the efficient screening and purification of mycobacterial proteins using GFP and M. smegmatis as a host expression system. Here we have identified that GFP can be used as a fluorescence indicator to monitor recombinant protein expression in situ in whole M. smegmatis cells and that it is possible to undertake preliminary screens in microtiter plate format for the rapid optimisation of overexpression conditions. The GFP-reporter tag is not detrimental to the growth of M. smegmatis and enables the purification of mycobacterial proteins to be rapidly monitored from the GFP fluorescence detection. The activity of the NagA enzyme produced as a NagA-GFP-His6 fusion was comparable with the NagA-His6 fusion indicating that the C-terminal GFP tag does not affect the biochemical function of the target protein. We hope that the availability of the pYUB1062-GFP protein expression system will facilitate the production of Mtb, and other mycobacterial, proteins and enable new structural and biochemical insights into this important global pathogen.

Conflicts of interest

There are no conflicts of interest to declare.

Abbreviations

XDR-TBExtensively-drug resistant
IMACImmobilised metal affinity chromatography
MtbMycobacterium tuberculosis
GlcNAc6PN-Acetylglucosamine-6-phosphate
NagAN-Acetylglucosamine-6-phosphate deacetylase
ODOptical density
TEVTobacco etch virus
TBTuberculosis

Acknowledgements

The authors would like to thank Professor William R. Jacobs (Albert Einstein College of Medicine, USA) for providing us with the pYUB1062 expression vector and the M. smegmatis mc24517 expression system. We would like to thank Dr Debs Brotherton for providing the His-TEV protease used in these studies. This work was supported by a Sir Henry Dale Fellowship to EF jointly funded by the Wellcome Trust and Royal Society (Grant number 104193/Z/14/Z), a grant from the Wellcome Trust (Grant number 201442/Z/16/Z) and a research grant from the Royal Society (Grant number RG120405).

References

  1. WHO Global Tuberculosis Report, http://www.who.int/tb/publications/global_report/en/).
  2. R. S. Wallis, M. Maeurer, P. Mwaba, J. Chakaya, R. Rustomjee, G. B. Migliori, B. Marais, M. Schito, G. Churchyard, S. Swaminathan, M. Hoelscher and A. Zumla, Lancet Infect. Dis., 2016, 16, e34–46 CrossRef PubMed.
  3. G. Bashiri and E. N. Baker, Protein Sci., 2015, 24, 1–10 CrossRef PubMed.
  4. G. L. Rosano and E. A. Ceccarelli, Front Microbiol., 2014, 5, 172 Search PubMed.
  5. G. Georgiou and P. Valax, Curr. Opin. Biotechnol., 1996, 7, 190–197 CrossRef PubMed.
  6. R. M. Goldstone, N. J. Moreland, G. Bashiri, E. N. Baker and J. Shaun Lott, Protein Expression Purif., 2008, 57, 81–87 CrossRef PubMed.
  7. F. R. Blattner, G. Plunkett 3rd, C. A. Bloch, N. T. Perna, V. Burland, M. Riley, J. Collado-Vides, J. D. Glasner, C. K. Rode, G. F. Mayhew, J. Gregor, N. W. Davis, H. A. Kirkpatrick, M. A. Goeden, D. J. Rose, B. Mau and Y. Shao, Science, 1997, 277, 1453–1462 CrossRef PubMed.
  8. S. T. Cole, R. Brosch, J. Parkhill, T. Garnier, C. Churcher, D. Harris, S. V. Gordon, K. Eiglmeier, S. Gas, C. E. Barry 3rd, F. Tekaia, K. Badcock, D. Basham, D. Brown, T. Chillingworth, R. Connor, R. Davies, K. Devlin, T. Feltwell, S. Gentles, N. Hamlin, S. Holroyd, T. Hornsby, K. Jagels, A. Krogh, J. McLean, S. Moule, L. Murphy, K. Oliver, J. Osborne, M. A. Quail, M. A. Rajandream, J. Rogers, S. Rutter, K. Seeger, J. Skelton, R. Squares, S. Squares, J. E. Sulston, K. Taylor, S. Whitehead and B. G. Barrell, Nature, 1998, 393, 537–544 CrossRef PubMed.
  9. N. A. Lack, A. Kawamura, E. Fullam, N. Laurieri, S. Beard, A. J. Russell, D. Evangelopoulos, I. Westwood and E. Sim, Biochem. J., 2009, 418, 369–378 CrossRef PubMed.
  10. M. Lara, L. Servin-Gonzalez, M. Singh, C. Moreno, I. Cohen, M. Nimtz and C. Espitia, Appl. Environ. Microbiol., 2004, 70, 679–685 CrossRef PubMed.
  11. J. Carere, S. E. McKenna, M. S. Kimber and S. Y. Seah, Biochemistry, 2013, 52, 3502–3511 CrossRef PubMed.
  12. J. Deng, L. Bi, L. Zhou, S. J. Guo, J. Fleming, H. W. Jiang, Y. Zhou, J. Gu, Q. Zhong, Z. X. Wang, Z. Liu, R. P. Deng, J. Gao, T. Chen, W. Li, J. F. Wang, X. Wang, H. Li, F. Ge, G. Zhu, H. N. Zhang, J. Gu, F. L. Wu, Z. Zhang, D. Wang, H. Hang, Y. Li, L. Cheng, X. He, S. C. Tao and X. E. Zhang, Cell Rep., 2014, 9, 2317–2329 CrossRef PubMed.
  13. F. Bigi, O. Taboga, M. I. Romano, A. Alito, J. C. Fisanotti and A. A. Cataldi, Braz. J. Med. Biol. Res., 1999, 32, 29–37 CrossRef PubMed.
  14. https://www.rcsb.org/.
  15. J. Neres, F. Pojer, E. Molteni, L. R. Chiarelli, N. Dhar, S. Boy-Rottger, S. Buroni, E. Fullam, G. Degiacomi, A. P. Lucarelli, R. J. Read, G. Zanoni, D. E. Edmondson, E. De Rossi, M. R. Pasca, J. D. McKinney, P. J. Dyson, G. Riccardi, A. Mattevi, S. T. Cole and C. Binda, Sci. Transl. Med., 2012, 4, 150ra121 Search PubMed.
  16. S. M. Batt, T. Jabeen, V. Bhowruth, L. Quill, P. A. Lund, L. Eggeling, L. J. Alderwick, K. Futterer and G. S. Besra, Proc. Natl. Acad. Sci. U. S. A., 2012, 109, 11354–11359 CrossRef PubMed.
  17. R. Qamra, S. C. Mande, A. R. Coates and B. Henderson, Tuberculosis, 2005, 85, 385–394 CrossRef PubMed.
  18. G. Bashiri, E. F. Perkowski, A. P. Turner, M. E. Feltcher, M. Braunstein and E. N. Baker, PLoS One, 2012, 7, e45003 CrossRef PubMed.
  19. M. S. Ahangar, C. M. Furze, C. S. Guy, C. Cooper, K. S. Maskew, B. Graham, A. D. Cameron and E. Fullam, J. Biol. Chem., 2018, 293(25), 9770–9783 CrossRef PubMed.
  20. P. Davanloo, A. H. Rosenberg, J. J. Dunn and F. W. Studier, Proc. Natl. Acad. Sci. U. S. A., 1984, 81, 2035–2039 CrossRef.
  21. F. Wang, P. Jain, G. Gulten, Z. Liu, Y. Feng, K. Ganesula, A. S. Motiwala, T. R. Ioerger, D. Alland, C. Vilcheze, W. R. Jacobs Jr and J. C. Sacchettini, Antimicrob. Agents Chemother., 2010, 54, 3776–3782 CrossRef PubMed.
  22. E. E. Noens, C. Williams, M. Anandhakrishnan, C. Poulsen, M. T. Ehebauer and M. Wilmanns, BMC Biotechnol., 2011, 11, 27 CrossRef PubMed.
  23. M. T. Ehebauer, M. Zimmermann, A. J. Jakobi, E. E. Noens, D. Laubitz, B. Cichocki, H. Marrakchi, M. A. Laneelle, M. Daffe, C. Sachse, A. Dziembowski, U. Sauer and M. Wilmanns, PLoS Pathog., 2015, 11, e1004623 CrossRef PubMed.
  24. S. Daugelat, J. Kowall, J. Mattow, D. Bumann, R. Winter, R. Hurwitz and S. H. Kaufmann, Microbes Infect., 2003, 5, 1082–1095 CrossRef PubMed.
  25. C. Poulsen, S. Holton, A. Geerlof, M. Wilmanns and Y. H. Song, FEBS Lett., 2010, 584, 669–674 CrossRef PubMed.
  26. G. Bashiri, A. M. Rehan, D. R. Greenwood, J. M. Dickson and E. N. Baker, PLoS One, 2010, 5, e15803 CrossRef PubMed.
  27. D. Drew, D. J. Slotboom, G. Friso, T. Reda, P. Genevaux, M. Rapp, N. M. Meindl-Beinker, W. Lambert, M. Lerch, D. O. Daley, K. J. Van Wijk, J. Hirst, E. Kunji and J. W. De Gier, Protein Sci., 2005, 14, 2011–2017 CrossRef PubMed.
  28. D. E. Drew, G. von Heijne, P. Nordlund and J. W. de Gier, FEBS Lett., 2001, 507, 220–224 CrossRef PubMed.
  29. G. S. Waldo, B. M. Standish, J. Berendzen and T. C. Terwilliger, Nat. Biotechnol., 1999, 17, 691–695 CrossRef PubMed.
  30. S. K. Kim, D. H. Lee, O. C. Kim, J. F. Kim and S. H. Yoon, ACS Synth. Biol., 2017, 6, 1766–1773 CrossRef PubMed.
  31. S. Newstead, H. Kim, G. von Heijne, S. Iwata and D. Drew, Proc. Natl. Acad. Sci. U. S. A., 2007, 104, 13936–13941 CrossRef PubMed.
  32. L. E. Bird, H. Rada, A. Verma, R. Gasper, J. Birch, M. Jennions, J. Lwe, I. Moraes and R. J. Owens, J. Visualized Exp., 2015,(95), e52357 Search PubMed.
  33. L. E. Bird, J. E. Nettleship, V. Jarvinen, H. Rada, A. Verma and R. J. Owens, Adv. Exp. Med. Biol., 2016, 922, 1–11 CrossRef PubMed.
  34. D. Drew, M. Lerch, E. Kunji, D. J. Slotboom and J. W. de Gier, Nat. Methods, 2006, 3, 303–313 CrossRef PubMed.
  35. F. Vincent, D. Yates, E. Garman, G. J. Davies and J. A. Brannigan, J. Biol. Chem., 2004, 279, 2809–2816 CrossRef PubMed.
  36. F. M. Ferreira, G. Mendoza-Hernandez, M. Castaneda-Bueno, R. Aparicio, H. Fischer, M. L. Calcagno and G. Oliva, J. Mol. Biol., 2006, 359, 308–321 CrossRef PubMed.

Footnote

Electronic supplementary information (ESI) available. See DOI: 10.1039/c8ra06237d

This journal is © The Royal Society of Chemistry 2018
Click here to see how this site uses Cookies. View our privacy policy here.