Hyeokjune
Choi
a,
Hyoin
Park
a,
Kabi
Son
b,
Ho Min
Kim
bc and
Yongwon
Jung
*a
aDepartment of Chemistry, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34143, Republic of Korea. E-mail: ywjung@kaist.ac.kr; Fax: +82-42-350-2810; Tel: +82-42-350-2817
bGraduate School of Medical Science & Engineering, KAIST, Daejeon 34141, Republic of Korea
cCenter for Biomolecular & Cellular Structure, Institute for Basic Science (IBS), Daejeon 34126, Republic of Korea
First published on 7th October 2019
Supramolecular protein assemblies have garnered considerable interest due to their potential in diverse fields with unrivaled attainable functionalities and structural accuracy. Despite significant advances in protein assembly strategies, inserting long linkers with varied lengths and rigidity between assembling protein building blocks remains extremely difficult. Here we report a series of green fluorescent protein (GFP) oligomers, where protein building blocks were linked via two independent peptide strands. Assembling protein units for this two-peptide assembly were designed by flopped fusion of three self-assembling GFP fragments with two peptide linkers. Diverse flexible and rigid peptide linkers were successfully inserted into high-valent GFP oligomers. In addition, oligomers with one flexible linker and one rigid linker could also be fabricated, allowing more versatile linker rigidity control. Linker length could be varied from 10 amino acids (aa) even up to 76 aa, which is the longest among reported protein assembling peptide linkers. Discrete GFP oligomers containing diverse linkers with valencies between monomers to decamers were monodispersely purified by gel elution. Furthermore, various functional proteins could be multivalently fused to the present GFP oligomers. Binding assays, size exclusion chromatography, dynamic light scattering, circular dichroism, differential scanning calorimetry, and transmission electron microscopy suggested circular geometries of the GFP oligomers and showed distinct characteristics of GFP oligomers with length/rigidity varied linkers. Lastly, a surface binding study indicated that more spaced oligomeric binding modules offered more effective multivalent interactions than less spaced modules.
Despite these rapidly developing protein assembly strategies, varying the length and rigidity of linkers between assembled protein units is still highly challenging. Most symmetric protein assembly approaches by computational design require short and rigid linkers between binding protein domains or even direct protein contact via designed interfaces.2,14 For protein polymers, incorporating relatively long (both peptide and synthetic chemical) linkers between monomeric building blocks leads to increased intramolecular assembly, which produces dominantly low-valent oligomers (Table S1†). Multiple studies have demonstrated the importance of spacing between assembled functional proteins. Optimization of the distance between multiple protein binding domains is critical to maximize binding affinities to multimeric target proteins via multivalent interactions.18,19 Spacing between multiple ligands against cell surface receptors is also one of the key regulation factors for cellular receptor signaling.20,21 At present, however, spacing between assembled proteins is mostly varied by adapting DNA scaffolds, which are synthetically fused to protein units. Developing new assembly strategies that allow variations on linkers between assembled proteins will greatly diversify the structures and functions of supramolecular protein assemblies.
Here, we developed a new in-cell protein assembly approach, where protein units (GFP) were linked via two peptide strand linkers with varied length and flexibility. The tripartite split GFP fragments (GFP 1–9, GFP 10 and GFP 11),22 which can self-assembled to form mature GFP, were utilized for this two-peptide linker assembly. Flopped fusion of three GFP fragments provided a protein building block with an ideal assembling orientation for a high-degree of intermolecular oligomerization via two peptide linkers (Scheme 1). Importantly, linkers with various lengths and flexibility could be inserted, while maintaining effective GFP oligomerization. By careful control of linker flexibility, we were able to fabricate highly spaced and flexible GFP oligomers, where the longest two-peptide linkers consisted of one 38-amino acids (aa) flexible and one 76-aa rigid peptide linkers. Various functional proteins were successfully displayed on these GFP oligomers, and discrete oligomers could be purified by gel electro-elution. The exact valence numbers of discrete GFP oligomers were confirmed by functional binding assays. We also assessed the spatial arrangements of oligomers by visualizing through transmission electron microscopy (TEM), and demonstrated how the linker rigidity affects the elution profiles of GFP oligomers during size-exclusion chromatography (SEC). Dynamic light scattering (DLS), circular dichroism (CD), and differential scanning calorimetry (DSC) analyses were also conducted to investigate various structural features of GFP oligomers with different linkers. Furthermore, we examined binding characteristics of binding domain-fused GFP oligomers with varied linkers using surface plasmon resonance (SRP) analysis.
Flexible 10 aa peptides were first introduced as two linkers (linker 1: GTDVGSGGGS and linker 2: GGGSGGGSTS; 10aa/10aa) (Table 1). When the resulting protein building block is overexpressed in E. coli, a wide range of fluorescent GFP oligomers from monomer to over decamer were effectively assembled, as demonstrated by native polyacrylamide gel electrophoresis (PAGE) (Fig. 1a). GFP oligomers were produced in an over 100 mg scale from 1 L of E. coli flask culture and easily purified using Ni-affinity chromatography. Oligomers were stable during multi-step purification processes such as sonication, centrifugation, and affinity column purification without any noticeable loss of fluorescence (Fig. S2†). More precise GFP oligomer valency was estimated by gel analysis in the presence of a 0.1% denaturing detergent, sodium dodecyl sulfate (SDS), for size-dependent separation. The oligomers were mostly stable under 0.1% SDS, showing discrete fluorescent bands that correspond to multiple GFP units, whereas only a monomer band was exhibited in boiled samples (Fig. 1b). While GFP oligomers were not discretely separated by SEC, SEC and native gel analysis with fluorescent signal quantitation indicate that close to 43% of expressed (10aa/10aa) GFP building blocks were polymerized in cells into high molecular weight oligomers (≥3 mer) (Fig. 1c and S3†).
GFP oligomers | Linker (linker 1: L1, linker 2: L2) | Amino acids |
---|---|---|
a GFP oligomers 4aa/4aa, 22aa/22aa, 30aa/30aa, and EP6/EP6 were only marginally oligomerized. | ||
4aa/4aaa | L1: GGGS | 4/4 |
L2: GSTS | ||
10aa/10aa | L1: GTDVGSGGGS | 10/10 |
L2: GGGSGGGSTS | ||
15aa/15aa | L1: GTDVGSGGGSGGGGS | 15/15 |
L2: GGSGGGGGSGGGSTS | ||
22aa/22aaa | L1: GTDVGSGGGSGGGGSGGGSEGS | 22/22 |
L2: GGSGGGGGSGGGSTSGGGSEGS | ||
30aa/30aaa | L1: GTDVGSGGGSGGGGSGGGSEGSGGSGGGSG | 30/30 |
L2: GGSGGGGGSGGGSTSGGGSEGSGGSGGGSG | ||
EKn/EKn | L1: GSGS(EAAAK)nGSGS (n = 2, 3, 4, 5, 6) | 5n+8/5n+8 |
L2: GSGS(EAAAK)nGSGS (n = 2, 3, 4, 5, 6) | ||
EPn/KPn | L1: GSGS(EP)nGSGS (n = 4, 6a) | 2n+8/2n+8 |
L2: GSGS(KP)nGSGS (n = 4, 6) | ||
(H4)2/(H4)2 | L1: GSGS(EAAAK)4ALEA(EAAAK)4GSGS | 52/52 |
L2: GSGS(EAAAK)4ALEA(EAAAK)4GSGS | ||
(H4)3/(H4)3 | L1: GSGS(EAAAK)4ALEA(EAAAK)4GSGS(EAAAK)4GSGS | 76/76 |
L2: GSGS(EAAAK)4ALEA(EAAAK)4GSGS(EAAAK)4GSGS | ||
(H4)2/30aa | L1: GSGS(EAAAK)4ALEA(EAAAK)4GSGS | 52/30 |
L2: GGSGGGGGSGGGSTSGGGSEGSGGSGGGSG | ||
(H4)3/38aa | L1: GSGS(EAAAK)4ALEA(EAAAK)4GSGS(EAAAK)4GSGS | 76/38 |
L2: GGSGGGGGSGGGSTSGGGSEGSGGSGGGSGGGGSGGGS | ||
(EP)6/22aa | L1: GSGS(EP)6GSGS | 20/22 |
L2: GGSGGGGGSGGGSTSGGGSEGS |
Monodisperse oligomers were next isolated in a size-dependent manner from monomer to decamer by gel electro-elution (Fig. 1d).12 Assembled and gel-purified GFP oligomers were intact during multistep gel elution processes. In addition, oligomers from monomer to hexamer all showed identical fluorescence spectra, indicating stable GFP maturation regardless of protein valency (Fig. S4†). This data also suggests that purified GFP oligomers were all intra-molecularly circularized forms without any free GFP 1–9 and GFP 10/GFP 11strands (Scheme 1). To further support circularization of GFP oligomers, free GFP 1–9 was added to purified oligomers as well as to GFP with free GFP 10/GFP 11 strands as a control. GFP 1–9 was assembled with free GFP 10/GFP 11 strands to form mature GFP but not with GFP oligomers, indicating the absence of free GFP 10/GFP 11 strands on GFP oligomers due to complete circularization (Fig. S5†). Eluted GFP oligomers can be concentrated over 0.3 mg mL−1 and were stable in phosphate-buffered saline (PBS) at 4 °C for at least 6 months. For GFP trimer and tetramer, 2–4 mg of each oligomer could be prepared from 1 L of E. coli culture, and 1–2 mg of GFP oligomers from 5 mer to 10 mer could be homogenously isolated from the same protein mixture.
Next, we introduced rigid peptide linkers such as helical EAAAK repeats (EKn) and proline rich EP/KP repeats (EPn/KPn).25,26 Identical helical peptides containing from 2 (EK2) to 6 (EK6) sequentially increased EAAAK repeats were inserted between GFP 1–9 and GFP10/11 strands (Table 1). In addition, much longer ((H4)2: 52 aa and (H4)3: 76 aa) but less stiff peptides,27 which contain 8 and 12 EAAAK repeats, respectively, were also introduced. Interestingly, these GFP building blocks with helical linkers were all successfully assembled into a series of high-valent GFP oligomers (Fig. 2b), unlike those with long fully flexible linkers. In addition, these oligomers were stably and homogeneously isolated again by gel-elution (Fig. 2b). Rigid linkers might prevent prompt intra-protein assemblies and allow more inter-protein assemblies to produce diverse GFP oligomers with similar valency distributions regardless of linker lengths. These GFP oligomers offer the first multivalent protein scaffolds, where backbone protein units were linked via two helical peptide linkers with incrementally variable lengths (from 18 aa to 76 aa). We also inserted short proline rich EP and KP repeats between ‘GFP 10 and GFP 1–9’ and ‘GFP 1–9 and GFP 11’, respectively. EP and KP repeats were observed in a natural stiff peptide,25 and here, both sequences were used together as two peptide linkers to avoid repulsion between two charged linkers. GFP oligomers bearing four repeats of EP and KP (EP4/KP4) were well constructed and purified as stable monodisperse forms, whereas the subunits with six repeats (EP6/KP6) were poorly oligomerized, presumably owing to high structural constraints on constructs (Fig. 2c).
To further increase the spacing of asymmetric GFP oligomers, linker 2 of the most extended (H4)3/(H4)3 GFP oligomers was replaced with a long 38 aa flexible peptide. The resulting (H4)3/38aa also showed a clear ladder form of oligomers on a native gel (Fig. 3a). The present two-peptide linker protein assembly allowed the insertion of extremely long peptide linkers between protein building blocks while also varying the rigidity of these linkers. To our knowledge, (H4)3/(H4)3 and (H4)3/38aa GFP oligomers represent the longest peptide linkers connecting monomeric protein subunits (Table S1†). Discrete GFP oligomers with long and asymmetric rigid/flexible peptide linkers were also stably and monodispersely isolated from the mixtures (Fig. 3b). We even further extended linkers to (H4)4/50aa. While (H4)4/50aa GFP oligomers showed a moderate oligomerization level, oligomers were not separated well on a native gel due to their large sizes (Fig. S8†), limiting the use of (H4)4/50aa as multivalent protein scaffolds. Additionally, we applied our assembly approach to other GFP color variants to examine how slight variation on the protein structure affects the present protein assembly and also if FP oligomers with different colors can be generated. Upon single residue mutations, yellow FP oligomers were successfully formed with both short and long linkers, while oligomeric assembly was clearly less efficient for cyan and blue proteins (Fig. S9†).
We utilized ligand-fused GFP oligomers to functionally confirm the accurate oligomer valencies (the numbers of GFP units), which were initially estimated based on oligomer sizes, as discussed in Fig. 1. To exploit rapid and stable interaction between leucine zipper ZE and ZR peptides (sequences in ESI†),28 the ZE peptide was N-terminally fused to various GFP oligomers with different linker length and rigidity such as 15aa/15aa, EK2/EK2, EP4/KP4, and (H4)2/30aa. All ZE fused proteins were well assembled to multivalent GFP oligomers, and each oligomer was homogeneously isolated (Fig. 5a and S10†). Precise ZR binding valencies of ZE-fused GFP oligomers were functionally examined by native gel band-shift analyses with ZR peptide-fused GFP (at the N-terminal) (GFP-ZR) (Fig. 5b). ZE-fused GFP oligomers were mixed with GFP-ZR at varying concentration ratios ranging from 0.5 to 2 (GFP-ZR/ZE-GFP oligomer, ZR/ZE concentration) for 1 h at 25 °C before gel analyses. ZE-GFP oligomer complexes with different numbers of bound GFP-ZR were clearly discriminated (Fig. 5b). Expected ZR binding valencies were clearly identified for all isolated oligomers with different two peptide linkers, as demonstrated with ZE-GFP dimers and tetramers (Fig. 5b and S11†). When two-fold excess of GFP-ZR was added, all fused ZE peptides on tested GFP oligomers were bound with GFP-ZR, indicating that all fused ZE peptides are functional and ideally oriented for ZR interaction on the present GFP oligomers with two peptide linkers.
Structural characteristics of GFP oligomers were also investigated with size-exclusion chromatography (SEC), which is widely used for separation of proteins by their size and shape. For typical proteins, larger proteins are eluted (or migrate) faster than smaller proteins. To examine different chromatographic migration properties of GFP oligomers with different linkers, three GFP oligomers with similar sizes but with different linker rigidities (ZE-fused 15aa/15aa, EK2/EK2, and EP4/KP4) were subjected to SEC. ZE-fused oligomers were used because they are functionally further defined (Fig. 5 and S11†). GFP building blocks of these oligomers have similar molecular weights (35.8, 36.6, and 36.7 kDa, respectively). Indeed, monomeric assemblies were all eluted at a similar time (Fig. 6b). Monomers are intra-molecularly assembled with only one subunit, and thereby peptide linkers will have minimal effects on protein shapes (Scheme 1). On the other hand, a trimeric assembly with flexible linkers (15aa/15aa) was clearly eluted later than those with more rigid linkers (EK2/EK2 and EP4/KP4) (Fig. 6b). While their assembly geometries are highly heterogeneous (Fig. 6a), oligomers with rigid linkers might have higher populations of more widely spaced and stiff GFP oligomers (therefore acting as bigger proteins) than oligomers with flexible linkers. It is possible that long flexible peptide linkers were in more folded forms rather than in fully extended forms during SEC.
Relative size distributions of discrete GFP oligomers were also measured by dynamic light scattering (DLS). Monomers to tetramers of GFP oligomers with short (15aa/15aa) and long ((H4)3/38aa) linkers were examined. Oligomer sizes clearly increased as the valency increased, and sizes of (H4)3/38aa oligomers were larger than those of 15aa/15aa oligomers (Fig. S14†). In addition, size variations were generally higher for more spaced (H4)3/38aa oligomers than 15aa/15aa oligomers. We also examined structural heat stability of GFP oligomers with differential scanning calorimetry (DSC) and a heat titration test. DSC data indicated two melting temperatures of GFP oligomers (Fig. S15†). We believe that the first melting event (∼60 °C) corresponds to oligomer disassembly into monomeric units, and the second one (∼67 °C) to complete denaturation of these GFP units. Fluorescence loss after heating oligomers at ∼60 °C might also indicate oligomer disassembly at the temperature (Fig. S16†). Secondary structures of GFP oligomers were analyzed by circular dichroism (CD), which showed dominant beta-sheet signals for both 15aa/15aa and (H4)3/38aa oligomers but strong alpha-helix signals only for (H4)3/38aa oligomers with their excessive helical linker (Fig. S17†).
On the high density Fc domain surface, binding stability of PG-GFP proteins was clearly enhanced by increased valency, showing significantly slowed dissociation, likely by multivalent binding (Fig. 7b). Regardless of linkers, PG-GFP monomers similarly bind to this Fc domain surface. On the other hand, enhanced binding stability was noticeably greater for GFP oligomers with the longer (H4)3/38aa linkers than those with shorter 15aa/15aa (Fig. S18†). The data indicate that more spaced (H4)3/38aa PG-GFP oligomers have a higher chance to offer multivalent interactions since they can cover wider areas with more Fc domains. Since Fc domains are randomly distributed on a surface, spaced PG oligomers might also have a higher chance to find an ideal combination of Fc domain proteins for more effective (stronger) multivalent interactions.
When the surface Fc domain coverage on flat gold surfaces was lowered (70 RU), slowed dissociation by increased valency of GFP oligomers was mostly vanished (Fig. 7c). It is possible that many surface Fc domains are too far apart for oligomers to capture them simultaneously for multivalent binding. Still, overall binding signals increased as the valency of GFP oligomers increased, even though an assembly concentration decreased from monomer (100 nM) to dimer (50 nM) and to trimer (33 nM). We suspect that multimer-monomer binding might be stronger than monomer–monomer binding. The present SPR study indicates that proper spacing between binding modules is important for effective multivalent interactions against multiple surface binding targets (e.g. cell surface receptors).
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c9sc04158c |
This journal is © The Royal Society of Chemistry 2019 |