Jussara
Amato
a,
Nunzia
Iaccarino
a,
Federica
D'Aria
a,
Francesco
D'Amico
b,
Antonio
Randazzo
a,
Concetta
Giancola
a,
Attilio
Cesàro
b,
Silvia
Di Fonzo
*b and
Bruno
Pagano
*a
aDepartment of Pharmacy, University of Naples Federico II, Naples, I-80131, Italy. E-mail: bruno.pagano@unina.it
bElettra-Sincrotrone Trieste S. C. p. A., Science Park, Trieste, I-34149, Italy. E-mail: silvia.difonzo@elettra.eu
First published on 24th February 2022
The promoter regions of important oncogenes such as BCL2 and KRAS contain GC-rich sequences that can form distinctive noncanonical DNA structures involved in the regulation of transcription: G-quadruplexes on the G-rich strand and i-motifs on the C-rich strand. Interestingly, BCL2 and KRAS promoter i-motifs are highly dynamic in nature and exist in a pH-dependent equilibrium with hairpin and even with hybrid i-motif/hairpin species. Herein, the effects of pH and presence of cell-mimicking molecular crowding conditions on conformational equilibria of the BCL2 and KRAS i-motif-forming sequences were investigated by ultraviolet resonance Raman (UVRR) and circular dichroism (CD) spectroscopies. Multivariate analysis of CD data was essential to model the presence and identity of the species involved. Analysis of UVRR spectra measured as a function of pH, performed also by the two-dimensional correlation spectroscopy (2D-COS) technique, showed the role of several functional groups in the DNA conformational transitions, and provided structural and dynamic information. Thus, the UVRR investigation of intramolecular interactions and of local and environmental dynamics in promoting the different species induced by the solution conditions provided valuable insights into i-motif conformational transitions. The combined use of the two spectroscopic tools is emphasized by the relevant possibility of working in the same DNA concentration range and by the heterospectral UVRR/CD 2D-COS analysis. The results of this study shed light on the factors that can influence at the molecular level the equilibrium between the different conformational species putatively involved in the oncogene expression.
The so-called “noncanonical” DNA secondary structures represent sequence-dependent conformational topologies, frequently clustered in regulatory regions of oncogenes and in telomeres. For example, G-rich strands can form G-quadruplex (G4) structures which, depending on the DNA sequence, in solution may switch into several interconvertible polymorphs upon changes in DNA or cation concentration.4–8 Similarly, depending on the environmental conditions (particularly pH variations), some C-rich sequences can experience polymorphism between i-motif (iM) and hairpin structures.9–12
While the G4 structures have been extensively investigated, a growing attention has only recently being paid to the iM DNA.13,14 Indeed, although the first iM structure was reported in 1993,15 strong evidence for iM formation in vivo has been only recently provided by in cell NMR experiments16 and by the discovery of an antibody able to specifically bind such structures in the nuclei of human cells.17 Even more recent is the visualization of iMs in the nuclei and chromosomes of the testis of Bombyx mori by using immunofluorescence staining.18
The iM structures consist of two parallel-stranded duplexes, stabilized by intercalated, hemi-protonated cytosine–cytosine (C·C+) base pairs (Fig. 1A), which is why the formation of an iM is favored at slightly acidic pH.13,19,20 As in the case of G4 structures, iMs may fold in an intermolecular fashion from the association of two (dimers) or four (tetramers) separate DNA strands; otherwise, they may form an intramolecular structure (monomer) due to the spatial arrangement of four different C-segments within the same strand. Differently from G4s, the intercalation of base pairs from two parallel duplexes leads to a structure with two wide grooves and two extremely narrow grooves. A relevant feature of iM structures is also the absence of stabilizing π-stacking interactions between the aromatic heterocycles of the adjacent C·C+ base pairs (Fig. 1B); indeed in this intercalated DNA structure the stacking geometry of C·C+ pairs is mainly governed by the repulsive base stacking interactions due to the presence of the proton in each base pair.21
Fig. 1 (A) Structure of the hemi-protonated C·C+ base pair; (B) representation of two adjacent C·C+ base pairs in an iM structure. |
The lack of base stacking is partly compensated by interactions between the deoxyribose sugar moieties of the two closely spaced antiparallel backbones in the narrow grooves, and by the dipole–dipole interactions between intercalated C·C+ base pairs.15,22 Furthermore, iM formation and stabilization may be favored in cells by interactions with DNA-binding proteins and by molecular crowding.13,23–25
Interestingly, putative G4/iM-forming sequences have been identified in or near the promoter regions of more than 40% of human genes,26 including important oncogenes such as BCL2 and KRAS, where they have been shown to be involved in the regulation of transcription,12,27 by means of mechanisms in which the iM structures appear to be in dynamic equilibrium with hairpin species (Fig. 2).9,12,28
Furthermore, there was also evidence that iM formation could interfere with DNA repair and replication. This, in turn, may result in genomic instability, which is associated with cancer development.29,30 Therefore, iM DNA and its equilibrating species may represent attractive targets to treat cancer; indeed topology-specific binders may enable site-specific control of genomic instability and expression of genes modulated by the noncanonical structures.
Generally, intramolecular iM structures with longer loops are more stable compared to those with shorter loops,13,31 due to the presence of extra stabilizing interactions in longer loops. Noteworthily, depending on pH, iMs with long loop configuration can even house an entire hairpin-forming sequence in the structure,32 thus coexisting as a hybrid iM/hairpin species in equilibrium with the single iM and hairpin structures, as assumed in the case of the iM within the KRAS gene promoter region termed the Mid-region (Fig. 2B).12 Interestingly, the capping structures formed from these large loops may serve as recognition scaffolds for specific interaction with proteins and/or small molecules, thus affecting the equilibrium between these alternative DNA structures, with consequent effects on the cellular regulation mechanism.
Given the high structural polymorphism of C-rich sequences and the effects of the different structures on cell replication and transcription, it is of primary importance to shed light on the equilibrium between the different species and on the factors that influence it.
Ultraviolet resonance Raman (UVRR) spectroscopy represents a useful tool for detecting and monitoring molecular markers of biomolecules, including nucleic acids.33 Besides providing structural information of biomolecules, produced by molecular vibrations associated to their specific conformation (at micromolar or lower concentrations), UVRR is a powerful method to disclose biologically relevant issues affecting the structure and dynamics of nucleic acids and their interaction with putative drugs, as also already reported by some of us.34–37 Compared to other Raman-based methods, UVRR scattering benefits from the selective enhancement occurring in resonance conditions for the vibrations mainly localized on the nucleotide rings, thus allowing the monitoring of all four nitrogenous bases in a single experiment.37 Indeed, a careful choice of the UVRR excitation wavelength, in resonance with the electronic transitions of specific nucleobases, efficiently allows the disentanglement of each vibrational contribution arising from individual nucleotides.
In this context, the present study aims to investigate by means of UVRR and circular dichroism (CD) spectroscopies two iM-forming DNA sequences known to play a functional role in the transcriptional regulation of BCL2 and KRAS oncogenes. In particular, these sequences are a 39-nucleotide sequence located directly upstream (∼30 bases) of the P1 promoter of the BCL2 gene38 and a 54-nucleotide sequence from the GC-rich nuclease hypersensitive element (NHE) of the KRAS gene promoter region located upstream of the transcription start site (Fig. 2).12
Since CD spectroscopy is a valuable tool for mapping conformational properties of particular DNA molecules,39 it has been used here to preliminarily study the pH-dependent as well as the temperature-dependent structural features of BCL2 and KRAS oligonucleotides. Multivariate curve resolution-alternating least squares (MCR-ALS) analysis of CD data was crucial to explore similarities and differences in the behavior of the two DNA sequences and to describe the number, evolution and identity of structural species found in the pH range of interest.
In parallel, an extensive UVRR study was accomplished to establish the diagnostic markers of C-rich oligonucleotides in their different states of folding (iM, hairpin, and hybrid iM/hairpin structures) and to investigate the changes in the equilibrium between these species induced by environmental perturbations, like pH variations and presence of PEG 200 as a cell-mimicking molecular crowder. The 266 nm UVRR excitation wavelength was selected since it achieves the resonance and pre-resonance conditions for the electronic transitions which contribute to the long-wavelength absorption bands of DNA bases.40–42 Therefore, this wavelength provides the best performance to detect the modifications occurring in the cytosine and guanine bases upon pH changes and to unravel the role of several functional groups in the DNA conformational transitions. Structural and dynamic information on pH-dependent conformational transitions was further retrieved by analyzing the UVRR spectra of DNA molecules using the two-dimensional correlation spectroscopy (2D-COS) technique. In addition, the heterospectral 2D-COS analysis made it possible to correlate the changes in the UVRR and CD spectra. The combined use of the two spectroscopic tools is of relevance also in view of the possibility of working in the same DNA concentration range.
D = CST + E | (1) |
First, the CD spectra as a function of pH at 10 °C were analyzed, so as to verify the proper folding of the two sequences at the different pH values (Fig. S3, ESI†). At pH 5.2, the CD spectra of both BCL2 and KRAS sequences exhibited the characteristic iM profile, i.e., a strong positive band at 288 nm and a negative one at 264 nm.62 On increasing the pH to 6.6, a shift of the maximum to 285 nm was observed for both oligonucleotides, suggesting the formation of a certain amount of hairpin conformation. Upon further increase of the pH to 7.8, the CD spectra of both DNA sequences exhibited a significant intensity drop of the positive band along with the shift of the maximum to 280 nm. This suggests the complete conformational transition of the iM toward the hairpin, although the presence of a fraction of single-stranded DNA cannot be excluded, since unfolded C-rich sequences show a CD maximum centered at around 278 nm.
In order to better understand such conformational equilibria, pH-dependent CD spectra (Fig. S3, ESI†) were submitted to a multivariate data analysis approach, named MCR-ALS, which estimates the pure spectra and the concentration profiles of the species in solution.46 The MCR output strongly depends on the number of components (species) considered to be present during the titration. For BCL2, the model was generated by assuming the presence of either two or three species. However, only the solution obtained from the model generated considering two species turned out to be chemically meaningful. Indeed, the results (Fig. 3A and C) indicate that the first species, which predominates at pH values lower than 7, corresponds to the iM structure, having a strong positive band at 288 nm and a negative one at 264 nm. The second species, prevalently present at pH > 7, corresponds to the hairpin conformation which arises when the bases are deprotonated, as already reported in the literature.9,61 Concerning KRAS data, the MCR model was generated by considering two, three and four species. However, only the model that included three species turned out to be acceptable. Fig. 3B and D show, respectively, the MCR-resolved pure spectra of the species in solution and their relative concentration during the pH titration. Interestingly, it seems that two iM-containing species are present at pH lower than 7.2. The first species (with a positive band at 288 nm and a negative one at 264 nm) is predominant at pH < 6, while the second one starts to appear at pH 5.5, reaches its maximum concentration at pH 7, and then it decreases as the pH increases. This second iM-containing species is characterized by a CD spectrum having a negative band slightly shifted toward lower wavelengths compared to the usual iM structures (at around 258 nm instead of at 264 nm). Such an additional species has already been reported by Benabou and co-workers for another iM-forming sequence,32 and it arises from the partial deprotonation of cytosines located in the loops which allows the formation of a stable Watson–Crick hairpin that coexists with the iM structure, thus generating a hybrid iM/hairpin species (hereafter referred to as hybrid). Finally, at pH values higher than 7.5, the hairpin conformation (third species) becomes the predominant species in solution.
Three-dimensional CD melting curves, which consist in collecting whole CD spectra as a function of temperature, were then obtained for BCL2 and KRAS oligonucleotides at pH 5.2, 6.6, and 7.8 (Fig. S4 and S5, ESI†). The univariate analysis of the data using the CD trace at the wavelength of maximum absorbance variation showed the existence of clear cooperative transitions at pH 5.2 (with apparent melting temperatures, Tm, of 59.9 and 66.5 °C for BCL2 and KRAS, respectively) and pH 6.6 (with Tm of 27.7 and 43.8 °C for BCL2 and KRAS, respectively). In contrast, the temperature-induced changes in the CD spectra at pH 7.8 seemed to reflect a non-cooperative process. However, from this univariate procedure, it is difficult in all cases to state the existence of more than one transition during the experiment.
Thus, as done for the pH titrations, MCR-ALS analysis of temperature-induced unfolding processes was performed. To fully exploit the information hidden behind the 3D melting experiments conducted at three different pH values, a “column-wise matrix augmentation” approach was used to combine in one matrix the CD spectra obtained at the three pH values investigated (see the Experimental section for further details). In this way, the information related to both pH and temperature was employed to consistently generate the pure spectra of the species and their relative concentration profiles. The choice of the number of components was based on the species found by the MCR results obtained from the pH titrations and considering that the increase of the temperature generates the additional single-stranded species in solution.
Therefore, three components were chosen for the MCR-ALS analysis of BCL2. Moreover, we also constrained the concentration profiles to fixed values, using the equality constraint, as described in the Experimental section. The choice of the constrained values was based both on our previous knowledge (i.e., single-stranded DNA is always absent at 10 °C, while it is the only species present at 90 °C, independent of pH) and on the information obtained from the pH titrations performed at 10 °C (i.e., relative fractions of iM, hybrid, and hairpin at pH 5.2, 6.6, and 7.8). The obtained MCR-resolved spectral and concentration profiles clearly show the presence of three species in equilibrium when pH and temperature change simultaneously (Fig. S6, ESI†). At pH 5.2, the presence of iM is predominant at temperature values lower than 60 °C while at higher temperature the unfolded species takes over. At pH 6.6, the hairpin conformation seems to be more thermally stable than the iM structure. This is in line with the fact that such pH is not optimal for the iM conformation, and thus its thermal stability drops dramatically. Finally, at pH 7.8 the iM species is basically absent and the two main species in solution are the hairpin, prevalent at low temperatures, and the single-stranded DNA, which predominates at higher temperatures.
Analogously, the 3D melting data sets of KRAS were processed with the MCR-ALS analysis (Fig. S7, ESI†). Indeed, also in this case, the results obtained from the analysis of the pH titration were used to constrain the concentration profiles to fixed values, using the equality constraint. Thus, at low temperature, three conformations (iM, hybrid and hairpin) were found to be in equilibrium at the different pH values, while the single-stranded DNA (fourth species) turned out to be always the predominant species at temperature values higher than 50 °C indicating the complete DNA unfolding with the further increase in temperature.
The tridimensional MCR-resolved composition profiles on varying pH and temperature, for both BCL2 and KRAS, were generated according to the procedure described in the Experimental section and shown as contour plots of conformational composition (iM, hairpin, hybrid, and single-stranded DNA) in Fig. S8A and B (ESI†). Within the limits of the approximations imposed by the interpolation procedures, these plots represent a sort of state diagrams featuring the conformational transitions as a function of pH and temperature.
Furthermore, to disentangle the contribution of the vibrations of each nucleobase to the bands of the spectra, a data fitting procedure was applied, in combination with the homospectral 2D-COS analysis (see below). Bands relative to the nucleobases are colored by type (Fig. 4). Highlighted in light blue are the diagnostic bands of cytosines in their neutral form, located at about 783, 1253, 1269, 1295 and 1529 cm−1, and hereafter indicated with the letters a, c, d, e, and h, respectively. In particular, band a is mainly associated with the breathing of the cytosine ring; bands c and d are associated with the exocyclic C6H bending and C4N stretching, and with cytosine ring stretching vibrations, respectively; while the two most intense bands e and h are attributed to vibrational modes involving the stretching of the N1C6, C5C6 and N3C4, N1C2 groups in the pyrimidine ring, respectively. Since cytosines are highly sensitive to protonation/deprotonation equilibria, most of these bands should be influenced by pH changes.
As for the thymine bases (in green), two diagnostic bands at about 1239 and 1374 cm−1, referred to as b and f, respectively, are observed in the spectra. These bands are both attributed to the stretching of the methyl C5-CH3 group and bending of C6H.40,63 Among the two, band b is less informative because of its overlapping with band c of the cytosine around 1253 cm−1, while the position of the isolated band f is expected to give information about the interaction between thymines and other bases in the DNA sequence or with the solvent.35
Furthermore, two bands diagnostic of guanine (in red) were also identified in the spectra. The most intense one at about 1488 cm−1 (hereafter referred to as g) is attributed to vibrational modes involving the N7C8 and C8N9 imidazole ring stretching plus the C8H inter-plane bending.40 Since the guanine imidazole ring is not engaged in the recognition and pairing with cytosine, band g is not expected to be particularly susceptible to pH changes, while it should be sensitive to solvent exposure. A similar pH response would be expected for band i at approximately 1577 cm−1, which is assigned to mixed guanine vibrational modes involving the C4C5 stretching out of phase with the flanking N3C4 and C5N7 moieties. In fact, these groups are not directly involved in guanine pairing with cytosine (G–C base pair). Moreover, this band is less informative since it severely overlaps with the one at 1580 cm−1, ascribed to the vibration of the single adenine residue present in the BCL2 sequence (Fig. S9, ESI†).
UVRR profiles of both oligonucleotides also exhibited a wide band centered at around 1653 cm−1, hereafter referred to as k, which mainly results from the overlap of the stretching vibrations of the exocyclic CO group of unpaired cytosine and thymine residues,41,56 while the stretching vibration of the CO group of unpaired guanines appears as a separate shoulder of band k (hereafter referred to as band l) at around 1679 cm−1.41 This band is expected to increase upon acidification as a consequence of the progressive cytosine protonation and consequent disruption of G–C base pairs, with the formation of C·C+ base pairs.
As mentioned above, the redshift of UVRR bands of BCL2 and KRAS at pH 7.8 with respect to the weighted sum of their constituting nucleotides (Fig. S9 and S10, ESI†) clearly suggests the presence of some base-pairing interactions in both DNA sequences at such pH. Particularly significant are the redshifts of bands g and i of guanine, and h of cytosine indicating the presence of G–C interactions.
Noteworthily, UVRR spectra comparison with the sum spectra of the weighted nucleotides also reveals the loss of the band at around 1605 cm−1 which, according to the literature, is associated with the NH2 scissor modes of unpaired guanine,41 along with the appearance of a shoulder, named band j, on the left side of band k. Actually, at pH 7.8, band j comprises two bands, one at 1620 cm−1 corresponding to the NH2 scissor modes of guanine paired to cytosine, and the other at 1632 cm−1 associated to the exocyclic CO stretch of cytosine paired to guanine, overall confirming the occurrence of G–C base pairing at pH 7.8 for both oligonucleotides.41
Interestingly, bands a, d, and l exhibited a significant intensification from pH 7.8 to 5.2, in agreement with the increase of cytosine protonation, and thus with the higher number of C·C+ base pairs formed at the lower pH value. However, among them, only band a can be considered as a good marker of the C·C+ base pair, because of the extended overlap of neighboring bands in the other cases. Interestingly, band a undergoes a blueshift of about 3.5 cm−1 from pH 7.8 to 5.2 in both KRAS and BCL2 spectra (Fig. 5A and B), suggesting the conformational transition of the oligonucleotides to their corresponding iM structures upon decreasing the pH.65 Conversely, band h undergoes a strong intensity drop upon decreasing the pH (Fig. 4), thus suggesting that, in principle, it could be used as a marker of the unpaired cytosines.
Fig. 5 Behavior of the bands (A and B) a, (C and D) f, (E and F) g, and (G and H) j and l for BCL2 (top panels) and KRAS (bottom panels) oligonucleotides at the three different pH investigated. |
Noteworthily, spectra at pH 6.6 and 5.2 of both DNA sequences also exhibit the presence of two novel bands located at 1394 and 1543 cm−1 (indicated by f′ and h′, respectively), whose intensities increase as the pH decreases (Fig. 4 and Tables S1–S6, ESI†). Both these bands are associated with the presence of protonated cytosine and consequently with the formation of hemi-protonated C·C+ base pairs in C-rich oligonucleotides.57 However, the overlapping of the small band f’ with the more intense band f of thymine prevents its use as a potential quantitative marker of iM formation, while the relative variation of bands h (1529 cm−1, neutral cytosine) and h’ (1543 cm−1, C·C+ base pair) may represent a valuable marker of iM formation. According to the literature, the Raman signature of hemi-protonated cytosine base pairs is distinct from the signatures of unprotonated and protonated cytosines, since C·C+ lacks marker bands of unprotonated cytosine, and the intensity of the bands of protonated cytosine decreases by a factor of 2. Indeed, at pH 7.8, band h is the only observable and a decrease in the intensity of this band is clearly observed upon acidification, along with the concomitant gradual intensification of band h′. Therefore, a quantitative measure of the fraction of iM population can be given by fraction Π defined by the relation
Π(%) = 100Ah′/(Ah + Ah′) | (2) |
pH | BCL2 Π | KRAS Π | ||
---|---|---|---|---|
No PEG | PEG 200 | No PEG | PEG 200 | |
7.8 | 0 | 0 | 0 | 0 |
6.6 | 22.3 ± 1.9 | 35.5 ± 1.4 | 16.3 ± 1.2 | 16.5 ± 1.3 |
5.2 | 48.0 ± 2.2 | 72.8 ± 2.4 | 57.6 ± 1.5 | 50.0 ± 1.9 |
Then, a careful examination of other pH-sensitive bands not related to cytosines was performed to have further insights into the pH-dependent dynamic equilibrium between iM and hairpin species of BCL2 and KRAS. From this perspective, remarkable information on DNA structural transitions was retrieved from changes of band f of thymine, as well as g, j, and l of guanine.
Band f probes the position of thymine bases in DNA, as already observed by following the thermal denaturation of G-quadruplex structures.35 This band turned out to be redshifted during the melting of G-quadruplexes, i.e. when such nucleobases become unstacked in the single-stranded DNA. Interestingly, band f turned out to be sensitive to pH as well (Fig. 5C and D). A general blueshift of band f is observed upon acidification of both oligonucleotide solutions, which can be ascribed to changes experienced by thymines during the C-rich DNA conformational transition to form the iM. Interestingly, band f undergoes a larger shift in the case of KRAS compared to BCL2 (Tables S1–S6, ESI†), suggesting a different structural environment experienced by thymines in the case of the two sequences. Previous studies indicate that thymines in the loops of iMs are common capping residues that can be involved in additional base pairs, like T·T and/or A·T (Hoogsteen, reverse Watson–Crick), that extend the iM core and stabilize the structures.66,67 In the structural model proposed for KRAS (Fig. 2B),12 all thymines belong to the flanking residues at the 5′- and 3′-ends in the unordered region of the hairpin structure, while they are part of the loops of the iM structure, potentially contributing to its stabilization. Conversely, in the folding pattern proposed for BCL2 (Fig. 2A), only the last thymine residue of the sequence is located in the flanking, unordered region of the iM and hairpin structures.9,61 Therefore, the different changes in the position of band f observed for the two oligonucleotides agree well with their structural models. Thus, band f represents a marker to detect loop formation or modification during the conformational transitions of C-rich oligonucleotides.
An appreciable shift of band g (assigned to the guanine ring) to a lower frequency was observed for both oligonucleotides upon acidification from pH 7.8 to 5.2 (Fig. 5E and F). It has been pointed out that a redshift of band g shows an increased strength of hydrogen bonding of N7 of unpaired guanine bases with the solvent, which causes a lowering of the frequency of this vibration.68 Interestingly, a gradual variation of band g takes place in the case of BCL2, thus suggesting the related gradual decrease of Watson–Crick G–C base pairs. Moreover, band g undergoes a smaller shift for KRAS than for BCL2, in agreement with the formation, at pH 6.6, of a hybrid iM/hairpin species stabilized by the Watson–Crick G–C base pairs as in the hairpin conformation, which would imply a greater persistence of G–C base pairs in KRAS.
Another remarkable piece of information on hairpin/iM transitions can also be retrieved from changes of bands j and l of guanines (Fig. 5G and H), associated with the vibration of the exocyclic NH2 scissoring and CO stretching, respectively. Although these bands are detected as shoulders of the wide band k at around 1654 cm−1, they are well-established markers of guanines involved or not in base pairing with cytosines. As for band j, both oligonucleotides exhibited a redshift when the pH decreases, albeit to a slightly different extent, indicative of G–C base pair dissociation.41,56 In particular, at pH 7.8, the position of band j at the same wavenumber for the two oligonucleotides (around 1622 cm−1) suggests a similar G–C base pair content for the two. However, some differences are observed in the behavior of band j for KRAS and BCL2 at pH 6.6. These differences may reflect the different conformational states that the two oligonucleotides have at this pH value. Finally, at pH 5.2, the position of band j is the same in both KRAS and BCL2 spectra (around 1610 cm−1), indicating an almost complete iM formation for both the oligonucleotides.
Interesting is also the behavior of band l (which refers to unpaired guanines) upon pH changes. In particular, a gradual intensification of band l is observed for BCL2 from pH 7.8 to 5.2, which denotes the gradual increase of unpaired guanines. Conversely, no significant variation of such band is detected for KRAS between pH 7.8 and 6.6, confirming the retention of the G–C base pairs in the corresponding DNA structure it forms at pH 6.6. On the other hand, the KRAS oligonucleotide exhibits a significant intensification of band l at pH 5.2, in agreement with the already mentioned predominant formation of iM at such pH.
With the aim of further corroborating the pH-dependent structural information and conformational transitions, the two-dimensional correlation spectroscopy (2D-COS) analysis of UVRR spectra at the three different pH values was performed. The synchronous 2D correlation maps for BCL2 and KRAS (Fig. 6A and C) show auto-peaks (located at the diagonal of the maps) for each of the UVRR bands, thus confirming that all of them, even though to different extent, are perturbed by the pH variation. On the other hand, the positive cross-peaks (located at off-diagonal positions of the maps) observed between pairs of bands indicate that the intensities of ν1 and ν2 always vary in the same direction. For example, band k (overlapped with j and l) shows cross-peaks with all the other bands, thus indicating the existence of synchronized spectral changes with the different bands. The most intense cross-peak suggests a strong correlation between the intensity change of the bands j, k, l and the bands b, c, d. This agrees well with the fact that these bands carry the information concerning the most pH-sensitive vibrations of dC and dT residues.
The sign of the cross-peaks in the homospectral asynchronous 2D-COS maps of BCL2 and KRAS (Fig. 6B and D) provides crucial information on the sequential relationships between peaks with different spectral origins and provides further validation of the peak deconvolution. Indeed, according to the interpretation rules of such signs by Noda,58,60 in an asynchronous plot, if a peak (ν1, ν2) is positive, band ν1 will vary prior to band ν2, and if it is negative, band ν1 will vary after band ν2. Furthermore, cross-peaks in the synchronous map with a rather broad area that does not display any well-resolved bands, likely because of the presence of several overlapping bands, could also split into smaller cross-peaks of different colors in the asynchronous map, thus showing the presence of different entities under the same band. Thus, the asynchronous 2D-COS maps of BCL2 and KRAS show a clear split of the four broad synchronous cross-peaks at around 1250 (bands b, c, and d), 1377 (bands f and f′), 1529 (bands h and h′), and 1654 cm−1 (bands k and l). This result matches very well with the previously performed peak deconvolution and assignment reported in Fig. 4 and 5.
Regarding the sequential order of events, the values of the cross-peaks indicate that, when pH changes from 5.2 to 7.8, the changes in the bands a, c, d, f′, h′, and k occur before the intensity variation in g and i, demonstrating that the guanines (bands g and i) are involved in the iM to hairpin conformational transition after the partial deprotonation of the cytosines. On the other hand, the sign of the cross-peaks indicates that changes in bands e and h occur always after the changes in the other bands as the pH increases.
Finally, a careful inspection of the asynchronous maps reveals that the only noticeable difference between the two sequences is in the behavior of band b (associated with dT), which, only in the case of BCL2, shows a negative cross-peak with band k. This difference once again suggests a different structural environment experienced by thymines in the case of the two sequences.
Fig. 7 and 8 show the overlaid UVRR spectra of BCL2 and KRAS in the presence and absence of PEG 200 at pH 7.8, 6.6 and 5.2. At pH 7.8, BCL2 seems to be affected by the presence of 40% PEG 200 more than KRAS (Fig. 7A and 8A, respectively), with a general increase of the intensity of bands for the first and an increase limited to bands b–e for the second. Moreover, a shift of some bands of cytosine and guanine (about 2–5 cm−1) is observed along with no significant changes in the position of the thymine bands (see Tables S1 and S7 for BCL2, and Tables S4 and S10 for KRAS, ESI†), thus suggesting some effects on G–C interactions. However, the absence of bands f′ and h′, which are the markers for the iM formation, indicates that at pH 7.8 both sequences are unable to form such a structure under the molecular crowding conditions as in dilute solution.
This finding is also confirmed by CD spectra and melting data recorded in PEG 200-crowded solution (Fig. S11 and S12, ESI†), which show no significant changes with respect to the data obtained in dilute solution.
At pH 6.6, the UVRR spectra of the oligonucleotides in PEG-containing solution vary differently in terms of intensity for BCL2 and KRAS compared to the respective spectra recorded in the absence of PEG (see Tables S2 and S8 for BCL2, and Tables S5 and S11 for KRAS, ESI†). In particular, an increase in the intensity of bands d and e of cytosine, and b of thymine is detected in the case of BCL2 (Fig. 7B). Moreover, a significant increase of the Π value (around 60%) for the hemi-protonation of cytosine was observed in the presence of PEG 200 with respect to the dilute solution (Table 1), while the CD melting data at pH 6.6 show increased thermal stability of the BCL2 structure (ΔTm ≈ +4.5 °C) in the presence of PEG compared to that in its absence (Fig. S4 and S11, ESI†). These results suggest that PEG 200 induces the formation and stabilization of BCL2 iM at that pH.23
In contrast, a different scenario is observed for KRAS in PEG-containing solutions at pH 6.6 (Fig. 8B). In this case, an increase of the intensity of bands g, i, and j, all relative to guanine bases, is detected, while the bands of thymine and cytosine (either neutral or protonated) remain almost unaffected. The irrelevant effect of PEG on cytosines, and thus on the iM structure formed by KRAS, is also confirmed by the value of the Π marker which is close to that obtained from the spectra in dilute solution (Table 1). However, a thermal destabilization effect (ΔTm ≈ −7 °C) is observed by CD experiments for the KRAS structure in the presence of PEG 200 at pH 6.6 (Fig. S5 and S12, ESI†).
Attempting a rationalization of these results, the changes in the guanine bands of KRAS under crowding conditions at pH 6.6 suggest that the presence of PEG 200 affects mainly, if not exclusively, the guanine bases mostly located in the long hairpin-forming loop of the hybrid iM/hairpin species. This increase in intensity indicates a decrease of the short-range (nearest-neighbor) interactions of the guanine bases due to the effect of shielding by the PEG polymer chains. Actually, the effect of PEG on hairpin is complex and it depends on both the dimension of the hairpin junction and the size of the PEG chain.71,72 Cosolutes with low molecular weights, such as PEG 200, which decrease the water activity and generate osmotic stress, were found to decrease the stability of the DNA hairpins.71 Therefore, the changes in guanine bands and the lower thermal stability detected for KRAS at pH 6.6 may arise from a decrease of G–C interactions in the hairpin-forming loop of the hybrid species.
The UVRR spectra of BCL2 and KRAS at pH 5.2 in the presence of 40% PEG 200 (Fig. 7C and 8C, respectively) also show a different behavior for the two oligonucleotides. In the case of BCL2, except for a slight decrease in band e, an increase in the intensity of most bands is observed, including f′ and h′, the markers of the C·C+ base pair. This results in a PEG-induced increase of the Π value (around 50%, Table 1) indicating an increase of cytosine protonation (and thus of iM population) at that pH, with a slight thermal stabilization (ΔTm ≈ +2 °C) as detected by CD melting experiments (Fig. S4 and S11, ESI†). As far as KRAS is concerned, the spectrum at pH 5.2 in the presence of PEG shows a decrease in the intensity of bands b, d, and e, along with an increase in the intensity of bands g, h, and i. On the other hand, no significant variation with respect to the spectrum recorded in dilute solution was observed for bands f′ and h′, as well as no consequences on iM stability were detected by CD melting (Fig. S12, ESI†), thus suggesting the absence of relevant effects of PEG 200 on that iM structure. These results are in agreement with the low-to-null effects of PEG 200 on the thermal stability of c-MYC promoter iM at pH values around 5.0.72
Additional information on pH-dependent conformational transitions of BCL2 and KRAS under molecular crowding conditions was achieved by performing 2D-COS analysis of UVRR spectra recorded in the presence of PEG 200. The synchronous 2D correlation maps of BCL2 and KRAS (Fig. S13A and C, ESI†) as well as the asynchronous correlation map of BCL2 (Fig. S13B, ESI†) show no noteworthy differences with respect to the maps obtained in dilute solution (Fig. 6). Conversely, the asynchronous 2D correlation map generated for KRAS in PEG-crowded solution (Fig. S13D, ESI†) shows some small but significant differences, all concerning band g of guanine: (i) two novel cross-peaks between band g and bands b and f (around 1243 and 1376 cm−1, respectively) of thymine emerge from the background; ii) the cross-peaks of band g with bands i and k (around 1576 and 1652 cm−1, respectively) show a clear split into two smaller cross-peaks of different colors.
These findings corroborate the idea that the presence of PEG 200 mainly affects the guanine bases located in the long hairpin-forming loop of KRAS and that their behavior under crowding conditions is correlated to that of the thymines present in the short loops.
Even more informative are the asynchronous 2D-COS maps (Fig. 9B and D). Indeed, a careful inspection of those maps reveals that the intensity changes for all UVRR peaks take place faster than that of the CD band at 264 nm (positive cross-peaks), while their variations in relation to the CD band at 288 nm are slower (negative cross-peaks) in most cases, apart from bands d, f′, h′, and, in part, a, where cross-peaks with a positive sign are observed. Interestingly, these findings agree with the fact that these bands carry the information related to the hemi-protonated C·C+ pairs, which are, of course, very sensitive to the pH variation. It is also reasonable that the maximum CD band at 288 nm varies faster than the other UVRR peaks since it is also indicative of the presence of the iM in solution. Noteworthily, the asynchronous plot of KRAS (Fig. 9D) shows, for the aforementioned bands, a more pronounced split of the corresponding synchronous cross-peaks than that of BCL2. This phenomenon may be due to the different conformational transition pathway of the two oligonucleotides.
The whole set of CD data have been fruitfully exploited to deconvolute the composition profiles of the conformational species as a function of pH and temperature and, thus, to generate contour plots identifying the fractions of the various species formed by BCL2 and KRAS in a pH-temperature diagram. The inspection of these diagrams indicates that the iMs with a long loop, such as in KRAS, at certain pH values can really house an entire hairpin-forming sequence in the structure, thus coexisting as a hybrid iM/hairpin species in equilibrium with the single iM and hairpin structures. On the other hand, the Watson–Crick hairpin in both BCL2 and KRAS is the predominant species in solution at pH >7.5 and low temperature.
The pH-dependent UVRR spectral changes have been initially quantified by the shift and intensity variations, establishing the diagnostic markers of C-rich oligonucleotides in their different conformations. Then the scrutiny of the 2D-COS maps, especially the asynchronous ones, provided a strong confirmation of the conventional Raman spectral analysis and peak markers. More importantly, the analysis of the asynchronous correlation maps offered a sensitive means in detecting the sequence of events in response to changes induced by pH variations, in the absence and presence of PEG 200 as a crowding agent. This provides relevant complementary information for a detailed description of the observed dynamical scenario upon the applied pH path. The analysis of the sequential order of events indicates that the conformational changes induced by pH occurred at different rates for different parts of the oligonucleotides. In particular, when pH changes from 5.2 to 7.8, guanine bases are involved in the iM to hairpin conformational transition after the partial deprotonation of the cytosines. A noteworthy difference between BCL2 and KRAS is in the behavior of thymine bases which suggests a different structural environment experienced by such nucleobases during the iM-hairpin conformational transitions in the case of the two DNA sequences. Similarly, the effect of the presence of PEG 200 on the pH-induced conformational transitions was also elucidated, highlighting a different effect of the crowding agent on the two oligonucleotides especially at pH 6.6. Indeed, at this pH, it induces the stabilization of BCL2 iM, while a destabilizing effect is observed for KRAS, most likely due to the decrease of G–C interactions in the hairpin-forming loop of the hybrid species.
Overall, the present work emphasizes the concerted CD-assisted UVRR task to shed light on the effect of solution conditions on intra- and intermolecular interactions of the different DNA conformations.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d2cp00058j |
This journal is © the Owner Societies 2022 |