Afaf H.
El-Sagheer‡§
ab and
Tom
Brown‡§
a
aSchool of Chemistry, University of Southampton, Highfield, Southampton, SO17 1BJ, UK. E-mail: tb2@soton.ac.uk; A.H.El-Sagheer@soton.ac.uk; Fax: +44 (0)2380 592991; Tel: +44 (0)2380 592974
bChemistry Branch, Dept. of Science and Mathematics, Faculty of Petroleum and Mining Engineering, Suez University, Suez, 43721, Egypt
First published on 17th October 2013
DNA strands containing a triazole linkage flanked on its 3′-side by an aminoethylphenoxazine nucleobase analogue (G-clamp) have been prepared by solid-phase synthesis followed by CuAAC-mediated click oligonucleotide ligation. The stability of the doubly modified DNA duplexes and DNA–RNA hybrids is greatly increased, whereas a single base pair mismatch located at or adjacent to the modifications is strongly destabilising, making triazole G-clamp a potent mismatch/point mutation sensor. A DNA strand containing this unnatural combination was successfully amplified by PCR to produce unmodified copies of the original template, with deoxyguanosine inserted opposite to the G-clamp-triazole nucleotide analogue. This study shows for the first time that a polymerase enzyme can read through a combined backbone/nucleobase modification surprisingly well. These favourable properties suggest new applications for oligonucleotides containing the G-clamp triazole modification in biotechnology, nanotechnology, diagnostics and therapeutics.
Fig. 1 Triazole G-clamp base paired with guanine in complementary DNA. The additional steric bulk of the G-clamp nucleobase relative to cytosine is in red. |
In order to address this we have been investigating approaches to stabilise triazole-containing oligonucleotides, bearing in mind that any modification must also permit read-through by DNA polymerases. Many chemical adaptations have been developed to increase the thermal stability of nucleic acid duplexes,15 but with biocompatibility as the major criterion we focused on those that would be expected to produce minimal structural perturbation.
We now report the synthesis and properties of a triazole-containing DNA analogue that forms more stable duplexes than natural DNA. This is achieved by the incorporation of the aminoethylphenoxazine analogue of 2′-deoxycytidine (G-clamp) adjacent to the triazole linkage. This cytosine analogue increases duplex stability by a combination of increased intra-helical base stacking and additional hydrogen bonding to guanine (Fig. 1).16 It has been shown to be effective in a DNA17,18 and PNA19 context, and analogues have also been developed to recognise 8-oxoguanine in DNA.20 G-clamp is essentially deoxycytidine with an additional aminoethoxyphenoxy group protruding into the major groove. We reasoned that this structural change might be accommodated by polymerase enzymes without causing mutagenesis, and we have investigated this concept.
The methodology that is generally used to synthesise 5′-azide oligonucleotides22 involves conversion of the 5′-OH group of the terminal deoxyribose sugar to azide whilst the oligonucleotide is attached to the solid support. This transformation (Fig. 2), which is accomplished using methyltriphenoxyphosphonium iodide, provides a general method to substitute the nucleobase on the 3′-side of the triazole linkage with unnatural analogues, in this case G-clamp. It has been shown that G-clamp is strongly stabilising at CpC steps in normal DNA,16 so we decided initially to synthesise and evaluate an oligonucleotide containing the equivalent triazole containing dinucleotide analogue (5-MeCtCC where CC = G-clamp and t = triazole). The 5-methylated analogue of cytosine was used for synthetic convenience.3
Fig. 2 Conversion of the 5′-OH group of a G-clamp oligonucleotide to azide on the solid support. Structure of protected and unprotected G-clamp in inset. |
A 13-mer triazole G-clamp oligonucleotide (ODN-3, Table 1) was synthesised in order to provide a sensitive probe for duplex stability. To accomplish this, the G-clamp phosphoramidite monomer was incorporated at the 5′-end of oligonucleotide ODN-1 (Table 1) by standard solid-phase methods and the 5′-OH group was converted to azide on the solid support as described above (Fig. 2). This functional group conversion has not been previously carried out on 5′-G-clamp oligonucleotides and we were pleased to observe that it proceeded smoothly (Fig. 3A and B). After cleavage from the solid support and nucleobase/phosphate deprotection, ODN-1 was obtained in high purity. Ligation of ODN-1 to an excess of 3′-propargyl-functionalised ODN-26 by the CuAAC reaction1,2 yielded the triazole-containing 13-mer ODN-3. The reaction proceeded efficiently without the need for a complementary splint (Fig. 3C and D) and excess of 3′-propargyl ODN-2 was removed by HPLC purification. Ultraviolet melting studies were performed with ODN-3 hybridised to a series of matched and mismatched complementary DNA and RNA strands (Fig. 4, Table 2). The stability of the duplex formed by ODN-3 and its unmodified DNA complement (ODN-7) was compared to the equivalent unmodified canonical duplex with a central 5-methy cytosine (ODN-4/ODN-7) and to the duplex containing a triazole linkage and a cytosine base in place of G-clamp (ODN-6/ODN-7). ODN-6 was made by clicking ODN-2 with ODN-5 using the method described above.
Code | Sequence 5′ to 3′ |
---|---|
a MeCY = 3′-propargyl-5-methyl dC, CZ = 5′-azido dC, CCZ = 5′-azido G-clamp, t = triazole linkage. F = amidohexylfluorescein. | |
ODN-1 | dCCZTGCAGC |
ODN-2 | dCGACGMeCY |
ODN-3 | dCGACGMeCtCCTGCAGC |
ODN-4 | dCGACGMeCCTGCAGC |
ODN-5 | dCZTGCAGC |
ODN-6 | dCGACGMeCtCTGCAGC |
ODN-7 | dGCTGCAGGCGTCG |
ODN-8 | dGCTGCAGACGTCG |
ODN-9 | dGCTGCAGCCGTCG |
ODN-10 | dGCTGCAGTCGTCG |
ODN-11 | dGCTGCAAGCGTCG |
ODN-12 | dGCTGCACGCGTCG |
ODN-13 | dGCTGCATGCGTCG |
ODN-14 | rGCUGCAGGCGUCG |
ODN-15 | rGCUGCAGACGUCG |
ODN-16 | rGCUGCAAGCGUCG |
ODN-17 | dFTGTGTGCTGGCGATCTTA – splint for click reaction to synthesise PCR template |
ODN-18 | dGCATTCGAGCAACGTAAGATCGMeCY |
ODN-19 | dCCZAGCACACAATCTCACACTCTGGAATTCACACTGACAATACTGCCGACACACATAACC |
ODN-20 | dGCATTCGAGCAACGTAAGATCGMeCtCCAGCACACAATCTCACACTCTGGAATTCACACTGACAATACTGCCGACACACATAACC – PCR template |
ODN-21 | dGCATTCGAGCAACGTAAG – PCR primer |
ODN-22 | dGGTTATGTGTGTCGGCAG – PCR primer |
ODN code | ODN code | Central base pairs* | MeCxC central linkage | T m | ΔTm |
---|---|---|---|---|---|
a ΔTm = (Tm duplex − Tm control duplex). t = central triazole linkage in ODN in column 1 of the table. p = central phosphodiester linkage in ODN in column 1 of the table. CC = G-clamp, MeC = 5-methylcytosine, r = RNA. * Central base pairs = base pairs around triazole site proceeding from 5′-end of ODN-3, 4 or 6 in column 1. For example entry 1 in Table 2a, MeCG–CCG is derived from: All oligonucleotide sequences are in Table 1. | |||||
(a) Watson–Crick base pairing DNA duplexes | |||||
ODN-3 | ODN-7 | MeCG–CCG | t | 75.0 | |
ODN-4 | ODN-7 | MeCG–CG | p | 68.5 | −6.5 |
ODN-6 | ODN-7 | MeCG–CG | t | 63.0 | −12.0 |
(b) Watson–Crick base pairing DNA–RNA hybrid duplexes | |||||
ODN-3 | rODN-14 | MeCG–CCG | t | 77.5 | |
ODN-4 | rODN-14 | MeCG–CG | p | 68.1 | −9.4 |
ODN-6 | rODN-14 | MeCG–CG | t | 64.8 | −12.7 |
(c) G-clamp triazole DNA–DNA mismatch studies | |||||
ODN-3 | ODN-7 | MeCG–CCG | t | 75.0 | — |
ODN-3 | ODN-8 | MeCA–CCG | t | 55.3 | −19.7 |
ODN-3 | ODN-9 | MeCC–CCG | t | 55.0 | −20.0 |
ODN-3 | ODN-10 | MeCT–CCG | t | 55.1 | −19.9 |
ODN-3 | ODN-11 | MeCG–CCA | t | 56.3 | −18.7 |
ODN-3 | ODN-12 | MeCG–CCC | t | 53.0 | −22.0 |
ODN-3 | ODN-13 | MeCG–CCT | t | 55.8 | −19.2 |
Average = −19.9 | |||||
(d) DNA–DNA mismatch studies | |||||
ODN-4 | ODN-7 | MeCG–CG | p | 68.5 | — |
ODN-4 | ODN-8 | MeCA–CG | p | 54.1 | −14.4 |
ODN-4 | ODN-9 | MeCC–CG | p | 47.9 | −20.6 |
ODN-4 | ODN-10 | MeCT–CG | p | 52.1 | −16.4 |
ODN-4 | ODN-11 | MeCG–CA | p | 45.5 | −23.0 |
ODN-4 | ODN-12 | MeCG–CC | p | 49.0 | −19.5 |
ODN-4 | ODN-13 | MeCG–CT | p | 52.2 | −16.3 |
Average = −18.4 | |||||
(e) Triazole DNA–DNA mismatch studies | |||||
ODN-6 | ODN-7 | MeCG–CG | t | 63.0 | — |
ODN-6 | ODN-8 | MeCA–CG | t | 50.6 | −12.4 |
ODN-6 | ODN-9 | MeCC–CG | t | 46.8 | −16.2 |
ODN-6 | ODN-10 | MeCT–CG | t | 48.7 | −14.3 |
ODN-6 | ODN-11 | MeCG–CA | t | 45.3 | −17.7 |
ODN-6 | ODN-12 | MeCG–CC | t | 47.6 | −15.4 |
ODN-6 | ODN-13 | MeCG–CT | t | 50.2 | −12.8 |
Average = −14.8 | |||||
(f) G-clamp triazole DNA–RNA mismatch studies | |||||
ODN-3 | rODN-14 | MeCG–CCG | t | 77.5 | — |
ODN-3 | rODN-15 | MeCA–CCG | t | 58.2 | −19.3 |
ODN-3 | rODN-16 | MeCG–CCA | t | 53.0 | −24.5 |
Average = −21.9 | |||||
(g) DNA–RNA mismatch studies | |||||
ODN-4 | rODN-14 | MeCG–CG | p | 68.1 | — |
ODN-4 | rODN-15 | MeCA–CG | p | 53.3 | −14.8 |
ODN-4 | rODN-16 | MeCG–CA | p | 49.5 | −18.6 |
Average = −16.7 | |||||
(h) Triazole DNA–RNA mismatch studies | |||||
ODN-6 | rODN-14 | MeCG–CG | t | 64.8 | — |
ODN-6 | rODN-15 | MeCA–CG | t | 51.1 | −13.7 |
ODN-6 | rODN-16 | MeCG–CA | t | 44.9 | −19.9 |
Average = −16.8 |
Incorporation of the G-clamp increased the UV melting temperature (Tm) of the triazole DNA duplex by an impressive 6.5 °C compared to the unmodified duplex, and by 12 °C compared to the triazole duplex with cytosine in place of the G-clamp (Table 2a). Even greater stabilisation was observed when the triazole G-clamp ODN-3 was hybridised to an RNA complement. In this case the increase in Tm was 9.4 °C compared to the normal DNA–RNA hybrid, and 12.7 °C relative to the triazole duplex with cytosine instead of G-clamp (Table 2b). To dispel concerns that the stabilisation due to the triazole G-clamp is unique to a MeCtCC base stacking step, studies were also carried out to determine the melting temperatures of duplexes containing a TtCC step. The same trends in duplex stability were observed, i.e. triazole G-clamp > canonical DNA > triazole DNA (ESI†). For chemically modified DNA to be useful in a biological context it is essential that it is selective for its chosen target; i.e. duplexes containing mismatched base pairs must be destabilised. To evaluate this, duplexes containing a single methylated or unmethylated C.A or C.C mismatch on either side of the triazole linkage were studied. In the DNA–DNA series the average mismatch destabilisation was 19.9 °C for the triazole G-clamp duplex, 18.4 °C for the normal non-triazole duplex and 14.8 °C for the triazole duplex without G-clamp (Table 2c–e), indicating that triazole G-clamp is the most efficient sensor of DNA mispairing. In the context of DNA–RNA hybrid duplexes the beneficial effects of triazole G-clamp were even greater, with an average mismatch destabilisation of 21.9 °C compared to 16.7 °C for normal DNA and 16.8 °C for triazole DNA without the G-clamp (Table 2f–h). Overall these melting studies confirm that the stability of DNA can be greatly improved by incorporation of G-clamp on the 3′-side of the artificial triazole backbone. Moreover, potent mismatch discrimination can be achieved if the G-clamp nucleobase is directly involved in mispairing, and also when the base directly on the other side of the triazole linkage is mispaired. There is a marked improvement in mismatch discrimination for the combination of triazole and G-clamp compared to normal DNA and DNA containing triazole alone.
Having established that G-clamp is effective in stabilising triazole DNA duplexes, it was important to determine whether DNA polymerases can read through the combination of the triazole and G-clamp to faithfully produce complementary copies. The outcome of these investigations was uncertain as there are no previous reports of replication of DNA strands that contain G-clamp monomers, let alone examples of G-clamp combined with backbone modifications. To investigate the polymerase-compatibility of the duplex-stabilising modification we synthesised an 81-mer PCR template (ODN-20) containing triazole G-clamp using a complementary splint (ODN-17). The splint was employed to assist CuAAC-catalysed oligonucleotide ligation between 3′-propargyl ODN-18 and 5′-azide ODN-19. The click reaction was efficient and the template was purified and analysed by HPLC and mass spectrometry (Fig. 5A and B). We then carried out experiments to find out if the doubly-modified linkage can be read through in a linear fashion. This was successful (ESI†). PCR amplification was carried out (Fig. 5C) and the amplicon was purified by agarose gel electrophoresis, inserted into a sequencing vector and analysed by Sanger DNA sequencing (Fig. 5D). The sequencing data from over 20 clones confirmed that the region around the triazole linkage had been read through correctly (ESI†). Overall this series of experiments prove that the G-clamp triazole linkage can be formed efficiently by click ligation, and can be read through faithfully by DNA polymerase enzymes (Fig. 6). This is an important result as it suggests that the triazole G-clamp combination could be used for in vivo applications that involve replication of the modified DNA.3,5
Fig. 5 G-clamp triazole DNA template is efficiently chemically ligated and accurately amplified by PCR. (A) Reversed-phase HPLC, (B) ESI-mass spectrum of 81-mer ODN-20, and (C) 2% agarose gel for the PCR product using ODN-20 as a template. Lane 1; 50bp DNA ladder. Lane 2; control reaction with primers and without the template. Lane 3; PCR reaction using template ODN-20 (5 ng) and primers ODN-21 and ODN-22 (Table 1). (D) Sanger sequencing of cloned PCR template showing the correct sequence (dGCCA) around the site of the original triazole G-clamp. Details of DNA sequencing in ESI.† |
Fig. 6 Formation of the G-clamp triazole linkage by the CuAAC reaction and faithful read-through by PCR. |
In the context of future therapeutic and diagnostic applications it is important to understand duplex forming properties of oligonucleotides containing multiple modifications of triazole G-clamp. It was unclear whether this would confer additional duplex stability or lead to a collapse of the duplex structure due to steric constraints. To this end we used double templated click ligation to synthesise a 13-mer oligonucleotide containing two units of triazole G-clamp and compared the stability of its fully complementary DNA duplex to equivalent unmodified and triazole modified duplexes (ESI†). Very large increases in Tm (12.2 °C and 22.7 °C respectively) were observed confirming the potential of triazole G-clamp as a modification in antisense oligonucleotides. With this result in mind future synthetic strategies will focus on producing the triazole G-clamp dinucleotide phosphoramidite for direct incorporation into oligonucleotides during solid-phase synthesis.
Footnotes |
† Electronic supplementary information (ESI) available: Oligonucleotide synthesis, DNA sequencing, UV melting, studies on oligonucleotides containing a TtC step, mass spectra of oligonucleotides, linear copying of template oligonucleotides, click ligation to synthesise oligonucleotide with two G clamp-triazole modifications. See DOI: 10.1039/c3sc51753e |
‡ TB and AHE-S are joint main authors. |
§ Current address: Department of Chemistry, University of Oxford, Chemistry Research Laboratory, 12 Mansfield Rd, Oxford, OX1 3TA, UK. Email: tom.brown@chem.ox.ac.uk; afaf.el-sagheer@chem.ox.ac.uk; Tel: +44 (0)1865 275413. |
¶ The 5× Promega green PCR buffer (pH 8.5) which was provided with GoTaq DNA polymerase contains 7.5 mM MgCl2 to give a final Mg2+ concentration of 1.5 mM. The buffer contains Tris·HCl, KCl and two dyes (blue and yellow) that separate during electrophoresis to monitor the migration process. |
This journal is © The Royal Society of Chemistry 2014 |