Kai
Gao‡
,
Shabnam
Shaabani‡
,
Ruixue
Xu‡
,
Tryfon
Zarganes-Tzitzikas
,
Li
Gao
,
Maryam
Ahmadianmoghaddam
,
Matthew R.
Groves
and
Alexander
Dömling
*
Pharmacy Department, Drug Design group, University of Groningen, A. Deusinglaan 1, 9700 AD Groningen, The Netherlands. E-mail: a.s.s.domling@rug.nl
First published on 5th May 2021
Hit finding in early drug discovery is often based on high throughput screening (HTS) of existing and historical compound libraries, which can limit chemical diversity, is time-consuming, very costly, and environmentally not sustainable. On-the-fly compound synthesis and in situ screening in a highly miniaturized and automated format has the potential to greatly reduce the medicinal chemistry environmental footprint. Here, we used acoustic dispensing technology to synthesize a library in a 1536 well format based on the Groebcke–Blackburn–Bienaymé reaction (GBB-3CR) on a nanomole scale. The unpurified library was screened by differential scanning fluorimetry (DSF) and cross-validated using microscale thermophoresis (MST) against the oncogenic protein–protein interaction menin–MLL. Several GBB reaction products were found as μM menin binder, and the structural basis of the interactions with menin was elucidated by co-crystal structure analysis. Miniaturization and automation of the organic synthesis and screening process can lead to an acceleration in the early drug discovery process, which is an alternative to classical HTS and a step towards the paradigm of continuous manufacturing.
New enabling technologies, laboratory automation and high-throughput experimentation (HTE) aim towards the paradigm of continuous manufacturing and to accelerate early preclinical discovery and development.6 For example, HTE is regularly used to optimize reaction conditions and expand the scope of known reactions.7 The combination of acoustic mist ionization and time-of-flight mass spectrometry finds already sporadic use in HTS approaches to triage assays and in metabolic profiling.8 Acoustic dispensing is used for the fast, mild and effective crystal soaking for the formation of ligand–protein complexes.9 Although novel technologies are constantly surfacing, they have to be validated, implemented and integrated with academic and pharmaceutical R&D activities to obtain a better understanding of the molecular basis of diseases and to ultimately accelerate drug discovery. Forefront developments aiming to automate and accelerate drug discovery are underway, e.g. a specialized platform for innovative research exploration.10
We are working on the concept of artificial intelligence-driven (AI) autonomous early drug discovery towards the paradigm of continuous manufacturing. We have recently shown miniaturized and automated synthesis of large and diverse small molecular weight libraries, including isoquinolines, indole polycycles, quinoxalines, boronic acids, and iminopyrrolidines that resulted in useful, scalable synthetic protocols.10,11 Here, we present the miniaturized, automated, on-the-fly synthesis, quality control, and biophysical screening of a large unpurified library on a nanomole scale to rapidly discover novel binders to the menin protein playing a role in the oncogenic protein–protein interaction menin–MLL. The discovered hits were biophysically characterized and culminated in an experimentally derived molecular model of a hit compound bound to the menin receptor provided by X-ray structure analysis.
Chromosomal rearrangements of the MLL gene12 are associated with patients with acute myeloid (AML), acute lymphoblastic (ALL) leukemias13 and in therapy related leukemias or myelodysplastic syndrome (MDS).14 As a result of these chromosomal rearrangements a gene fusion product may be formed with one of over 60 distinct protein partners, including AF4, AF9, ENL, AF6, ELL and AF10.15 Disruption of MLL activity by this gene translocation has previously been shown to upregulate genes critical in leukemogenesis (HOXA9 and MEIS1).16 The onogenic function of these MLL-fusion proteins17 is dependent upon the direct interaction of the N-terminus of MLL with menin, a 67 kDa ubiquitously protein that is located in the nucleus.18 Antagonizing menin binding to the MLL fusion proteins has been shown to abolish their oncogenic properties in vitro and in vivo and thus represents an attractive drug target in hematology (Fig. 1B).19 Several groups have developed potent small molecules using structure-based drug design methods and involving many optimization cycles.20 Structural studies demonstrated that menin is composed of 3 tetratricopeptide (TPR) motifs that generate a large central cavity of ∼5000 Å3 that is lined with hydrophobic and acidic amino acids.21 Such cavities are typically attractive “druggable” sites and mutation and structural studies demonstrated that this cavity was responsible for the menin:MLL interaction.22 As a result many groups have focused on this cavity of menin as a target for the development of novel therapies in leukemia treatment.
Our workflow towards discovering menin–MLL binders consisted of the acoustic dispensing ejection-enabled (ADE) autonomous synthesis and analysis of a random library of 1536 heterocycles on a nanomole scale (Fig. 1A and C–H). The compounds were then screened as crude reaction mixtures for menin affinity by a high throughput differential scanning fluorimetry/thermal shift analysis (DSF/TSA)-based assay. Identified hits were resynthesized, purified and further characterized by microscale thermophoresis (MST), and our best performing compound was co-crystallized with the menin receptor to confirm binding and understand mode-of-action.
We used 71 isocyanides, 53 aldehydes, and 38 cyclic amidines (Fig. 2). The starting materials were chosen to represent multiple physicochemical properties and spanning an interesting chemical screening space. Specifically, we integrated multiple functional groups such as aliphatic and aromatic hydroxyl groups, amines, heterocycles, (pseudo)halogens, and carbonyls to enhance the chances of creating specific small molecule–receptor interactions. Many of the building blocks used herein were previously unknown or are unprecedented in the GBB-3CR.23 For example, amongst the cyclic amidine components (A), sterically hindered A10, hydrophilic A12, bifunctional A14 (–CHO), iodo building block A16, benzoxazole A29, and tetrazole A37. Unique aldehyde components (B) include, for example, hindered benzaldehyde B17, bifunctional B46 (–COOH), α,β-unsaturated B47 and B52, and acetal protected B50. Moreover, the majority of the isocyanides (C) used were previously unknown in the GBB-3CR, including amino acid derivatives C16, C20, C25, C29, acrylamide derivative C32 (as potential covalent inhibitors), or azide derivative C56 (for orthogonal tagging of protein targets) just to name a few. To avoid chemical space bias by synthesizing combinatorial libraries, we used a script that randomly combines building blocks in the reaction wells (ESI†-2.3.4). The theoretical chemical space of the above building blocks is 142994 (71 × 53 × 38); however, we synthesized only a small subspace of 1536 GBB-3CR products.
To assemble the library on a nanomole scale, we used an Echo 555 acoustic dispensing instrument.10,11b Briefly, acoustic dispensing is a contact-less, fast, precise, and accurate fluid transfer technology using ultrasound.25 A transducer below a source plate containing the building blocks stock solutions emits focused sound energy repeatedly to the meniscus of the fluid to be transferred. A stream of 2.5 nL droplets is ejected into the wells of an inverted destination microplate. By combining different building blocks in the destination plate, reactions can take place in a fast and miniaturized fashion. Each well was filled with 500 nanomole of reagents yielding a total volume of 3.1 μL. Thus, the filling of a 1536-well plate takes ∼10 h employing the Echo 555. Importantly, standard plate formats can be utilized in acoustic dispensing, such as 96-, 384-, 1536- or 3456-well plates that allow for facile integration in an established laboratory environment. A number of different solvents were tested and can be transferred on the Echo 555 platform: DMSO, DMF, water, ethylene glycol, 2-methoxyethanol, and N-methylpyrrolidone. The polar protic solvent ethylene glycol and 2-methoxyethanol were chosen for the nano-scale GBB reaction.
Next, we analyzed the reactions using direct mass spectrometry (Fig. 3A). After 24 h of reaction, each well was diluted with 100 μL of ethylene glycol and the reaction mixtures were injected directly into the mass spectrometer. We categorized the reaction success according to three different classes. Reactions showing as the main peak (M + H)+, (M + Na)+, (M + K)+, showing a peak corresponding to the desired products but not as the highest peak or not showing the desired product peaks at all, were classified as green, yellow and blue, respectively. Analysis of 1536 wells yielded 323 green, 281 yellow, and 932 blue reactions (Fig. 3 and S4–S7†).
Fig. 4 Resynthesized and purified molecules (green and yellow) on a 1 mmol scale, their structural diversity, and determined yields. |
The crude reaction mixtures were diluted with DMSO as an initial stock, then further mixed with protein solution at a ratio of 1:9 and screened against menin by DSF at a throughput of ∼190 wells per h (Scheme S1†). Thus a 1536 well plate can be measured in less than 1 day. Hits were then resynthesized, purified, and MST was utilized to cross-validate the binding affinity of the hits. The MST technique relies on the thermophoretic motion of the molecule in a temperature gradient induced by a high precision IR-laser beam.29 Changes in molecular diffusion resulting from binding events provide information on affinity. The validated hits showed binding affinity to the target protein in the μM range (Fig. 5F and S8†).
Generally, as ligand binding enhances the thermal stability of target protein, DSF would be the ideal way in which to distinguish the minor difference caused by the presence and absence of ligands.28 As a rapid readout, the first derivative (Fig. 5B) of the experimental fluorescence (Fig. 5A) provides a clear estimate of the Tm, indicated by the minima of the 1st derivative curve. Amongst one of the 96-well plates, while most of the mixtures remain close to the DMSO control, a compound labeled with its position code I-J21 showed a distinct pattern of two different dual peaks in the first derivative (Fig. 5B, orange curve). Comparing with the blank control, it dramatically elevated the Tm from 44 °C to 67 °C (Fig. 5C), even though a fraction of the protein remained the same Tm as in the DMSO control, as revealed in the first derivative curve (Fig. 5D). This biphasic melting phenomenon in thermal shift assay has been described before as an indicator of the presence of two populations caused by under titration of compound binding.30 In a calorimetric study, calcium at various concentration induced bovine α-lactalbumin undergoing a two-state transition, the excess calcium shifts the temperature from 27 °C to 63 °C, coming with a single peak split into dual-peak state.31 The same behavior has also been observed in ligands screening against the human estrogen receptor α ligand-binding domain (ERαLBD). Three compounds displayed significant secondary melting points, which are already known ERα agonists or antagonists with sub-nanomolar EC50.32 Inspired by this obvious hint, we resynthesized and tested the pure compound I-J21, for which the MST subsequently determined a 2.8 μM affinity to menin. Furthermore, I-C13 and III-O16 hits were also confirmed by MST (Fig. S8†).
Fig. 6 Structural basis of GBB-3CR compounds binding to menin. (A) The co-crystal structure of menin with I-J21 (PDB: 6S2K) and modeled III-O16 (menin is shown in surface representation, I-J21, and III-O16 as spheres); (B) and (C) details of the interaction between I-J21; (B) and III-O16; (C) residues in the receptor (the ligands are shown as cyan sticks, nitrogen and oxygen are shown in blue and red, respectively; hydrogen bondings, hydrogen bonding-pi interactions, pi–pi interactions, and van der Waals interactions are shown as red, blue, orange and green dotted lines, respectively). |
Here, we describe for the first time the automated synthesis of a library of 1536 compounds of heterocycles on a nanomole scale using fast and precise acoustic dispensing technology, followed by an equally fast biophysical screening of the crude library against a protein of therapeutic use. The reader should be very clear that while our approach allows an unprecedented speed of screening – due to the omission of a purification step – there is a significant price to be paid as many designed compounds are inefficiently synthesized. We believe that this issue can be addressed by overpopulating the reaction space. Our present and previous studies demonstrated that large compound libraries could be produced unattended in an automatic and very fast fashion on a nanomole scale, including quality control using mass spectrometry.10,11b,e Screening of the library for the protein–protein interaction menin–MLL using differential scanning fluorometry and microscale thermophoresis yielded several hits. Binding of compound I-J21 was confirmed through crystallization with menin. Although, our biophysical assays are not suitable to proof functional antagonisms, it has to be noted that I-J21 bind in the center of the menin–MLL protein–protein interaction. The progress of our work consists of the automated, miniaturized, and accelerated library synthesis and the screening of the unpurified compounds in the same format yielding in rapid, high-quality hit finding. Unlike the traditional screening process using prepared stored (often historical) libraries, we freshly synthesized a large 1536 compound library ‘on-the-fly’ and screened it using fast DSF and MST. Although the screening hits are far from optimized compounds, our approach shows an alternative process to classical HTS and triaging, followed by lengthy manual medicinal chemistry. Perhaps this presents a stepping stone for a future more sustainable continuous manufacturing and has the potential to greatly reduce the ecological footprint of early drug discovery.
The advantages of our process in the sense of green metrics are multiple. The miniaturized synthesis in our example on a 1536-well format leads to a great reduction of compound mass and waste (reduced E-factor), thus addressing the green principle avoidance. In the classical process of property optimization, compounds are often synthesized on a 0.5 mmol scale and purified, thus leading to the usage of multi kg amounts of precious reagents and solvents and chromatography material. The automation aspect of the synthesis leads to greatly increased safety and less exposure to hazardous fumes. The in situ quality control and biophysical screening of unpurified compounds not only removes the necessity of mass intensive purification but also adds to the green principles of real-time analysis for pollution prevention and inherently safer chemistry for accident prevention. An additional benefit of the herein used multicomponent reaction GBB-3CR is the inherent atom economy.23,33 The well-established convergency of MCRs (three or more building blocks assembly leading to a complex product) leads to a reduced environmental burden and to improved process efficiency through fewer operations. Clearly, all these factors not only contribute to the greenness of the process but also greatly reduced the costs. Based on the expected affinity we could reasonably expect of the fragments (at best ∼10 μM) we decided that a low value of thermal stability increase (0.5°) would reduce our false negative rates – allowing us to identify these initial weakly binding hits. We believe that the subsequent cross-validation by MST allows us to filter out the false positives, but choice of appropriate filters will likely differ with different targets. Equipment costs for the biophysical data collection are high (MST > 100kE, DSF > 10kE), consumable costs are low and such equipment can provide data on many targets. While we have previously published advances in the use of nanosynthesis coupled with direct biophysical screening,11e the herein exemplified miniaturized and accelerated synthesis, analytics, and screening provides the basis to come one step closer to the ultimate goal, a closed loop of automated and accelerated early drug discovery in the sense of continuous manufacturing.5
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d1md00087j |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2021 |