Alejandro Ariasa,
Sara Gómez*b,
Natalia Rojas-Valenciaac,
Francisco Núñez-Zarurd,
Chiara Cappellib,
Juliana A. Murillo-Lópeze and
Albeiro Restrepo*a
aInstituto de Química, Universidad de Antioquia UdeA, Calle 70 No. 52-21, Medellín, Colombia. E-mail: albeiro.restrepo@udea.edu.co
bScuola Normale Superiore, Classe di Scienze, Piazza dei Cavalieri 7, Pisa, 56126, Italy. E-mail: sara.gomezmaya@sns.it
cEscuela de Ciencias y Humanidades, Departamento de Ciencias Básicas, Universidad Eafit, Medellín, AA 3300, Colombia
dFacultad de Ciencias Básicas, Universidad de Medellín, Carrera 87 No. 30-65, Medellín, 050026, Colombia
eDepartamento de Ciencias Químicas, Facultad de Ciencias Exactas, Universidad Andres Bello, Autopista, Concepción-Talcahuano, Talcahuano 7100, Chile
First published on 10th October 2022
A series of prebiotic chemical reactions yielding the precursor building blocks of amino acids, proteins and carbohydrates, starting solely from HCN and water is studied here. We closely follow the formation and evolution of the pivotal C–C, C–O, CO, and C–N bonds, which dictate the chemistry of the molecules of life. In many cases, formation of these bonds is set in motion by proton transfers in which individual water molecules act as catalysts so that water atoms end up in the products. Our results indicate that the prebiotic formation of carbon dioxide, formaldehyde, formic acid, formaldimine, glycolaldehyde, glycine, glycolonitrile, and oxazole derivatives, among others, are best described as highly nonsynchronous concerted single step processes. Nonetheless, for all reactions involving double proton transfer, the formation and breaking of O–H bonds around a particular O atom occur in a synchronous fashion, apparently independently from other primitive processes. For the most part, the first process to initiate seems to be the double proton transfer in the reactions where they are present, then bond breaking/formation around the reactive carbon in the carbonyl group and finally rupture of the C–N bonds in the appropriate cases, which are the most reluctant to break. Remarkably, within the limitations of our non-dynamical computational model, the wide ranges of temperature and pressure in which these reactions occur, downplay the problematic determination of the exact constraints on the early Earth.
There are several hypotheses on how life originated, prioritizing either metabolism, self-replication, or a combination of both.6–10 In the RNA world hypothesis, RNA acted as information storage, replicator, and as an active catalyst.11–13 Other views argue in favor of RNA cooperating with peptides and other biomolecules.14 The metabolism-first hypothesis posits that the first living organisms appeared as the final steps of the evolution towards increasing complexity of networks of sequential chemical reactions aided by simple metal catalysts.10,15 The molecules of life (water, carbohydrates, proteins, nucleic acids, ATP, etc.) contain only a few types of atoms, most prominently hydrogen, carbon, oxygen, nitrogen, sulfur, phosphorous. Consequently, any rationalization of how those large biomolecules came to be, at least on Earth, must include detailed descriptions of the formation of C–C, CO, C–O and C–N bonds and of the formation of the small molecules that act as the initial building blocks in the series of chemical reactions affording such complicated primary sequences from primitive environments, presumably rich in one carbon (C1) molecules.1 One of those C1 molecules, abundant in extreme environments such as the interstellar medium, planetary and lunar atmospheres within our own solar system, comets, and in the conditions of the primitive Earth is HCN,16–19 which is a favorite target for the ultimate precursor of biomolecules.20–23 Recently, Das and coworkers,15 starting with binary mixtures of water and HCN, offered via molecular dynamics simulations in ab initio nanoreactors, a detailed picture of the myriad of chemical reactions leading to small molecules that are the building blocks of most large biomolecules, i. e., carbon dioxide CO2, formaldehyde CH2O, formic acid H2CO2, formaldimine CH3N, glycolaldehyde C2H4O2, glycine C2H5NO2, glycolonitrile C2H3NO, oxazole C3H3NO derivatives, among many others.
The scientific literature cited above has provided detailed mechanistic pathways identifying reactants, transition states, intermediates, and products as well as estimating the energetics of the chemical reactions that result in the formation of the target molecules building biochemical complexity. Sadly, crucial knowledge describing the formation of the all important C–C, C–O, CO, and C–N bonds from the perspective of primitive changes is found nowhere. Consequently, in this work, building on the results of Das,15 we have studied a network of reaction mechanisms yielding the experimentally reported precursor molecules of RNA and proteins. To that end, we closely follow the evolution of bonding interactions leading to formal bonds along the intrinsic reaction coordinates using the tools provided by the Quantum Theory of Atoms in Molecules (QTAIM)24–26 and the Natural Bond Orbitals (NBO).27–29 At this point, it is important to mention that we use QTAIM and NBO as formal analysis tools and make no attempt at discussing their merits and flaws, for which the reader is referred to the extensive literature. The combination of NBO, QTAIM, and other descriptors of bonding allows a detailed dissection of reaction mechanisms and has proven to be an enormously valuable tool to gain insight into the evolution of chemical bonds. A variety of reactions has been successfully studied in the past by our group using this methodology.30–37
We study the energetics of the entire set of 23 reactions listed in Table 1 and characterized every single reactant, product, intermediate, and transition state. However, because of the large number of intermediates and chemical reactions, in order to study the evolution of chemical bonding, we chose a set of five reactions that are particularly relevant in the complex reacting system. These reactions happen to be essential in the pathway towards the target products, which can not be obtained unless those reactions are considered. The point is, the formation of formic acid from HCN and water is the common root from where chemistry branches out into several other intermediates and products according to the following sequence of reactions:
Label | Reaction | ΔG†act | ΔGrxn | ||
---|---|---|---|---|---|
In gas phase | In solution | In gas phase | In solution | ||
a See Table S1 in the ESI for the intrinsic barriers. An asterisk attached to a label means that HNC is involved instead of HCN. Notice that reaction 14 leads to different products than those reported by Das and coworkers15 in gas phase. | |||||
1 | HCN + H2O → CH3NO | 57.2 | 63.8 | 14.3 | 17.0 |
2 | CH3NO → CH3NO | 12.9 | 15.9 | −12.0 | −14.1 |
3 | CH3NO + H2O → NH3 + CH2O2 | 55.9 | 58.5 | 2.4 | 5.3 |
4* | CH2O2 + HNC → C2HNO + H2O | 69.9 | 68.2 | 28.6 | 27.9 |
5* | C2HNO → CO + HCN | 34.6 | 27.3 | −38.0 | −38.2 |
6 | CO + H2O → CH2O2 | 75.7 | 76.2 | 54.6 | 53.7 |
7 | HCN → HNC | 58.0 | 48.8 | 18.4 | 17.7 |
8 | CH2O2 → CO2 + H2 | 52.9 | 53.1 | −21.8 | −16.0 |
9 | CH2O2 + H2 → CH4O2 | 64.3 | 61.1 | 4.9 | 6.9 |
10 | CH4O2 → CH2O + H2O | 35.6 | 37.4 | −5.9 | −8.5 |
11* | CH2O + HNC → C2H3NO | 31.9 | 32.4 | −20.8 | −19.8 |
12 | CH2O + NH3 → CH5NO | 44.6 | 39.1 | 4.4 | 6.6 |
13 | CH5NO → CH3N + H2O | 40.8 | 39.6 | −2.8 | −5.4 |
14* | CH3N + HNC → C2H4N2 | NA | 10.6 | NA | −27.2 |
14 | CH3N + HCN → C2H4N2 | 39.8 | NA | 11.2 | NA |
15 | C2H4N2 + H2O → C2H6N2O | 57.5 | 63.8 | 14.8 | 19.0 |
16 | C2H6N2O → C2H6N2O | 11.4 | 14.5 | −12.8 | −14.8 |
17 | C2H6N2O + H2O → C2H5NO2 | 53.7 | 57.3 | 1.4 | 1.2 |
18 | C2H3NO + CH2O → C3H5NO2 | 64.5 | 60.4 | −2.8 | −1.5 |
19* | C3H5NO2 → C2H4O2 +HNC | 37.1 | 31.5 | 19.6 | 17.2 |
20 | C2H3NO + CH3NO → C3H6N2O2 | 40.4 | 47.3 | 20.6 | 24.8 |
21 | C3H6N2O2 +HCN → C4H7N3O2 | 59.3 | 58.4 | 15.2 | 14.0 |
22 | C4H7N3O2 → C4H7N3O2 | 44.5 | 40.0 | −9.4 | −9.0 |
23 | C4H7N3O2 → C4H5N3O + H2O | 35.3 | 37.0 | −9.8 | −13.1 |
1. HCN + H2O →⋯→ HCOOH (formic acid). Reaction 3 is the last step of this series.
2. HCOOH →⋯→ H2CO (formaldehyde). Reaction 10 is the last step of this series.
3. H2CO + NH3 → CH5NO (methanol amine via reaction 12).
4. CH5NO →⋯→ C2H5NO2 (glycine). Reaction 17 is the last step of this series.
5. CH2O + HNC → C2H3NO (glycolonitrile via reaction 11).
Notice that all those intermediates (formic acid (reaction 10), formaldehyde (reaction 3), glycolonitrile (reaction 11), amino methanol (reaction 12)) have been experimentally reported to be precursors en route to the target molecules. Glycine is quite important because it is the only amino acid obtained in the entire reacting system. The chosen reactions enable the analysis of the formation and breaking of the very important C–O, CO, C–C and C–N bonds starting from C1 molecules. These bonds dictate the chemistry in all the molecules of life. Scheme 1 summarizes the reasons for choosing the set of five reactions listed above as follows: formic acid leads to the formation of formaldehyde. Formaldehyde triggers competitive reactions towards glycine (Strecker synthesis of amino acids) in one channel and glycolonitrile in another. Glycolonitrile yields two further reaction paths, one leading to the oxazole derivative and the other to the glycolaldehyde sugar.
Scheme 1 The set of five reactions (reactions 3, 10, 11, 12, 17) branching out from formic acid, chosen from Table 1, to study the formation and evolution of the C–C, C–N, C–O and CO bonds. |
IRCs are conveniently dissected by the critical points of the reaction force, defined, as for any conservative system, as the points for which the negative derivative of the potential energy along the reaction coordinate vanishes (F = −dV/dξ = 0).40–43 For a given elementary step, three critical points are obtained, corresponding to reactants (ξR), transition state (ξTS), and products (ξP); the critical points of the reaction force yield the activated reactants (ξR*) and activated products (ξP*). These five points along the IRC are the limits of three well defined regions, namely, the reactants region, associated to the structural changes needed to bring the reactants into the geometries needed for the reaction to occur and covering the ξR → ξR* interval, the transition state region, the region where most of the electron activity needed for bond breaking/formation takes place, covering the ξR → ξP* interval, and the products region, where the activated products suffer further structural changes to finally release the products, covering the ξP → ξP* interval. The chemical potential along the reaction path is obtained as half the sum of the energies of the HOMO and LUMO orbitals at each point of the IRC,44,45 and the reaction electron flux (REF), is calculated as the derivative of the chemical potential along the entire reaction path.46–48
The evolution of bonding interactions was followed using the set of descriptors provided by QTAIM and by NBO and the recently developed GGR synchronicity index.35 In lieu of a detailed description and to spare the technical details of how QTAIM and NBO descriptors are related to bonding, the interesting reader is directed to the specialized literature,24–29 more explicitly, to the work of Rojas-Valencia et al.49 (their Table 2 and the ensuing description) and references within. All electronic structure calculations including optimization, vibrational frequencies, solvent effects and IRCs were conducted using the Gaussian09 suite of programs.50 QTAIM quantities were derived from the AIMAll suite.51 NBO descriptors were obtained from NBO6 (ref. 52) as implemented in Gaussian.
There is a high degree of uncertainty on the conditions of Hadean Earth,53–57 with estimates ranging from very low pressures to well in excess of 20 atm and with equally unconfirmed temperatures, thus, in this work, the energetic balance of the reactions as well as the associated full and intrinsic barriers are reported as changes in the Gibbs free energies within bidimensional grids of pressure and temperature. Accordingly, an exhaustive study of the dependency of activation energies with temperatures and pressures over a grid of [293373] K and [0.5,20] atm was carried out, the results for elementary steps 3 and 11 leading to formic acid and glycolonitrile (Table 1) are shown in Fig. 1, additional plots are provided in the in the ESI (Fig. S1 and S2†). A few important points are immediately drawn. First, only moderate changes in activation energies as a function P, T are observed. This is a very important observation because given the current poor knowledge of primitive conditions, our calculations indicate that once the thresholds for these conditions are overcome,9 then the chemical reactions will occur under a wide variety of conditions. Second, activation energies are relatively high and not too far from the DFT calculations of Das and coworkers, nonetheless, a few of the calculated energy barriers are low enough to be encouraging about the feasibility of these processes to occur.58–60 In any case, all these processes would greatly benefit from natural catalysts such as clays and metal surfaces, which have been postulated to be pivotal components of prebiotic chemistry.61–64
Fig. 1 Absolute activation energies for elementary steps 3 and 11 (Table 1), in vacuo (left) and in solution (right), over a grid of temperatures in [293, 373] K and pressures in [0.5, 20] atm. |
The Gibbs activation barriers listed in Table 1 lead to slow processes, yet these reactions must be thermodynamic and kinetically viable to be considered as part of likely mechanisms for the formation of life building blocks in prebiotic conditions. These energetic and kinetic observations, in addition to the extreme complexity offered by the ab initio nanoreactor, suggest a more efficient production of the precursors studied in this work under catalytic environments, which are not considered in this work. Notwithstanding, our approach offers valuable insight to understand the bonding interactions driving the reaction mechanisms and should be considered a stepping stone to explicitly deal with the role of catalysts, work that is already in progress in our group.
Several of the reactions under discussion have been already studied with various forms of catalysts, which have proven to significantly reduce activation barriers. For example, Krug et al. studied the hydrolysis of formamide (reaction 3) under basic and acidic catalysis, and obtained a 6 kcal mol−1 barrier for the N-protonated case, 19 kcal mol−1 for the basic promoted case, and a stepwise mechanism for the O-protonated case, with two barriers of 24 and 16 kcal mol−1 for the nucleophilic addition to C and the breaking of the C–N bond respectively.65 Thus, significant reductions of the barriers were found when compared to the neutral case, which afforded a 44 kcal mol−1 barrier. These environment dependent catalysis are relevant since the exact pH under prebiotic conditions is unknown, with estimates pointing to non-neutral scenarios.66 Ice has also been suggested as a proper catalyst in prebiotic conditions. Despite the lack of a universally accepted geochemical model for the primitive earth, well grounded geochemical models that call for the presence of stable icy environments have been proposed.67–70 The icy protons allow the so called “proton-relay” mechanism to take place,71,72 in which a network of hydrogen bonded water molecules present in the ice surface assist in the proton transfer required in most of the set of 23 reactions listed in Table 1. Rimola et al. successfully assessed the catalytic effect of ice in the well known Strecker synthesis of amino acids, reporting barriers of 9.6, 21.3, and 8–9 kcal mol−1 for reactions 12, 13 and 14 respectively.73 Our isolated gas phase calculations are in agreement with their results.
Reaction 3, the neutral hydrolysis of formamide, which we will use to dissect and highlight mechanistic trends in the set of studied prebiotic reactions, has been studied before in the gas phase and in solution with explicit consideration of either one single water molecule as reactant or assisted by an extra water molecule as catalyst.65,74–76 Stepwise and concerted mechanisms have been reported. After refinement by the highly correlated model chemistry used in this work, our mechanism, based on the initial reactive trajectory obtained from the ab initio nanoreactor of Das et al. seeking to satisfy a set of likely prebiotic conditions of temperature, absence of metal catalyst, etc.,15 consists of a concerted water assisted reaction involving two water molecules in the transition state. For the gas phase, concerted, water assisted hydrolysis, Antonczak et al.77 reported a Gibbs free energy of activation of 49.6 kcal mol−1 computed at the MP2/6-31G**//HF/3-21G level at 298 °C and 1 atm, which is consistent with our computed values, 50.5 kcal mol−1 (absolute) and 45.7 kcal mol−1 (intrinsic). Later, using an improved theoretical framework (MP2/6-311+G**//B3LYP/6-31G*), Gorb et al.78 studied the mechanism in solution through different solvent models considering, in all of them, five explicit water molecules. They obtained for the assisted concerted reaction, a Gibbs activation energy of 47.6 kcal mol−1 at the same conditions. Our mechanism in solution, computed with the substantially improved MP2/6-311++G(d, p) yields activation free energies of 53.2 kcal mol−1 (absolute) and 42.8 kcal mol−1 (intrinsic). The energies computed at 298 °C and 1 atm that we use to compare against previous studies are covered in the grids of temperature and pressure presented in Fig. S1 and S2†.
On the grounds of purely electronic energies, the profiles shown in Fig. 3 indicate that reactions 3, 10 and 17 consume energy while reactions 11 and 12 release energy, however, Gibbs energies predict reaction 10 to be exergonic and reaction 12 to be endergonic. Notice that purely electronic energies describe only intrinsic processes (intermediate⋯products) while changes in Gibbs energies are calculated with the isolated reactants as reference. Thus, it is clear that in reaction 10, accounting for entropy and internal degrees of freedom results in a strongly stabilized intermediate while in reaction 12 the intermediate is strongly destabilized. Table 1 reveals only slight changes in activation energies in going from gas phase to solution.
Fig. 3 Relative energies in the IRC (top row), reaction forces (second row), chemical potential (third row), and reaction electron flux (bottom row) corresponding to elementary steps leading to the formation of formic acid (reaction 3, Fig. 2), formaldehyde (reaction 10, Fig. 2), glycolonitrile (reaction 11, Fig. 2), methanolamine (reaction 12, Fig. 2) and glycine (reaction 17, Fig. 2). Solid vertical lines define the reactants ξR → ξR*, transition state ξR → ξP*, and products ξP → ξP* regions. The position of the TS is marked by a dashed vertical line. |
Most reactions involve large regions of structural changes before reaching the activated reactants, that is, are far away from the reactants and comparatively closer to the transition states. The consequences for the reactions in the primitive Earth are that the environment needs to provide the means for the reactions to occur, in other words, that somewhat extreme conditions of temperature and pressure (especially temperature) are needed for the molecules to undergo the required structural changes leading to positive collision events. Transition states are neither earlier nor late, rather, transition state regions are well separated from reactants and products via large regions of a priori structural changes. Thus, as a general trend (there are exceptions), |ΔG†| > |ΔGrxn|.
The reaction force, defined as the negative derivative of the potential energy with respect to the reaction coordinate along the reaction path provides a useful dissection of the entire mechanism: negative regions are associated with retarding forces while positive regions are associated with driving forces, thus, the local extrema of the force, corresponding to the second derivative of the IRC, or in other words, the inflection points of the IRC determine the progress of the chemical reaction as comprising a region of structural changes ξR → ξR* opposing the chemical transformation, a region of mostly electronic rearrangements, ξR* → ξTS → ξP* in which the force sharply switches from retarding (before TS) to driving (after TS), and a final structural relaxation ξP → ξP* region that finally yields the corresponding products. There is an apparent inverse correlation between the length of the transition state region ξR* → ξTS → ξP* and activation energies, namely, the smaller the TS region, the larger the activation energy. This observation is consistent with saying that larger activation energies are associated to larger TS regions, thus, most of the energy from the environment is transferred to the reacting system to overcome structural changes.
Steps 3 and 17 – both hydrolysis reactions – deviate from the typical behavior of the reaction force for an elementary step, characterized by a global maximum and global minimum connected through the transition state at ξ = 0. For these two reactions we observe two additional stationary points in the force plot, a local maximum and local minimum. This observation suggests that both reactions occur via a single, complex, concerted, borderline two-step mechanism, i.e., there are two well differentiated regions along the IRC path where electronic activity predominates, and it also serves as an early indication of a formation/breaking of bonds taking place in a non-synchronous fashion. Further evidence for borderline two-step characterization of reactions 3, 17 comes from the maximum of the chemical potential for both reactions in the region products, way past the transition state region. The electron flux plots accentuate the visualization of heavy electron activity outside the transition state region for reactions 3, 17, thus providing additional support for all the previous observations.
Fig. 4 Descriptors of the evolution of bonding as a function of the reaction coordinate for the one step concerted reactions leading to the formation of formic acid (reaction 3, Fig. 2), formaldehyde (reaction 10, Fig. 2), glycolonitrile (reaction 11, Fig. 2), methanolamine (reaction 12, Fig. 2) and glycine (reaction 17, Fig. 2). Wiberg bond indexes (first row), their derivatives (second row) and GGR synchronicity35 (third row). Solid vertical lines define the reactants ξP → ξR*, transition state ξR → ξP*, and products ξP → ξP* regions. The position of the TS is marked by a dashed vertical line. |
For the set of five reactions shown in Fig. 2 a general mechanistic trend may be established from the plots in Fig. 4 in the context of a single step concerted reaction in each case. For the most part, the first process to initiate seems to be the double proton transfer in the reactions where they are present, then bond breaking/formation around the reactive carbon in the carbonyl group and finally rupture of the C–N bonds in the appropriate cases, which are the most reluctant to break. The reluctance of the C–N bond to break is clear from the fact that this rupture occurs far behind all other primitive processes, which have significantly progressed at the late stages of the reaction (this is true if reaction 12 is inverted). This is a pivotal point that highlights the importance of water not only as the medium in which all these reactions occur but also as a non innocent spectator catalyzing and actively taking part in most reactions (hydrogen atoms in water molecules end up in the products). In all cases, the double proton transfer process occurs mostly within the boundaries of the given TS region. Conversely, it is not uncommon for the breaking/formation of C–N, C–O, CO and N–H bonds to anticipate or to extend beyond the TS region. In other words, besides the structural changes needed to bring reactants to the activated form and to deactivate the activated products, a sensible degree of electronic activity is still needed in the reactants and products regions.
Maximization of the chemical potential (Fig. 3) occurs ahead of the TS region for reaction 12 and beyond the TS region for reactions 3, 17. This is consistent with all descriptors in Fig. 4 which show high electron activity involved in the early formation of the C–N bond (reaction 12) and in the late breaking of the C–N bond in reactions 3, 17. Reaction 10 is a well known dehydration reaction involving the transformation of methanediol into formaldehyde under the catalytic action of one water molecule that enables double proton transfer in our mechanism. This reaction deviates from other perfectly synchronous double proton transfers (i.e. in the dimer of formic acid79) because the two protons transferred in reaction 10 originate from different O–H bonds, C–O–H from methanediol and O–H from water.
Fig. 5 Descriptors of the evolution of bonding as a function of the reaction coordinate for the one step concerted reactions leading to the formation of formic acid (reaction 3, Fig. 2), formaldehyde (reaction 10, Fig. 2), glycolonitrile (reaction 11, Fig. 2), methanolamine (reaction 12, Fig. 2) and glycine (reaction 17, Fig. 2). Electron densities at bond critical points (first row), virial ratios at bond critical points (second row), and representative intermolecular donor → acceptor orbital interactions (third row). Solid vertical lines define the reactants ξR → ξR*, transition state ξR → ξP*, and products ξP* → ξP regions. The position of the TS is marked by a dashed vertical line. |
Without a single exception, all broken/formed bonds are formally covalent, as characterized by virial ratios larger than 2 and large accumulations of electron density at bond critical points and by well defined σ, π bonds. This means that there are no ionic reactions and that all reactants and products have well defined bonds. As suggested above, hydrogen transfer seems to kick off the reacting system in the appropriate cases, that is, changes in electron density and virial ratios for the associated bonds seem to initiate ahead of all other primitive changes followed by the breaking/formation of CO, C–O, and C–C bonds and mostly independent of the chemical transformation of C–N bonds.
In Fig. 5, bottom panel, all curves correspond to donor → acceptor interactions within the NBO paradigm. For those special cases in which a third lone pair on a given O atom arises, the corresponding interactions are followed via dotted lines. In general, breaking and formation of sigma bonds is due to a maximization in charge transfer following the nO → σ*, nO → π*, nN → σ*, nN → π* rule. In agreement with all other descriptors, in the NBO context, both proton transfers occur very early during the reaction while the bonds associated with heavy atoms are transformed later within the same concerted step. Notice that for each one of the proton transfers, the formation of one σO–H orbital occurs simultaneously with the breaking of a second σO–H orbital associated with the transferred proton. Opposite to the double proton transfer, the bonds involving heavy atoms, i.e., formation of C1–O2 and breaking of C1–N3, do not occur simultaneously. In fact, the formation of the σC1–O2 is completed at the activated products coordinate and precedes the breaking of σC1–N3, which is not finalized until the very end of the reaction, at the coordinate of the products. Although after the reaction has taken place the πC1O4 orbital remains intact, it is by no means (just like the catalytic water) an innocent spectator for the reaction to occur: reorganizing its electron density is essential during the ξTS → ξP* interval in which a π → σ transformation occurs, inducing the subsequent breaking of the C1–N3 orbital followed by the restoring of the πC1O4 orbital in the formic acid of the final products.
Fig. 6 Orbital interactions leading to the breaking/formation of bonds driving the production of formic acid and ammonia from formamide and two water molecules along the lines of the one step, concerted mechanism of reaction 3 (Fig. 2). All donor → acceptor interaction energies in kcal mol−1. |
Scheme 2 Reaction mechanism covering the entire ξR → ξR* → ξTS → ξP* → ξP IRC, as dissected by the reaction force (Fig. 3), for the one step concerted formation of formic acid from formamide and two water molecules. Notice that one of the water molecules is incorporated into the final formic acid and that the other is not an innocent spectator as indicated by the colors associated to the double proton transfer. See Fig. S3 in the ESI† to follow the natural charges at all oxygen atoms justifying the assignment of formal negative charges at specific points. |
As discussed before, every single reaction begins by forming a stabilized long range complex, ξR in Scheme 2. ξR is the starting point of the one step concerted reaction. In the case of reaction 3, this complex involves two specific intermolecular hydrogen bonds of strengths comparable to the isolated water dimer: H–N⋯H–O E(2)a = −4.75 kcal mol−1, ρ(rc) = 1.7 × 10−2 a. u, in F1⋯W1, and H–O⋯H–O E(2)a = −6.51 kcal mol−1, ρ(rc) = 2.4 × 10−2 a. u, in W1⋯W2. The formation of these complexes leads to somewhat shallow, purely electronic energy wells which may be overcome simply by correcting for the ZPE energies or by accounting for temperature, entropy, and internal degrees of freedom (i.e., using Gibbs energies) as concluded above during the discussion of the energetics of the reaction profiles. The formation of these complexes requires minimal distortion of the electron densities from the isolated reactants, and thus the shallow wells. The regions within the IRC are conveniently analyzed in a separate fashion as follows:
ξR →ξR* region: two important developments characterize this region: first, somewhere within this interval (at ξ = −14.97 a0 a.m.u1/2), a critical point for the C1⋯O2 interaction appears, this interaction strengthens steadily up to the coordinate of the activated reactants, however, not at the same rate as the interactions involved in the double proton transfer as consistently shown by all synchronicity, QTAIM, and NBO descriptors in Fig. 4, 5 and 6, respectively. Second, all orbital interactions associated to intermolecular contacts increase by large factors (from 4.75 to 185.35 kcal mol−1, etc., Fig. 6). These observations are consistent with the sensible structural changes needed to accommodate all reacting fragments in the positions required for the reaction to occur. Notice that although structural changes dominate this region of the IRC, it is not void of electronic activity.
ξR* →ξTS region: the double proton transfer is so early for this reaction that both protons have completely migrated at ξTS. Multiple evidence supports this claim: virial ratios larger than 2 thus indicating covalent bonding ( for σN3–H8 and for σO5–H12), large bond orders, and the clear presence of well defined σN–H, σO–H orbitals for the newly formed bonds (Fig. 6). This reorganization of the electron density yields formal negative and positive charges at O2 and N3 respectively (Scheme 2), which are major players driving the progress of the reaction beyond the TS. The asynchronous character of this reaction is clearly seen in the fact that up to ξTS, two O–H bonds have been broken, one O–H and one N–H bond have been formed, while C1⋯O2 remains a long range contact and C1–N3 remains a well defined σ bond (for additional support, see the evolution of all bonds in the GGR synchronicity index, bottom panel in Fig. 4).
ξTS →ξP* region: this region is mostly devoted to the formation of the σC1–O2 bond via a classic nucleophilic attack from the negatively charged H–O:− to the carbon atom in the carbonyl group. The formation of the σC1–O2 bond results in a reorganization of the electron density of the CO bond to produce a formally charged C–O− fragment (Scheme 2), beautifully supported by the emergence of a third lone pair on the carbonyl oxygen (O4) right at (ξP*). There is a progressive weakening of the soon to be broken C–N bond in going from ξTS to (ξP*) as seen in the steady decrease of the bond order, electron density, and virial ratio.
ξP* →ξP Region: this step involves separation of the fragments, restoring of the πCO bond, and final rupture of the C1–N3 bond to liberate ammonia and formic acid. Evidence for the rupture of the C1–N3 bond: the virial ratio goes from 2.92 (covalent bond) to 0.82 (long range interaction); the bond order goes from 0.73 to 0.03; the derivative of the bond order decreases substantially; the σC1–N3 orbital ceases to exist; a lone pair is formed in N3. The driving force for the breaking of the C1–N3 bond is unveiled by NBO in the form of destabilizing charge transfer from the formally charged C–O− fragment freshly formed at (ξP*), as clearly shown by the dotted purple line at the leftmost bottom panel in Fig. 6. Notice that the breaking of the C1–N3 bond occurs very late in the reaction, way past the activated products, just beyond the point in which the C1O4 carbonyl is restored and the third lone pair is no longer available after doing its job of weakening C1–N3.
Until the discovery of the actual causes and mechanisms of the origin of life, we may assert in a pragmatic although speculative way, that a set of conditions are necessary. Namely, a network of chemical reactions capable of producing the building blocks of the molecules of life (such as the ones studied in this work) in local environments that further facilitate their combination to produce large molecules of such astonishing diversity and complexity as proteins and DNA. Our calculations unveil only moderate changes in activation energies as a function of pressure and temperature for the set of reactions studied here (Fig. 1, S1 and S2†), this observation supports the perspective of life just exposed because given the current poor understanding of the conditions of the Hadean earth, these small changes in reaction barriers indicate that above those thresholds, the chemical reactions producing the building blocks for the molecules of life will occur under a wide variety of conditions. The local favorable environments are the product of large scale cumulative changes in the course of billions of years brought about by deterministic molecular processes, that is, chemical reactions, and the consequential physical and chemical changes of the environment (slow building of a tolerable atmosphere, regulation of temperatures and acidic conditions, etc.) that allow the emergence and evolution of life under a particular set of local environmental evolution pressures.
The ever increasing entropy of the universe, as stated by the second law of thermodynamics, is the ultimate reason behind the molecular scale evolution (deterministic, not random, because of the need to satisfy the equilibration of chemical potentials in reacting systems) that changes the chemical landscape and then the environment itself. A second factor contributing to the increase in entropy of the universe in the process of originating life is the complexity of the multitude of reaction channels accessible to produce the small molecules needed to make large biomolecules, as is clearly the case in the present work. Furthermore, proteins are built upon primary sequences of just 20 amino acids, thus, there are on the order of 2.4 × 1018 possible different primary sequences for even the smallest proteins.83 Under this perspective, any local environment (a planet, a moon, or regions within) capable of undergoing the necessary changes under periods of time long enough to avoid major catastrophes caused by external agents, is poised to originate and sustain life. There is a high probability that once set in motion, many of these evolution patterns may not succeed in creating life, but by virtue of the immense dimensions of our observable universe, the emergence of life is not only deterministic, but should be a rather common place.
Footnote |
† Electronic supplementary information (ESI) available: Cartesian coordinates for all isolated reactants, long range complexes, transition states, and products. Gibbs activation and reaction energies in gas phase and in solution for the set of 23 reactions are provided. Plots of absolute and intrinsic activation energies for reactions 10, 12, 17 over grids of temperature and pressure. Plots of the natural charges at all oxygen atoms for followed along the entire IRC of reaction 3. See https://doi.org/10.1039/d2ra06000k |
This journal is © The Royal Society of Chemistry 2022 |