Exploring the Core Formose Cycle: Catalysis and Competition

Kua, Jeremy; Tripoli, L. Philip

doi:10.3390/life14080933

Open AccessArticle

Exploring the Core Formose Cycle: Catalysis and Competition

by

Jeremy Kua

^*

and

L. Philip Tripoli

Department of Chemistry and Biochemistry, University of San Diego, San Diego, CA 92110, USA

^*

Author to whom correspondence should be addressed.

Life 2024, 14(8), 933; https://doi.org/10.3390/life14080933

Submission received: 14 June 2024 / Revised: 10 July 2024 / Accepted: 19 July 2024 / Published: 25 July 2024

(This article belongs to the Special Issue Feature Papers in Origins of Life 2024)

Download

Browse Figures

Versions Notes

Abstract

:

The core autocatalytic cycle of the formose reaction may be enhanced or eroded by the presence of simple molecules at life’s origin. Utilizing quantum chemistry, we calculate the thermodynamics and kinetics of reactions both within the core cycle and those that deplete the reactants and intermediates, such as the Cannizzaro reaction. We find that via disproportionation of aldehydes into carboxylic acids and alcohols, the Cannizzaro reaction furnishes simple catalysts for a variety of reactions. We also find that ammonia can catalyze both in-cycle and Cannizzaro reactions while hydrogen sulfide does not; both, however, play a role in sequestering reactants and intermediates in the web of potential reactions.

Keywords:

origin of life; thermodynamics; kinetics; prebiotic chemistry; formose reaction; sugars

1. Introduction

It is curious that autocatalytic cycles are ubiquitous in living systems [1] but rare in benchtop chemistry. The oligomerization of formaldehyde (CH₂O) into sugars via aldol reactions, known as the formose reaction, is one of the rare exceptions. Its mechanism was first elucidated in 1959 by Breslow [2], with an update in 2014 [3], and has been extensively studied for its potential in food production [4]. While often mentioned in origin-of-life chemistry as a “simple” route to ribose (and other sugars utilized in extant biochemistry), the formose reaction typically forms a complex and intractable mixture under (catalytic) alkaline conditions [5] with acids and alcohols produced via the competing Cannizzaro reaction.

Early prebiotic chemical syntheses focused on finding conditions to increase ribose selectivity; three examples are using phosphates [6], borate [7], or trying to find the best metal catalysts [8]. But more recently, the messiness of the formose reaction is embraced as a feature, not a bug. The Huck group systematically probed how environmental conditions affect the observed product distribution [9,10,11]. Paschek et al. investigated the effect of catalysts that might be present in carbonaceous chondrites [12], Omran considered chemical gardens resembling hydrothermal vents [13], Vinogradoff et al. utilized olivine silicate catalysts [14], and Haas et al. examined mechanochemical effects [15].

The formose reaction also caught the attention of computational modelers as a classic test case to examine chemical reaction networks. Examples include heuristics-aided quantum chemistry [16], graph-theoretical approaches coupled with quantum calculations [17], and utilizing machine learning on reaction-rule networks [18]. These large-scale studies took a bird’s-eye view, showcasing the complexity of the system with its myriad compounds, but do not focus on individual reactions or how molecules in the network may act recursively as catalysts. Our present work complements these approaches by examining in detail one small portion of the formose reaction—its smallest autocatalytic core. At this small yet focused scale, we previously generated a free energy map (thermodynamics and kinetics) of the uncatalyzed formose reaction [19]; our present work extends the earlier work by examining the effect of catalysts and exploring reactions that may sequester reactants and intermediates in the smallest autocatalytic cycle.

The smallest autocatalytic cycle of the formose reaction is shown in Figure 1. CH₂O is the C₁ “food” species. The linchpin C₂ species is glycolaldehyde. C₃ and C₄ species can be produced via successive aldol addition of C₁ food; larger sugars are also synthesized via this route. The key to autocatalysis is that C₄ can undergo a retro-aldol reaction to produce two linchpin C₂ species, thereby accelerating the consumption of C₁ food with each successive turn of the cycle. In the formose reaction, myriad aldol condensations and retro-aldol reactions take place; for example, you could have C₂ + C₃ → C₅ or C₆ → C₃ + C₃, although a core autocatalytic cycle can be achieved with just C₁ through C₄ species.

Several features of this core cycle deserve mention, and we will make reference to our earlier work [19] on the free energy map of the thermodynamics and (uncatalyzed) kinetics of these reactions. Forming the C₂ linchpin directly from CH₂O (the initiation step) is very challenging kinetically because, without an umpolung species, it is difficult to make C–C bonds; our previously calculated barrier is 45.3 kcal/mol. However, once a small amount of the C₂ linchpin is formed, the direct dimerization is bypassed. Food is consumed via the much lower barrier C₂ + C₁ → C₃ and C₃ + C₁ → C₄ reactions. The retro-aldol C₄ → C₂ + C₂ reaction regenerates (more) C₂ and accelerates the consumption of C₁. As the food supply dwindles, a wider variety of sugars in the C₅ to C₇ range are produced.

Off-cycle reactions can deplete the concentrations of the C₁ to C₄ species involved in the cycle. The Cannizzaro disproportionation reaction parasitizes the cycle by removing CH₂O food, but by producing HCOOH, it may add a catalyst to enhance reaction rates within the cycle. (The C₂, C₃, and C₄ sugars can also undergo Cannizzaro reactions.) All the aldehydes can form polyols by oligomerization reactions forming C–O bonds. This could sequester some of the reacting species, slowing down reactions in the cycle, but also reduce parasitic reactions and provide an equilibrating pool of reactants, thus lengthening the lifetime of autocatalytic cycles. Keto-enol isomerization reactions (e.g., glyceraldehyde to dihydroxyacetone for the C₃ species) or ring closures (e.g., erythrose or threose for the C₄ species) may play similar sequestering roles.

Our previous free energy map only considered uncatalyzed reactions in aqueous solution under neutral conditions. The present work considers whether other small molecules may act as catalysts to influence the kinetics or act as sequestering agents to influence the thermodynamics of the reactions involved in (or adjacent to) the core cycle. The small molecules being considered in this article are (1) the C₁ products of the Cannizzaro reaction CH₃OH and HCOOH; (2) small molecular acids that may be found in hydrothermal vent reactions, such as carbonic acid and acetic acid; and (3) small molecules that provide reduced nitrogen and sulfur, NH₃ and H₂S, that may have been present at life’s origin.

To keep this investigation tractable, there is much we excluded. The rich chemistry of HCN (and formamide) is the subject of a future paper. Cyanide provides an umpolung C₁ reactant and combines readily with aldehydes, making C–C bonds, and the plethora of intermediates increases many-fold. HCN is also a “high energy” reactant compared to the carbon and nitrogen sources that are prebiotically plausible: CO₂ and NH₃. Our calculated free energy change for the reaction CO₂ + NH₃ + H₂ → HCN + 2 H₂O is +18 kcal/mol. (Carbon monoxide is also excluded in this study and may be considered a cyanide analog). We have also excluded ions because the results are less than satisfactory with our current computational protocol (see Section 2 below). Finally, we limit ourselves to just the C₂ to C₄ “sugars” in the smallest autocatalytic cycle, as shown in Figure 1.

This article is organized as follows: After describing our computational protocol and its limitations, the combined Results and Discussion section (Section 3) will go through each of the reactions relevant to Figure 1, starting with C₁ and working our way through the C₂, C_3, and C₄ reactions. Finally, we will consider sequestering the sugars by direct reaction with NH₃ and H₂S.

2. Computational Methods

Since we will be comparing our present calculations to our previous work, we apply the same computational protocol [19]. Here, we provide a brief description of that protocol for the convenience of our readers. Some of the text in this section is reproduced from our recent work on CHO systems published in this journal [20] since we think that description is both clear and succinct. Essentially, we calculate the free energies in an aqueous solution using quantum chemical methods, and our protocol shows suitable agreement with available experimental results for CHO systems [19,20,21,22]. Here are the computational details:

The geometry of each molecule is optimized, and its electronic energy is calculated at the B3LYP [23,24,25,26] in favor of density functional theory with the 6-311G** basis set. To maximize the probability of finding global minima, multiple conformers are generated using molecular mechanics (MMFFs force field [27]). The optimized structures are embedded in a Poisson–Boltzmann continuum to calculate the aqueous solvation contribution to the free energy. While this does not provide a specific concentration, it assumes a dilute solution such that the electrostatic field generated by a neighboring solute molecule is effectively screened by the water solvent. One can consider all solutes to have the same relative concentrations in our calculations. Zero-point energy corrections are included, and we apply the standard temperature-dependent enthalpy correction term (for 298.15 K) from statistical mechanics by assuming translational and rotational corrections are a constant times kT and that low-frequency vibrational modes generally cancel out when calculating enthalpy differences. So far, this is standard fare.

However, entropic corrections in aqueous solution are problematic [28,29,30]. Changes in free energy terms for translation and rotation are poorly defined in solution due to restricted complex motion, particularly as the size of the molecule increases (thus increasing its conformational entropy). Free energy corrections come from two different sources: thermal corrections and implicit solvent. Neither of these parameters is easily separable, nor do they constitute all the required parts of the free energy. We follow the approach of Deubel and Lau [31], assigning the solvation entropy of each species as half its gas-phase entropy (calculated using standard statistical mechanics approximations similar to the enthalpy calculations described above), based on proposals by Wertz [32] and Abraham [33] that upon dissolving in water, molecules lose a constant fraction (~0.5) of their entropy.

To estimate activation energies, transition states were optimized by including several explicit water and/or catalytic molecules to aid in transferring H moieties. All calculated transition states have one significant negative eigenvalue corresponding to the reaction coordinate (eigenvector) involving bond breaking/forming. Several conformers are tested in each case, and we only report the lowest calculated barriers.

When put to the test by first calculating the equilibrium concentrations in a self-oligomerizing solution of 1 M glycolaldehyde at 298 K, our protocol fared very well compared to subsequent NMR measurements [22]. Our relative Gibbs free energies in aqueous solution are typically within 0.5 kcal/mol compared to experimental results. That being said, our protocol did show systematic errors of 2–3 kcal/mol when calculating barriers involving carbonyl chemistry when compared to experimental results. Going to a higher level of theory does not reduce this error [34]. For the few S_N2 reactions we considered, having to daisy chain an additional 3–4 water molecules to facilitate proton transfer results in barriers that may be up to 10 kcal/mol too high, consistent with our previous work [19,35]. Using carboxylate anions with diffuse functions to represent weak acids gives poorer results compared to keeping the COOH group neutral [20]. Our previous attempts to incorporate H₃O⁺ (and its larger cousins) as catalysts also gave unrealistic barriers and structures, with transition states quickly falling into local minima [36]. We have also attempted using Group I or Group II metal cations in conjunction with hydroxide as catalysts (unpublished results) that run into similar issues.

Quantum chemistry is about error cancellation, and our protocol (with its foibles, including the simplistic entropy correction) has worked well even with this systematic error for activation barriers. Thus, we do well on thermodynamics and just okay on kinetics (but at least we are in the ballpark for the majority of the carbonyl chemistry and aldol reactions in this work), and we want to be very clear about both the opportunities and the limitations of our protocol in this brief description.

3. Results and Discussion

Throughout this paper, for the thermodynamics of a chemical reaction, the reaction free energy will be designated ΔG. For a transition state, kinetic barriers will be designated ΔG^‡ and will typically refer to the forward direction. From the ΔG and ΔG^‡ values for the forward reaction, the thermodynamics and kinetics of the reverse reaction can be easily ascertained. All numerical values are in kcal/mol, abbreviated as kcal. In each subsection, we begin with the uncatalyzed reaction (water may facilitate proton transfer) before examining the inclusion of a potential catalyst. The potential catalysts in this study are NH₃, H₂S, methanediol (the hydrated form of CH₂O, which is dominant in aqueous solution), the C₁ Cannizzaro products CH₃OH and HCOOH, along with three other acids (carbonic, acetic, and glycolic) that are likely to be present in prebiotic systems.

3.1. Formaldehyde Hydration

As the main “food” species, CH₂O in aqueous solution is typically hydrated and exists primarily as methanediol, CH₂(OH)₂. The reaction is as follows:

CH₂O + H₂O → CH₂(OH)₂ ΔG = −4.6 kcal

The optimal uncatalyzed transition state includes two additional explicit water molecules forming an eight-center transition state, as shown in Figure 2A; the barrier ΔG^‡ is 13.2 kcal. The reactant water molecule is labeled III, and the additional water molecules are labeled I and II. Both our ΔG and ΔG^‡ values are in good agreement with experimental results, as discussed in our previous work [19].

To investigate if prebiotically plausible small molecules potentially present in the solution can catalyze this reaction, we replace the water molecule I with CH₃OH, CH₂(OH)₂, NH_3, or H₂S. (Replacing the water molecule II is either less or equally favorable.) Or we can replace both water molecules I and II with a carboxylic acid moiety, as shown in Figure 2B with formic acid. This optimal transition state structure is geometrically similar to the eight-center water-only transition state in Figure 2A, but there are some notable differences in the bond distances. The acidic proton of HCOOH is 1.16 Å to the carbonyl-O, while the corresponding distance is 1.28 Å for the uncatalyzed reaction. The hydrating water molecule has a longer C…O distance of 1.80 Å instead of 1.69 Å in the water-only transition state. These are consistent with an acid-catalyzed mechanism where the proton transfer proceeds ahead of the nucleophilic addition of the reactant water to CH₂O.

From the table in Figure 2C, we see that CH₃OH, CH₂(OH)_2, and H₂S have no catalytic efficacy for formaldehyde hydration. The barriers are either similar to or greater than the uncatalyzed ΔG^‡ of 13.2 kcal. As a proton transfer agent, NH₃ substantially reduces the barrier. For the four carboxylic acids tested, the barrier is approximately halved. This fits well with NMR studies on the hydration and dehydration of formaldehyde [37], where the observed exchange rate constants (k_ex = k_hyd + k_dehyd) were ~16 s⁻¹ at pH 7.4 and range from ~4 s⁻¹ in the pH 3–6 range (barriers are not provided in that work, but since ln 16 = 2.77 and ln 4 = 1.39, one might surmise that the barrier is approximately half in the acidic pH range). The order of our calculated ΔG^‡ for the acids is likely not significant, given the computational error, and our later results will see some swaps in the order. Thus, we cannot relate the pK_a or an acid to its catalytic ability in this reaction. (The NMR results also do not show much distinction in the pH 3–6 range.)

Our results are consistent with experimental observations that under neutral or acidic conditions, in addition to CH₂(OH)₂, formaldehyde can oligomerize via C–O bond formation to form acetals, hemiacetals, and oxane rings [4] at higher concentrations of CH₂O. This begins with the dimerization of formaldehyde to form the hemiacetal.

CH₂O + CH₂(OH)₂ → HOCH₂OCH₂OH ΔG = −4.0 kcal

But since this is slightly less exergonic than formaldehyde hydration, at least under the “dilute” conditions of our computational protocol, the equilibrium will favor monomers, and the dimerization of CH₂(OH)₂ would be marginally endergonic.

2 CH₂(OH)₂ → HOCH₂OCH₂OH + H₂O ΔG = +1.2 kcal

The formation of the six-membered trioxane (CH₂O)₃ and larger oxane rings is also endergonic relative to monomeric aqueous CH₂(OH)₂, as shown in our previous work [19].

In commercial use, methanol is typically added to aqueous formaldehyde solutions to prevent higher oligomers from forming. From our calculations,

CH₂(OH)₂ + CH₃OH → CH₃OCH₂OH + H₂O ΔG = −1.4 kcal

CH₃OCH₂OH + CH₃OH → CH₃OCH₂OCH₃ + H₂O ΔG = −1.4 kcal

and thus, methanol acts as a capping group by favorably forming acetals. This suggests that the effective concentration of formaldehyde in aqueous solution can be lowered when a sequestering agent such as CH₃OH is present. Since CH₃OH is produced in the Cannizzaro reaction (see next subsection), we expect it to be present in prebiotic mixtures containing CH₂O. We will consider the sequestering of the aldehydes in more detail later in this article.

3.2. The C₁ Cannizzaro Reaction

Formaldehyde can disproportionate into methanol and formic acid. The reaction is thermodynamically favorable and catalyzed under alkaline conditions [5].

CH₂O + CH₂(OH)₂ → HCOOH + CH₃OH ΔG = −19.6 kcal

The lowest energy transition state had an uncatalyzed barrier of ΔG^‡ = 25.8 kcal and did not require additional explicit water molecules (adding them increased ΔG^‡). The hydride is transferred directly between the two carbons as the proton hops between the oxygens, as shown in Figure 3A. A catalyst molecule can be added in two possible positions: between the two carbons to facilitate the hydride transfer or between the carbonyl-O of CH₂O and the hydroxyl of CH₂(OH)₂ to facilitate proton transfer. In all cases, the latter situation was favored.

Our calculated ΔG^‡ values are shown in Figure 3B. Once again, we find that alcohols and H₂S do not catalyze the reaction; the barriers are in the 30–34 kcal range (similar to adding one explicit water molecule where ΔG^‡ is 29.5 kcal). The carboxylic acids and NH₃ lower the barrier by 4–8 kcal, a similar range to what we found in formaldehyde hydration. When NH₃ is a catalyst, the bond distances suggest that it abstracts a proton from CH₂(OH)₂ early in the process (the O…H distance is 1.39 Å while the forming N–H bond has already shortened to 1.14 Å). The hydride is being transferred (C…H distances of 1.31 and 1.40 Å), but no proton transfer is yet observed from NH₃ to the carbonyl-O of CH₂O. This mimics a base-catalyzed reaction. It also suggests that one way to get around the difficulty of explicitly including hydroxide (in an anionic transition state) is to utilize NH₃ as a proxy.

The structure with HCOOH is interesting. The hydride is being transferred (C…H distances of 1.32 and 1.36 Å) in concert with the proton transfer between HCOOH and CH₂O (O…H distances of 1.22 and 1.18 Å). The abstraction of the proton from CH₂(OH)₂ is only barely beginning (the O–H bond has only marginally lengthened to 1.05 Å). Is this an acid-catalyzed reaction? Probably not; it looks mostly like the water-only transition state, except that HCOOH facilitates the process and lowers ΔG^‡. This illustrates why we should be cautious in interpreting our transition states and barriers. Molecular HCOOH acts as a proxy catalyst by lowering the barrier, but this does not imply an acid-catalyzed reaction. While Cannizzaro reactions are catalyzed by a strong base, we have not found any reference to acid catalysis, although we find one reference of the uncatalyzed reaction in aqueous solution proceeding at high temperature and high pressure [38].

In a buffered pH of ~7, NH₃ exists primarily as ammonium, and HCOOH is in its carboxylate form in solution. Thus, we emphasize that in our calculations, molecular NH₃ and HCOOH are acting as proxy catalysts. If there was a microenvironment in which the reactants and our neutral molecular “catalysts” are present, and they collide in the right orientation with sufficient energy, then the barrier might be lower in that instance. But in bulk quantities in an actual experiment at a particular pH, these specific molecules might not be catalytic. Thus, looking at transition state structures is important; only then can we see that HCOOH is acting as a proxy for an acid-catalyst in formaldehyde hydration but not in the Cannizzaro reaction. Conversely, NH₃ is acting as a proxy for a base-catalyst in the Cannizzaro reaction but not in formaldehyde hydration. Both molecules lead to lowered ΔG^‡ in both reactions. We will revisit this issue in our concluding remarks.

3.3. Glycolaldehyde Reactions: Enolization, Aldol, Hydration, and Cannizzaro Reactions

Glycolaldehyde is the linchpin molecule of the smallest autocatalytic cycle in the formose reaction. Access to larger sugars comes from the feasibility of aldol additions; thus, the simplest aldol addition involves glycolaldehyde reacting with formaldehyde to form the C₃ sugar glyceraldehyde. We will only consider the formation of D-glyceraldehyde. Its mirror image would have the same free energy; thus, the thermodynamics do not change (and we expect equal amounts of D and L sugars to be formed in this scenario, as observed experimentally in the formose reaction if no chiral catalysts are added). Transition states for aldol addition to form the enantiomers are also mirror images, so the barriers should also be equal. The aldol addition is feasible because glycolaldehyde is enolizable. Thus, we first examine the enolization reaction.

The enolization reaction has ΔG = +6.8 kcal, and the uncatalyzed barrier is 24.3 kcal. As shown in Figure 4A, the optimal 10-center transition state utilizes three water molecules. The C–H bond breaks first (C…H is 1.68 Å), and the waters at positions II and III resemble a H₅O₂⁺ moiety. At the other end (involving water at position I), the O…H is just starting to form at 1.45 Å. Placing the alcohols and H₂S in position I gives the lowest barriers, but just like before, these have higher ΔG^‡, as shown in Figure 4B. The optimal position for carboxylic acids is to replace waters II and III. The C–H bond has not stretched as far (at 1.50 Å), while at the other end, the proton transfer is midway (O…H distances are 1.22 and 1.18 Å). The ΔG^‡ values are similar to the water case. If NH₃ replaced water in position I, the results are similar (with a marginally lower ΔG^‡), but the barrier is significantly lowered (to 12.2 kcal) if NH₃ is in position III. The C…H distance is 1.54 Å, and it looks like NH₃ has mostly abstracted the hydrogen but not yet relayed it through the hydrogen bond network.

In the aldol addition, CH₂O adds to the enol of glycolaldehyde, forming a C–C bond. The product formed is glyceraldehyde. Since the enol is a transient species, our ΔG and ΔG^‡ values for the reaction are with reference to the aldehydes.

CH₂O + C₂H₄O₂ → C₃H₆O₃ ΔG = −9.4 kcal

The reaction is exergonic and remains so even if applied to the aldehyde-hydrates. Unlike the thermodynamic reversibility of oligomers forming and breaking C–O bonds, the equilibrium clearly favors C–C bond formation, as analyzed in our previous work [19]. As to the kinetics, our calculations yield ΔG^‡ = 23.6 kcal for the uncatalyzed reaction. As shown in Figure 5, the forming C…C distance is 1.91 Å. The proton transfer distances are similar to what we observed in previous reactions. There is a hydrogen bond (dashed orange line) that does not directly participate in bond breaking/forming.

Similar to previous cases, the alcohols and H₂S have either similar or higher barriers. The carboxylic acids have ΔG^‡ values in the 16–20 kcal range. From Figure 5 (HCOOH), the C…C distance is similar, while the O…H distances favor proton transfer from the acid to CH₂O over the enol-to-carbonyl proton transfer. The base-catalyzed features are more apparent with NH₃ (with a very low barrier of 10.6 kcal): A proton is pulled early from the enol to form the enolate while the C…C forming bond is more distant at 2.15 Å. The NH₄⁺ moiety has not begun to transfer a proton back to CH₂O (the H…O distance is 1.74 Å). As we saw in the Cannizzaro reaction, NH₃ acts as an effective proxy for base-catalyzed mechanisms.

At this point, we have considered four distinct reaction mechanisms: C–O bond formation (hydration), C–C bond formation (aldol), non-redox H-shifts via enolization, and redox H-shifts in the Cannizzaro disproportionation. Because the remaining reactions in this paper will recapitulate similar transition states, we will focus our attention mainly on NH₃ and HCOOH as proxy catalysts and compare these to only having water molecules as a proton shuttle.

Glycolaldehyde hydration is similar to formaldehyde hydration, except the thermodynamics are less favorable. Our calculated ΔG for hydration is +0.5 kcal. This may seem surprising because one might expect hydration to be favored and for the reaction to be exergonic in a dilute solution. Indeed, if we include concentration corrections for a 1 M aqueous solution of glycolaldehyde, equilibrium slightly favors the hydrate. We have extensively analyzed the oligomers of glycolaldehyde in solution, and our computational protocol is in excellent agreement with NMR measurements [22], but in that paper, we also discuss why reporting concentration-corrected ΔG values can be misleading in a messy system where the concentrations of different reactants, intermediates, and products can range over many orders of magnitude. Thus, in this work (and in our previous free energy maps), we report our ΔG values without concentration corrections. Our uncorrected values provide a baseline for subsequent kinetic simulations that are beyond the scope of the present work. In our previous work, the barrier for glycolaldehyde hydration was 16.1 kcal, in good agreement with experimental results [22]. In the present work, HCOOH lowers the barrier to 11.0 kcal, while NH₃ lowers it to 10.1 kcal. This is similar to what we found for CH₂O hydration discussed earlier.

Similar to formaldehyde, glycolaldehyde can also participate in Cannizzaro reactions. These may be self-disproportionation reactions (Rxn 1 in Figure 6), although more likely to be cross-Cannizzaro reactions between a C₁ and C₂ species (Rxn 2 and Rxn 3 in Figure 6) since formaldehyde (and its hydrate) is likely to be found in higher concentrations in the early stages of the formose reaction. The ΔG and ΔG^‡ values of Rxn 2 are similar to the C₁ Cannizzaro reaction discussed previously. The ΔG and ΔG^‡ values of Rxn 1 and Rxn 3 are similar but ~4–6 kcal higher than Rxn 2.

These results are not surprising from our thermodynamic map of CHO compounds [20]. Essentially, the reaction exergonicity is driven by the relative stability of CH₂O to CH₃OH (18 kcal) versus the relative stability of glycolaldehyde to ethylene glycol (13 kcal). Since the transition states are similar, Hammond’s postulate suggests we will see similar differences in the barriers. Our results do imply that once the C₁ food runs out, Cannizzaro reactions are less favored in the larger species; similar to the C₂ case, the relative stability is ~14 kcal in both the C₃ (glycolaldehyde to glycerol) and C₄ cases (erythrose to erythritol). Thus, reduction of the C₁ species (CH₂O to CH₃OH) is favored in the cross-Cannizzaro.

3.4. Glyceraldehyde Reactions: Isomerization and Aldol Addition

Glyceraldehyde favorably isomerizes into dihydroxyacetone in aqueous solution. Our calculated ΔG for the reaction is −3.5 kcal, and the uncatalyzed barrier is 24.8 kcal (very similar to the 24.3 kcal enolization barrier for glycolaldehyde). The isomerization is a two-step reaction going through an enol intermediate. Figure 7 shows the ΔG and ΔG^‡ values for enol formation. Since dihydroxyacetone is more stable than glyceraldehyde, these values are 3.0–3.5 kcal higher starting from dihydroxyacetone. The enolization barriers of glyceraldehyde with HCOOH and NH₃ are very similar to those for glycolaldehyde, as expected, since the transition states are structurally similar. Uncatalyzed, the isomerization is relatively slow, but lower barriers with the proxy catalyst NH₃ allow for more rapid interconversion. Dihydroxyacetone could be one way to sequester glyceraldehyde.

For the C₃ + C₁ → C₄ aldol addition, we had previously determined [19] that the path to forming the ketone erythrulose had lower (uncatalyzed) barriers compared to forming the aldehydes, erythrose, or threose. However, we had not considered the formation of a branched tetrose 2,3-dihydroxy-2-(hydroxymethyl)propanal, which we will refer to as DHMP. Our results are shown in Figure 8.

For erythrulose formation, ΔG for the reaction is −13.1 kcal. While this is 4 kcal more exergonic than C₂ + C₁ → glyceraldehyde, it is because erythrulose is more stable than its aldehyde counterparts erythrose and threose by 4.2 and 3.5 kcal, respectively. The water-only barrier is 21.1 kcal, or 3.3 kcal lower than C₂ + C₁ → glyceraldehyde. While we expected the barrier with the HCOOH proxy catalyst to be lower than 21 kcal, this was not the case even after trying multiple conformations. (We did find a lower barrier with CH₃COOH.) The barrier with the NH₃ proxy catalyst is lowered by 13 kcal, a similar magnitude compared to C₂ + C₁ → glyceraldehyde.

DHMP formation is less exergonic. Steric hindrance destabilizes DHMP by ~1 kcal compared to its unbranched counterparts, erythrose and threose. The barriers for water-only, HCOOH, and NH₃ closely matched what we found for C₂ + C₁ → glyceraldehyde.

Since glyceraldehyde can either convert into dihydroxyacetone or undergo aldol addition with CH₂O to form erythrulose, and in both cases, our transition states go through the C₃ enediol intermediate, there is competition between the two reactions. The aldol addition is thermodynamically favored with ΔG = −13.1 kcal compared to the isomerization with ΔG = −3.5 kcal (uphill 6.2 to the enediol, then downhill 9.7 to the ketone). The aldol reaction is also kinetically favored. From the enediol, the water-only aldol addition has ΔG^‡ = 21.1 − 6.2 = 14.9 kcal, while the enol-to-keto isomerization has ΔG^‡ = 28.3 − 9.7 = 18.6 kcal. Thus, we would expect aldol addition to outcompete enol-to-keto isomerization when CH₂O is plentiful. (NH₃ lowers the barriers for both reactions by similar amounts; thus, aldol addition is still favored.) This fits with what Appayee and Breslow found in their follow-up experiments [3] prompted by work in Benner’s group [39]; at higher concentrations of CH₂O, hardly any dihydroxyacetone is observed, but at lower concentrations, enolization can come into play.

Additionally, Appayee and Breslow [3] proposed a 1,2-hydride shift for the isomerization of glyceraldehyde to dihydroxyacetone based on deuterium labeling experiments. According to their mechanism, this conversion is catalyzed by Ca²⁺ and hydroxide ions. Because our protocol is limited to neutral species (as discussed in Section 2), we can only investigate this with neutral molecule catalysts. The optimal transition state has one mediating water molecule between the oxygens, as shown in Figure 9. The proton from the 2-hydroxy group has been mostly transferred (H…O is 1.49 Å), while the middle C…H (at 1.24 Å) is only just starting to break. This is somewhat analogous to Appayee and Breslow’s proposed mechanism, where deprotonation takes place before the deuterium transfer. However, our calculated ΔG^‡ is 30.0 kcal, which is higher than enolization. Adding HCOOH or NH₃ in place of the mediating water molecule does not lower the calculated barrier.

Because our calculations do not mimic the most productive experimental formose reaction conditions for sugar synthesis (high pH, the presence of divalent metal ions or borate, higher temperatures, and reactant concentrations), they limit the mechanisms we can explore, and we cannot always make direct comparisons to experimental results. However, our consistent protocol allows us to peg the energies of the myriad compounds that may show up in a messy prebiotic mixture. This approach has allowed us to generate a series of intersecting free energy maps under a set of baseline conditions for a wide range of prebiotic systems. And while our protocol has its limitations, we have some confidence that our calculated free energies are reasonable and can be validated, as discussed in Section 2.

3.5. Tetrose Reactions: Isomerization, Ring Closure, and Retro-Aldol Splitting

While erythrulose is the most stable of the tetrose sugars, it does not easily undergo retro-aldol splitting to form glycolaldehyde. The key step of re-forming the C₂ linchpin facilitates autocatalysis in the formose reaction. Instead, it is the aldehyde that more favorably undergoes the retro-aldol reaction; thus, erythrulose has to first isomerize into erythrose or threose. For the enolization of erythrulose, we calculate ΔG to be +9.7 kcal, essentially matching the enolization of its C₃ counterpart, dihydroxyacetone. The barriers are also similar: ΔG^‡ values are 29.4, 28.7, and 13.0 kcal for the water-only, HCOOH, and NH₃ proxy catalysts. (See Figure 7 for the counterpart C₃ values).

The linear C₄ aldehydes can potentially be sequestered by ring closure reactions and thus less susceptible to retro-aldol reactions. We previously calculated the free energy changes for both D-erythrose and D-threose into their α- and β-furanoses [19]; the linear tetroses and their furanoses are 1 kcal or less apart, so we expect both to be present at equilibrium. Since ring closure and opening resemble hydration and dehydration, it is not surprising that the barriers are similar in magnitude.

We had not considered the retro-aldol C₄ → C₂ + C₂ in our previous work, so we will examine it more carefully here. Our results are shown in Figure 10. The retro-aldol splitting is marginally endergonic. Since D-threose is more stable than D-erythrose by 0.7 kcal, the retro-aldol of D-threose is 0.7 kcal more endergonic, and its corresponding barriers are also marginally higher. At over 30 kcal/mol for the water-only barriers, the uncatalyzed retro-aldol is expected to be quite slow at room temperature. HCOOH and NH₃ proxy catalysts lower ΔG^‡, but these values are still over 20 kcal/mol. It is not surprising that the retro-aldol (being endergonic) has higher barriers than the (exergonic) aldol additions.

The three retro-aldol transition states for D-erythrose are shown in Figure 11. For water-only, the breaking C–C bond at 1.75 Å has not extended very far, while the O…H distances suggest that the proton hops between water and erythrose have proceeded substantially. For HCOOH, the acid proton has mostly hopped to the aldehyde carbonyl (O–H is 1.08 Å), and the breaking C–C bond is substantially longer at 2.24 Å. For NH₃, the aldehyde carbonyl is starting to grab a proton (O…H is 1.32 Å), and that is because NH₃ has already grabbed a proton (N–H is 1.06 Å) from the neighboring water, which has also grabbed its proton (O–H is 1.03 Å) from the alcohol group on erythrose.

3.6. Sequestering and Parasitic Reactions

Earlier, we discussed how CH₃OH, produced in the C₁ Cannizzaro reaction, could sequester CH₂O via acetal formation. In this section, we examine possible reactions that may either sequester or remove “food” or intermediates from the smallest autocatalytic cycle. Most of our analysis will focus on the C₁ compounds, and we will discuss extrapolations to the C₂, C_3, and C₄ sugars in the cycle.

We saw earlier that the addition of water to CH₂O to form the hydrate CH₂(OH)₂ was exergonic 4.6 kcal with a barrier of 13.2 kcal. What if instead of acting as proxy catalysts, NH₃ or H₂S could add to CH₂O in a similar way? (The transition states both include two additional water molecules facilitating proton transfer.)

CH₂O + NH₃ → CH₂(OH)(NH₂) ΔG = −4.6 kcal ΔG^‡ = 3.9 kcal

CH₂O + H₂S → CH₂(OH)(SH) ΔG = −3.9 kcal ΔG^‡ = 12.1 kcal

Both reactions are exergonic. CH₂(OH)(NH₂) is similar in relative energy to the hydrate, while CH₂(OH)(SH) is higher by 0.7 kcal. Since the barrier for NH₃ addition is significantly lower in the presence of NH₃, we expect NH₃ to favorably react with aldehydes if present in the solution. H₂S is marginally disfavored and, unless present in high concentration, is less likely to be found added to CH₂O. The dominant species in aqueous solution is still CH₂(OH)₂.

Since CH₃OH may be generated from Cannizzaro reactions, NH₃ and H₂S could potentially act as attacking nucleophiles in an S_N2 reaction.

CH₃OH + NH₃ → CH₃NH₂ + H₂O ΔG = −2.8 kcal ΔG^‡ = 32.2 kcal

CH₃OH + H₂S → CH₃SH + H₂O ΔG = −8.1 kcal ΔG^‡ = 20.6 kcal

Both reactions are exergonic, but the barriers are higher. Recall from Section 2 that our S_N2 transition states require a daisy chain of 3–4 additional water molecules, and in our experience, our calculated barriers are artificially high. (We calculate ΔG^‡ = 43.3 kcal for the self-barrier for CH₃OH + H₂O.) In a dilute aqueous solution, the equilibrium may shift toward hydrolyzing CH₃NH₂, but the exergonicity of CH₃SH formation suggests that it might persist in solution as a potential reactant. The lower ΔG^‡ for H₂S addition suggests that CH₃SH formation might be quite facile in a messy formose system spiked with H₂S.

Since HCOOH may also be present from Cannizzaro reactions, nucleophilic addition of NH₃, H₂S, and CH₃OH (with subsequent elimination of water as a leaving group) could respectively lead to amides, thioacids, and esters. (Since this is a two-step reaction, we report ΔG^‡ for the rate-determining step.)

HCOOH + NH₃ → HC(O)NH₂ + H₂O ΔG = –5.4 kcal ΔG^‡ = 24.1 kcal

HCOOH + H₂S → HC(O)SH + H₂O ΔG = +10.0 kcal ΔG^‡ = 33.1 kcal

HCOOH + CH₃OH → HCO₂CH₃ + H₂O ΔG = –0.4 kcal ΔG^‡ = 26.3 kcal

Amide formation is exergonic, as expected, and the barrier is moderate. Once amides are formed, their hydrolysis (with a barrier of ~30 kcal) is slower. Formamide, the C₁ product of this reaction, is prominent in prebiotic chemistry scenarios [40,41]. We have previously computed the route to formamide via the hydrolysis of HCN, which is more exergonic but has a higher barrier [42]. On the other hand, forming the thioacid is rather endergonic, and this contributes to its higher barrier. From our recent work mapping the free energies of small CHOS compounds [43], we find that thioester formation is less endergonic (6–7 kcal), and thus, coupled with the stability of CH₃SH as a reactant, a feasible route to thioesters may be envisioned. With regard to ester formation, our calculated ΔG close to zero suggests that in a relatively dilute aqueous solution, the equilibrium favors hydrolysis.

While C₁ food is plentiful, the possible presence of CH₂(OH)(NH₂) and CH₂(OH)(SH) in solution opens up the possibility of additional Cannizzaro reactions from the favorable reduction in CH₂O to CH₃OH.

CH₂O + CH₂(OH)(NH₂) → HC(O)NH₂ + CH₃OH ΔG = –25.0 kcal ΔG^‡ = 23.3 kcal

CH₂O + CH₂(OH)(SH) → HC(O)SH + CH₃OH ΔG = –10.2 kcal ΔG^‡ = 28.9 kcal

In this third route to formamide, the reaction exergonicity doubly benefits from the stability of the amide and the reduction to CH₃OH. The barrier is similar to the self-disproportionation of formaldehyde and its hydrate, which was reported earlier. Unlike the aforementioned route to the thioacid, the Cannizzaro reaction, in this case, is exergonic, although it also has a high barrier. (This suggests a promising route for thioester formation, and we are generating several energy maps utilizing CH₃SH as a reactant and expect to present those results in our next publication.)

Looking at this small subset of reactions suggests that the possibilities multiply exponentially as we go beyond C₁ compounds. For the C₂ compounds, besides glycolaldehyde and its hydrate, we might see glycolic acid and ethylene glycol as first-order Cannizzaro products, but subsequent redox reactions might lead to acetic acid and ethanol. All these compounds can react with each other (and with NH₃, H₂S, CH₃NH₂, CH₃SH, etc.), leading to a plethora of compounds—a very messy system! We expect to pursue some of the promising reactions in the near future, but since we have been examining the smallest autocatalytic cycle, we close this section by presenting our results on the thermodynamics for adding water, NH₃, and H₂S to the carbonyl group of the C₂, C_3, and C₄ sugars.

The results in Table 1 suggest that reaction with NH₃ can potentially sequester the aldehydes, but the reaction is reversible, and the barriers are likely to be very low (as we saw for NH₃ adding to CH₂O). H₂S is less effective as a sequestering agent. For the ketones, the addition reactions are all endergonic, and we do not expect sequestration to be a factor. If the aldehyde-NH₃ adducts subsequently participate in exergonic Cannizzaro reactions, these are unlikely to be reversible and, therefore, parasitic on the cycle. While the barriers are expected to be significant (23 kcal in the case of C₁), they are not insurmountable at moderate to higher temperatures.

4. Overall Free Energy Map and Concluding Remarks

In the previous section, we discussed the reactions of C₁, C₂, C_3, and C₄ “sugars” piecemeal with ΔG and ΔG^‡ values for each individual reaction. In this section, we will look at the overall free energy map. With multiple pathways, we have found it easier to present our results, not along a particular reaction coordinate but with a bird’s-eye-view schematic, as shown in Figure 12. For this to be effective, we need to choose a set of reference compounds and assign them relative free energies of zero. Our reference compounds will be CH₂O, H₂O, NH_3, and H₂S.

The numbers in black and colored fonts are for stable minima and transition states, respectively. The G value for CH₂O in the center of Figure 12 is zero. At a glance, you can see the relative stability of each compound with respect to the reference states. Making new C–C bonds via aldol additions is significantly exergonic. For example, the reaction of glycolaldehyde (G = −16.2) with formaldehyde to form glyceraldehyde (G = −25.6 kcal) has a ΔG of −25.6 − (−16.2 + 0) = −9.4 kcal as shown in Section 3.3. The uncatalyzed (H₂O-only) transition state (G = +7.4 in green font) tells us that the forward barrier ΔG^‡ is +7.4 − (−16.2) = +23.8 kcal while the reverse barrier would be ΔG^‡ = +7.4 − (−25.6) = +33.2 kcal. For the HCOOH catalyzed reaction (transition state G = +2.1 in dark red font), the forward barrier ΔG^‡ is +2.1 − (−16.2) = 18.3 kcal (as shown in Figure 5) while the reverse barrier would be ΔG^‡ = +2.1 − (−25.6) = +27.7 kcal.

As a second example, consider the reaction of HCOOH (G = −5.1) reacting with H₂S (G = 0, a reference compound) to form the thioacid HC(O)SH (G = +4.9). ΔG of this reaction is +4.9 − (−5.1 + 0) = +10.0 kcal. On the other hand, the reaction of HCOOH with CH₃SH (G = −27.1) to form the thioester HC(O)SCH₃ (G = −26.9) has ΔG = −26.9 − (−5.1 −27.1) = +5.3 kcal. Hence, forming the thioester is less endergonic than forming the thioacid, as discussed in Section 3.6. Except for the exergonic formation of amides, the sequestering reactions (top half of Figure 12) have ΔG close to zero or slightly endergonic.

Figure 12 is only a very small part of what may take place in a messy formose reaction if NH₃ and H₂S are present. All the reactions in the top half only show reacting C₁ species, but because the formose reaction generates C₂, C_3, and C₄ aldehydes, ketones, alcohols, and carboxylic acids (the latter two via Cannizzaro reactions), these can further react with each other and with NH₃ and H₂S. While we do not have all these calculations completed, what we have presented thus far allows us to make reasonable first approximations of both the thermodynamics and kinetics of these myriad reactions. We expect to set up kinetic simulations to explore potential product distributions in this complex system in a future publication when we have accumulated more data.

Figure 12 also summarizes most of the results we have discussed in the previous section, and we hope this has provided the reader with a free energy “map” to see where the hills and valleys are located, how high the hills are, and how deep or shallow the valleys lie. The takeaway summary: (1) C–C bond-forming aldol reactions are thermodynamically more favorable than C–O, C–S, or C–N bond formation. (2) Kinetically, C–N bond formation has the lowest barriers. (3) Cannizzaro reactions open access to species with other redox states and are thermodynamically driven by forming the reduced (alcohol) product, although the kinetic barriers are relatively high. (4) HCOOH and NH₃ can be used as proxy catalysts in computational studies; looking at the transition states gives us a sense of whether a reaction may be acid- or base-catalyzed, but we should be cautious not to over-interpret the results. (5) The exponentially growing set of compounds in the formose reaction includes many opportunities for some intermediates to act as catalysts, sequestering agents, or even parasites of the autocatalytic cycle. Our study only touches a very small part of the initial steps that may occur, but we hope it serves as a guide for future studies.

A big origins-of-life question is how and if an autocatalytic cycle of (sugar) aldehydes evolved into a proto-metabolic cycle of carboxylic acids analogous to the reverse TCA cycle. Kinetically, the main impact of HCOOH is catalyzing the disproportionation of aldehydes to form more acids, parasitizing the formose cycle, while furnishing acids via Cannizzaro reactions toward constructing an analogous cycle of carboxylic acid proto-metabolites. HCOOH has little impact on the enolization and aldol additions required to form larger compounds. On the other hand, NH₃ lowers the barriers of all reactions and acts as an indiscriminate catalyst. By doing so, it allows thermodynamics to play a role in favoring the formation of larger molecules (potential biomass) via new C–C bonds in aldol additions. Thermodynamically, amide bonds are also more stable. This may hint at some roles of nitrogen in extant life today; we find them in biocatalytic proteins or in secondary metabolites, but they are not metabolites in the core pathways, such as glycolysis and the TCA cycle. The eventual evolution to a carboxylic acid cycle with higher barriers now requires catalysts to proceed (because aldol reactions are no longer as feasible kinetically), thus leading to selectivity and control in metabolism.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/life14080933/s1, Table S1: Energy breakdown of molecules and transition states.

Author Contributions

Conceptualization, J.K.; methodology, J.K.; validation, J.K. and L.P.T.; formal analysis, J.K. and L.P.T.; investigation, J.K. and L.P.T.; data curation, J.K. and L.P.T.; writing—original draft preparation, J.K. and L.P.T.; writing—review and editing, J.K.; visualization, J.K. and L.P.T.; supervision, J.K.; project administration, J.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in Supplementary Materials.

Acknowledgments

This research was supported by the University of San Diego; J.K. acknowledges a University Professor Award from the university for funding. Shared computing facilities were provided by the saber3 computing cluster at the University of San Diego.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Xavier, J.C.; Hordijk, W.; Kauffman, S.; Steel, M.; Martin, W.F. Autocatalytic chemical networks at the origin of metabolism. Proc. Biol. Sci. 2020, 287, 20192377. [Google Scholar] [CrossRef] [PubMed]
Breslow, R. On the Mechanism of the Formose Reaction. Tetrahedron Lett. 1959, 1, 22–26. [Google Scholar] [CrossRef]
Appayee, C.; Breslow, R. Deuterium studies reveal a new mechanism for the formose reaction involving hydride shifts. J. Am. Chem. Soc. 2014, 136, 3720–3723. [Google Scholar] [CrossRef] [PubMed]
Mizuno, T.; Weiss, A.H. Synthesis and Utilization of Formose Sugars. Adv. Carbohydr. Chem. Biochem. 1974, 29, 173–227. [Google Scholar] [CrossRef]
Omran, A.; Menor-Salvan, C.; Springsteen, G.; Pasek, M. The Messy Alkaline Formose Reaction and Its Link to Metabolism. Life 2020, 10, 125. [Google Scholar] [CrossRef] [PubMed]
Mueller, D.; Pitsch, S.; Kittaka, A.; Wagner, E.; Wintner, C.E.; Eschenmoser, A. Chemistry of a-aminonitriles. Aldomerization of glycolaldehyde phosphate to rac-hexose 2,4,6-triphosphates and (in presence of formaldehyde) rac-pentose 2,4-diphosphates: Rac-allose 2,4,6-triphosphate and rac-ribose 2,4-diphosphate are the main reaction products. Helv. Chim. Acta 1990, 73, 1410–1468. [Google Scholar] [CrossRef]
Ricardo, A.; Carrigan, M.A.; Olcott, A.N.; Benner, S.A. Borate minerals stabilize ribose. Science 2004, 303, 196. [Google Scholar] [CrossRef] [PubMed]
Zubay, G. Studies on the lead-catalyzed synthesis of aldopentoses. Orig. Life Evol. Biosph. 1998, 28, 13–26. [Google Scholar] [CrossRef] [PubMed]
Robinson, W.E.; Daines, E.; van Duppen, P.; de Jong, T.; Huck, W.T.S. Environmental conditions drive self-organization of reaction pathways in a prebiotic reaction network. Nat. Chem. 2022, 14, 623–631. [Google Scholar] [CrossRef]
Van Duppen, P.; Daines, E.; Robinson, W.E.; Huck, W.T.S. Dynamic Environmental Conditions Affect the Composition of a Model Prebiotic Reaction Network. J. Am. Chem. Soc. 2023, 145, 7559–7568. [Google Scholar] [CrossRef]
Bris, A.; Baltussen, M.G.; Tripodi, G.L.; Huck, W.T.S.; Franceschi, P.; Roithova, J. Direct Analysis of Complex Reaction Mixtures: Formose Reaction. Angew. Chem. Int. Ed. 2024, 63, e202316621. [Google Scholar] [CrossRef] [PubMed]
Paschek, K.; Kohler, K.; Pearce, B.K.D.; Lange, K.; Henning, T.K.; Trapp, O.; Pudritz, R.E.; Semenov, D.A. Possible Ribose Synthesis in Carbonaceous Planetesimals. Life 2022, 12, 404. [Google Scholar] [CrossRef] [PubMed]
Omran, A. Plausibility of the Formose Reaction in Alkaline Hydrothermal Vent Environments. Orig. Life Evol. Biosph. 2023, 53, 113–125. [Google Scholar] [CrossRef] [PubMed]
Vinogradoff, V.; Leyva, V.; Mates-Torres, E.; Pepino, R.; Danger, G.; Rimola, A.; Cazals, L.; Serra, C.; Pascal, P.; Meinert, C. Olivine-catalyzed glycolaldehyde and sugar synthesis under aqueous conditions: Application to prebiotic chemistry. Earth Planet Sci. Lett. 2024, 626, 118558. [Google Scholar] [CrossRef]
Haas, M.; Lamour, S.; Christ, S.B.; Trapp, O. Mineral-mediated carbohydrate synthesis by mechanical forces in a primordial geochemical setting. Commun. Chem. 2020, 3, 140. [Google Scholar] [CrossRef] [PubMed]
Rappoport, D.; Galvin, C.J.; Zubarev, D.Y.; Aspuru-Guzik, A. Complex Chemical Reaction Networks from Heuristics-Aided Quantum Chemistry. J. Chem. Theory Comput. 2014, 10, 897–907. [Google Scholar] [CrossRef] [PubMed]
Simm, G.N.; Reiher, M. Context-Driven Exploration of Complex Chemical Reaction Networks. J. Chem. Theory Comput. 2017, 13, 6108–6119. [Google Scholar] [CrossRef] [PubMed]
Roszak, R.; Wolos, A.B.M.; Glen, L.; Konka, J.; Jensen, P.; Burgchardt, P.; Zadlo-Dobrowolska, A.; Janiuk, P.; Szymkuc, S.; Grzybowski, B.A. Emergence of metabolic-like cycles in blockchain-orchestrated reaction networks. Chem 2024, 10, 952–970. [Google Scholar] [CrossRef]
Kua, J.; Avila, J.E.; Lee, C.G.; Smith, W.D. Mapping the Kinetic and Thermodynamic Landscape of Formaldehyde Oligomerization under Neutral Conditions. J. Phys. Chem. A 2013, 117, 12658–12667. [Google Scholar] [CrossRef] [PubMed]
Kua, J.; Hernandez, A.L.; Velasquez, D.N. Thermodynamics of Potential CHO Metabolites in a Reducing Environment. Life 2021, 11, 1025. [Google Scholar] [CrossRef] [PubMed]
Kua, J.; Hanley, S.W.; De Haan, D.O. Thermodynamics and Kinetics of Glyoxal Dimer Formation: A Computational Study. J. Phys. Chem. A 2008, 112, 66–72. [Google Scholar] [CrossRef] [PubMed]
Kua, J.; Galloway, M.M.; Millage, K.D.; Avila, J.E.; De Haan, D.O. Glycolaldehyde Monomer and Oligomer Equilibria in Aqueous Solution: Comparing Computational Chemistry and NMR Data. J. Phys. Chem. A 2013, 117, 2997–3008. [Google Scholar] [CrossRef] [PubMed]
Vosko, S.H.; Wilk, L.; Nusair, M. Accurate spin-dependent electron liquid correlation energies for local spin density calculations: A critical analysis. Can. J. Phys. 1980, 58, 1200–1211. [Google Scholar] [CrossRef]
Becke, A.D. Density-functional exchange-energy approximation with correct asymptotic behavior. Phys. Rev. A 1988, 38, 3098–3100. [Google Scholar] [CrossRef] [PubMed]
Lee, C.; Yang, W.; Parr, R.G. Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density. Phys. Rev. B 1988, 37, 785–789. [Google Scholar] [CrossRef] [PubMed]
Becke, A.D. Density-functional thermochemistry. III. The role of exact exchange. J. Chem. Phys. 1993, 98, 5648–5652. [Google Scholar] [CrossRef]
Halgren, T.A. MMFF VII. Characterization of MMFF94, MMFF94s, and other widely available force fields for conformational energies and for intermolecular-interaction energies and geometries. J. Comput. Chem. 1999, 20, 730–748. [Google Scholar] [CrossRef]
Warshel, A.; Florian, J. Computer simulations of enzyme catalysis: Finding out what has been optimized by evolution. Proc. Natl. Acad. Sci. USA 1998, 95, 5950–5955. [Google Scholar] [CrossRef] [PubMed]
Wiberg, K.B.; Bailey, W.F. Chiral diamines 4: A computational study of the enantioselective deprotonation of Boc-pyrrolidine with an alkyllithium in the presence of a chiral diamine. J. Am. Chem. Soc. 2001, 123, 8231–8238. [Google Scholar] [CrossRef] [PubMed]
Nielsen, R.J.; Keith, J.M.; Stoltz, B.M.; Goddard, W.A., 3rd. A computational model relating structure and reactivity in enantioselective oxidations of secondary alcohols by (-)-sparteine-Pd(II) complexes. J. Am. Chem. Soc. 2004, 126, 7967–7974. [Google Scholar] [CrossRef] [PubMed]
Deubel, D.V.; Lau, J.K. In silico evolution of substrate selectivity: Comparison of organometallic ruthenium complexes with the anticancer drug cisplatin. Chem. Commun. 2006, 23, 2451–2453. [Google Scholar] [CrossRef] [PubMed]
Wertz, D.H. Relationship between the gas-phase entropies of molecules and their entropies of solvation in water and 1-octanol. J. Am. Chem. Soc. 1980, 102, 5316–5322. [Google Scholar] [CrossRef]
Abraham, M.H. Relationship between solution entropies and gas phase entropies of nonelectrolytes. J. Am. Chem. Soc. 1981, 103, 6742–6744. [Google Scholar] [CrossRef]
Krizner, H.E.; De Haan, D.O.; Kua, J. Thermodynamics and Kinetics of Methylglyoxal Dimer Formation: A Computational Study. J. Phys. Chem. A 2009, 113, 6994–7001. [Google Scholar] [CrossRef] [PubMed]
Kua, J.; Paradela, T.L. Early Steps of Glycolonitrile Oligomerization: A Free-Energy Map. J. Phys. Chem. A 2020, 124, 10019–10028. [Google Scholar] [CrossRef] [PubMed]
Kua, J.; Miller, A.S.; Wallace, C.E.; Loli, H. Role of Acid in the Co-oligomerization of Formaldehyde and Pyrrole. ACS Omega 2019, 4, 22251–22259. [Google Scholar] [CrossRef] [PubMed]
Rivlin, M.; Eliav, U.; Navon, G. NMR studies of the equilibria and reaction rates in aqueous solutions of formaldehyde. J. Phys. Chem. B 2015, 119, 4479–4487. [Google Scholar] [CrossRef] [PubMed]
Tsujino, Y.; Wakai, C.; Matubayashi, N.; Nakahara, M. Noncatalytic Cannizzaro-type reaction of formaldehyde in hot water. Chem. Lett. 1999, 28, 287–288. [Google Scholar] [CrossRef]
Kim, H.J.; Ricardo, A.; Illangkoon, H.I.; Kim, M.J.; Carrigan, M.A.; Frye, F.; Benner, S.A. Synthesis of Carbohydrates in Mineral-Guided Prebiotic Cycles. J. Am. Chem. Soc. 2011, 133, 9457–9468. [Google Scholar] [CrossRef]
Saladino, R.; Crestini, C.; Pino, S.; Costanzo, G.; Di Mauro, E. Formamide and the origin of life. Phys. Life Rev. 2012, 9, 84–104. [Google Scholar] [CrossRef] [PubMed]
Bizzarri, B.M.; Saladino, R.; Delfino, I.; Garcia-Ruiz, J.M.; Di Mauro, E. Prebiotic Organic Chemistry of Formamide and the Origin of Life in Planetary Conditions: What We Know and What Is the Future. Int. J. Mol. Sci. 2021, 22, 917. [Google Scholar] [CrossRef] [PubMed]
Kua, J.; Thrush, K.L. HCN, Formamidic Acid, and Formamide in Aqueous Solution: A Free Energy Map. J. Phys. Chem. B 2016, 120, 8175–8185. [Google Scholar] [CrossRef] [PubMed]
Kua, J.; Miller, N.A. Preliminary Free Energy Map of Prebiotic Compounds Formed from CO₂, H₂ and H₂S. Life 2022, 12, 1763. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Core autocatalytic cycle of the formose reaction.

Figure 2. Transition state structures (A,B) and barriers (C) for formaldehyde hydration.

Figure 3. Transition state structures (A) and barriers (B) for the C₁ Cannizzaro reaction.

Figure 4. Transition state structures (A) and barriers (B) for the C₁ + C₂ → C₃ aldol reaction.

Figure 5. Transition state structures and barriers for the C₁ + C₂ → C₃ aldol reaction.

Figure 6. Cannizzaro reactions of glycolaldehyde.

Figure 7. Enolization of glyceraldehyde and dihydroxyacetone. ΔG and ΔG^‡ values in kcal.

Figure 8. C₃ + C₁ → C₄ aldol addition. ΔG and ΔG^‡ values in kcal.

Figure 9. Transition state of 1,2-hydride shift for glyceraldehyde-dihydroxyacetone isomerization.

Figure 10. C₄ → C₂ + C₂ retro-aldol splitting. ΔG and ΔG^‡ values in kcal.

Figure 11. Transition state structures for retro-aldol splitting.

Figure 12. Overall free energy map with G values in kcal relative to CH₂O, H₂O, NH₃, and H₂S.

Table 1. Thermodynamics of addition reactions to sugar carbonyls.

Sugar	Added Molecule	ΔG (kcal)
Glycolaldehyde	H₂O	+0.4
	NH₃	−0.4
	H₂S	+2.6
Glyceraldehyde	H₂O	−0.1
	NH₃	0.0
	H₂S	+2.2
Dihydroxyacetone	H₂O	+2.6
	NH₃	+3.7
	H₂S	+7.2
Erythrose	H₂O	−0.4
	NH₃	−0.7
	H₂S	+2.8
Threose	H₂O	+1.1
	NH₃	+0.1
	H₂S	+3.5
Erythrulose	H₂O	+3.8
	NH₃	+2.8
	H₂S	+6.0

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kua, J.; Tripoli, L.P. Exploring the Core Formose Cycle: Catalysis and Competition. Life 2024, 14, 933. https://doi.org/10.3390/life14080933

AMA Style

Kua J, Tripoli LP. Exploring the Core Formose Cycle: Catalysis and Competition. Life. 2024; 14(8):933. https://doi.org/10.3390/life14080933

Chicago/Turabian Style

Kua, Jeremy, and L. Philip Tripoli. 2024. "Exploring the Core Formose Cycle: Catalysis and Competition" Life 14, no. 8: 933. https://doi.org/10.3390/life14080933

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Exploring the Core Formose Cycle: Catalysis and Competition

Abstract

1. Introduction

2. Computational Methods

3. Results and Discussion

3.1. Formaldehyde Hydration

3.2. The C₁ Cannizzaro Reaction

3.3. Glycolaldehyde Reactions: Enolization, Aldol, Hydration, and Cannizzaro Reactions

3.4. Glyceraldehyde Reactions: Isomerization and Aldol Addition

3.5. Tetrose Reactions: Isomerization, Ring Closure, and Retro-Aldol Splitting

3.6. Sequestering and Parasitic Reactions

4. Overall Free Energy Map and Concluding Remarks

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Exploring the Core Formose Cycle: Catalysis and Competition

Abstract

1. Introduction

2. Computational Methods

3. Results and Discussion

3.1. Formaldehyde Hydration

3.2. The C1 Cannizzaro Reaction

3.3. Glycolaldehyde Reactions: Enolization, Aldol, Hydration, and Cannizzaro Reactions

3.4. Glyceraldehyde Reactions: Isomerization and Aldol Addition

3.5. Tetrose Reactions: Isomerization, Ring Closure, and Retro-Aldol Splitting

3.6. Sequestering and Parasitic Reactions

4. Overall Free Energy Map and Concluding Remarks

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.2. The C₁ Cannizzaro Reaction