The metabolic processes in living organisms inherently possess stochastic elements and intrinsic noise, rendering the attainment of a strictly analytical or precisely deterministic solution a formidable challenge in the realm of biomathematics. In response to this complexity, biologists and mathematicians frequently employ kinetics and statistics as powerful tools to establish mathematical models that encapsulate the dynamics of biochemical reactions [1]. In the context of elucidating the operational intricacies of the formaldehyde metabolism pathways and NCD synthesis within E. coli, and, more significantly, in comprehending the ramifications of alterations in the efficiency and activity of the enzymes governing these processes on the system's overall behaviour, we advocate for an in-depth, bottom-up approach. A kinetic modelling framework, grounded in the metabolic pathway, is essential for scrutinizing this dynamic system.
In the examination of the NCD-dependent formaldehyde metabolic system, it is imperative to adopt a biochemist's perspective. Complex biochemical networks are conventionally employed to represent intricate metabolic pathways, and herein, we provide a structural formula-based representation of the NCD-dependent formaldehyde pathway (Fig. 1). This figurative framework enables us to dissect the system's progression systematically. The influx of formaldehyde into the cellular matrix is driven by concentration gradients, and, guided by our reengineered metabolic pathway, it undergoes a transformation within the cell, effectively functioning as a formaldehyde processing conduit, akin to a specialized pipeline.
In a biochemical context, the progression of formaldehyde metabolism unfolds as follows:
Exogenous Formaldehyde Passive Diffusion: The initial stage entails the passive diffusion of formaldehyde (HCHO), a small hydrophilic molecule, across the cell membrane driven by concentration gradients. This diffusion process is reversible and is distinguished by subscripts ex
and in
denoting the formaldehyde's presence in distinct cellular compartments.
Oxidation of Cellular Formaldehyde: Following diffusion, formaldehyde encounters oxidation catalyzed by formaldehyde dehydrogenase (FADH). This catalytic reaction results in the conversion of hazardous formaldehyde into formate. Notably, the non-natural cofactor NCD, an artificial analog of NAD, plays a pivotal role as an electroreceptor in facilitating this reaction. FADH catalyzes this reaction, which can be represented as a reversible process.
A structural comparison between NAD (Nicotinamide Adenine Dinucleotide) and NCD (Nicotinamide Cytosine Dinucleotide) is depicted in Fig. 2, revealing the distinctions marked by cyan and tawny rectangles. While NCD and NAD share a common dinucleotide backbone covalently linked by pyrophosphate bonds, their different functional nitrogenous bases impart varying biochemical roles in vivo [2].
Oxidation of Formate: Formate, the product of formaldehyde oxidation, undergoes further transformation into carbon dioxide, with this reaction also being dependent on NCD. This process involves an artificially mutated formate dehydrogenase (FDH*), distinguished by an asterisk, indicating differences from the wild-type FDH.
Incorporating Downstream Products: The challenge of incorporating the downstream product of formaldehyde into the existing natural metabolic pathways is addressed by guiding carbon dioxide (CO2) towards the Tricarboxylic Acid Cycle (TCA) through a series of steps facilitated by a mutant-type malic enzyme (ME*). By bibliographic retrieval, we find that this coupled reaction can be divided into two oxaloacetate-involved reactions [2,3,4]. This transformation of CO2 into malate is achieved through a progression known as the "Hydrocarboxylation of Pyruvate."
This complex reaction comprises two distinct sub-reactions:
a. Carboxylation of Pyruvate: In the first sub-reaction, pyruvate is carboxylated, yielding oxaloacetate intermediates. This step enables the immobilization of carbon dioxide.
b. Reduction of Oxaloacetate Intermediate: In the second sub-reaction, the oxaloacetate intermediate is reduced, with the artificial proton donor NCDH facilitating the reaction and regenerating the oxidized form of NCD. The primary product, malate, participates in various biochemical processes in vivo, including the TCA cycle. Significantly, this redesigned formaldehyde metabolic pathway presents an eco-friendly approach for malate production, reducing the reliance on hydrogen (H2) as seen in conventional industrial processes [4].
In the elucidation of the hydrocarboxylation reaction mechanism, we present a rational model based on a combination of structural analysis and comprehensive bibliographic research [2,3,4,5]. The process commences with pyruvate expelling a proton under the influence of a Brønsted base, leading to the conjugation of the lone pair of the γ-carbon with the β-carbonyl group. Employing resonance structures, the enolate anion isomer arises from the carbonyl anion, offering enhanced suitability for nucleophilic attacks. Subsequently, the enolate form of the pyruvate anion participates in a nucleophilic addition with carbon dioxide originating from the formaldehyde oxidation pathway, resulting in the formation of the oxaloacetate anion. This oxaloacetate anion then undergoes further reduction to yield the malate anion, all facilitated by the enzymatic action of ME*. Finally, the malate anion is protonated to attain a stable species, culminating in the hydrocarboxylation reaction (Fig. 3).
Kinetic analysis stands as a time-honoured and highly effective approach for the anticipation and computational simulation of intricate biochemical networks. In the context of our study, the rate variables within this system, e.g., formaldehyde, NCD, CO2, are no longer represented directly but have instead been replaced by a combination of concentrations for specific chemical species. This transformation allows us to embark on the crucial task of simulating the dynamic behaviour of the newly redesigned metabolic network (Fig. 4).
For a general reaction,
where
Assign law of mass action to the above system of ODE, we have
where
Fig. 6 consists of five subgraphs (a)-(e) that collectively elucidate the dynamics of various chemical species over time or other species. This metabolic system behaves in the temporal evolution of concentrations for multiple interacting species (Fig. 6a), and a notable trend emerges, where the concentration of environmental/cytoplasmic formaldehyde decreases progressively with time. This decline is juxtaposed with the behaviour of other species, which exhibit a distinctive peak-shaped pattern, except for the constitutive expressed NCD. These species show an initial increase in concentration, followed by a subsequent decrease. This unique trend underscores the intricate and dynamic nature of the chemical interactions within the system, revealing a delicate balance of processes that drive these changes over time.
In contrast, Fig. 6b-e delves into the phase diagrams, illustrating the interdependencies and fluctuations of specific species concerning the concentrations of environmental formaldehyde, cytoplasmic formaldehyde, malate, and NCD. The phase graphs offer a comprehensive overview of the relationships and trends among these chemical components, facilitating a deeper understanding of their interactions and potentially uncovering critical insights into the system under study.
The time-dependent sensitivities of environmental formaldehyde
where the numerator
In conjunction with our previous analysis of dynamic trends of the metabolism system, we performed a local sensitivity analysis using SimBiology. The results revealed crucial insights into the system's behaviour. Specifically, we observed that the rate constants kf_0 and kr_0 exert a significant influence on the concentration of environmental formaldehyde, underscoring the system's dependency on diffusion progress rates (Fig. 7a). Furthermore, cellular formaldehyde concentrations exhibited high sensitivity to parameters kf_0, kr_0, and kf_ncd, highlighting the pivotal role of in vivo expression of the non-natural cofactor NCD in regulating cellular formaldehyde levels, particularly by facilitating downstream oxidation (Fig. 7c). Notably, the versatile cofactor NCD displayed heightened sensitivity to parameters kf_ncd, kf_0, kr_0 (partially), and kf_1, suggesting optimization prospects for endogenous NCD expression and the potential introduction of formaldehyde membrane transporters to enhance NCD's efficiency (Fig. 7d).
Fig. 7b shows the parameters involved in the LSA and the schematics of the external formaldehyde metabolic pathway. These LSA findings provide valuable insights for understanding and optimizing the system's behavior, offering opportunities for further research in various application domains.
In Fig. 8, we present the temporal evolution profiles of external formaldehyde, internal formaldehyde, and NCD under varying parameter conditions. To comprehensively understand the system's dynamics, we executed parameter iterations for kf_0, kr_0, kf_1, and kf_ncd. Remarkably, these iterations, performed under distinct initial conditions characterized by varying kinetic constants, led to discernable distinctions in the concentration dynamics of the species of interest. This observed variability in behaviour aligns seamlessly with the outcomes of our prior investigation into local sensitivity, as described earlier. These findings underscore the significant influence of parameter variations on the system's behaviour, offering valuable insights into its intricate dynamics and the potential for optimization.
The aim of our project is to metabolize formaldehyde through the use of modified enzymes capable of utilizing the non-natural coenzyme NCD. Consequently, conducting molecular docking simulations with these pertinent enzymes is of paramount importance for our forthcoming wet-lab experiments. Furthermore, successful molecular docking serves as an additional validation of the effectiveness of our bioengineering efforts.
Receptor | Ligand |
---|---|
Ncds-2 | CTP |
Ncds-2 | NMN |
Consider a molecule characterized by
The term
We use Autodock4 for molecular docking to simulate the affinity of the enzyme for various small molecules.
A quaternion, denoted as
Here, we recapitulate the main outcome of our prior work [7]. We presume that a molecule A with coordinates
With
We should mention that it is practical to work in the center-of-mass reference frame where
We are expanding on our previous work
[7]
by incorporating molecular flexibility through linear collective motions. These can be computed using techniques such as Normal Mode Analysis (NMA) or Principal Component Analysis (PCA). More specifically, let
We can rewrite the previous expression using the quaternion representation of vectors
Here, the unit quaternion
Using the scalar-vector representation of a quaternion,
Performing scalar and vector products in the above equation, then grouping the terms that depend on atomic positions together, and after introducing the inertia tensor
Here,
is the set of
and
This is our primary RMSD equation and the main result of this work, we call it master equation. It comprises the rigid contribution
A practical implication of the master equation is the expression of RMSD for a flexible target conformation with respect to a rigid reference conformation. In this scenario, all the amplitudes of the collective motions for the reference conformation
In this case, the calculation of one RMSD takes linear time with the number of collective motions
When the study is limited to the flexible movements of a molecule, such as in the refinement of docking poses or the generation of pseudo-random structural ensembles, the master equation becomes simplified
Indeed, in this case, the rotation matrix
The RMSD master equation can also be adapted for the particularly useful clustering application case. Here, we compare two possible target conformations, for which the transformation operators are defined with respect to the original reference conformation. Let
Computational analysis revealed that the Root Mean Square Deviation (RMSD) of the CTP molecule in relation to the Ncds-2 protein is 2.68, considering 43 to 43 atoms. In contrast, the RMSD of the NMN molecule with respect to the Ncds-2 protein is 2.32, considering 30 to 30 atoms.
Molecular docking simulation is a computational process that models the interaction between two or more molecules. This simulation aids in understanding the binding mechanisms of molecules and the strength of these interactions. Through molecular docking simulation, we have computationally validated the catalytic activity of Ncds towards the synthesis of
[1] Raj, A., & van Oudenaarden, A. (2008). Nature, Nurture, or Chance: Stochastic Gene Expression and Its Consequences. Cell, 135(2), 216–226.
[2] Guo, X., Liu, Y., Wang, Q., Wang, X., Li, Q. X., Liu, W-J., & Zhao, Z. K. (2020). Non-natural Cofactor and Formate‐Driven Reductive Carboxylation of Pyruvate. Angewandte Chemie International Edition, 59(8), 3143–3146.
[3] Ji, D., Wang, L., Hou, S., Liu, W., Wang, J., Wang, Q., & Zhao, Z. K. (2011). Creation of Bioorthogonal Redox Systems Depending on Nicotinamide Flucytosine Dinucleotide. Journal of the American Chemical Society, 133(51), 20857–20862.
[4] Wang, J., Guo, X., Wan, L., Liu, Y., Xue, H., & Zhao, Z. K. (2022). Engineering Formaldehyde Dehydrogenase from Pseudomonas putida to Favor Nicotinamide Cytosine Dinucleotide. Chembiochem: A European Journal of Chemical Biology, 23(7), e202100697.
[5] Zelle, RM., de Hulster, E., van Winden, WA., de Waard, P., Dijkema, C., Winkler, AA., Geertman, J-MA., van Dijken, JP., Pronk, JT., & van Maris, AJA. (2008). Malic Acid Production by Saccharomyces cerevisiae: Engineering of Pyruvate Carboxylation, Oxaloacetate Reduction, and Malate Export. Applied and Environmental Microbiology, 74(9), 2766–2777.
[6] Borda-Molina, D., José Salvador Montaña, María Mercedes Zambrano, & Baena, S. (2017). Mining lipolytic enzymes in community DNA from high Andean soils using a targeted approach. Antonie Van Leeuwenhoek International Journal of General and Molecular Microbiology, 110(8), 1035–1051.
[7] Neveu, E., Popov, P., Hoffmann, A., Migliosi, A., Besseron, X., Grégoire Danoy, Pascal Bouvry, & Sergei Grudinin. (2018). RapidRMSD: rapid determination of RMSDs corresponding to motions of flexible molecules. Bioinformatics, 34(16), 2757–2765.