UTokyo - iGEM 2023

Overview

Introduction

Mathematical Modeling is, in a nutshell, the process of representing a system using mathematical concepts and language. Even complex phenomena can be analyzed by expressing them in equations and reasoning about them. In natural sciences, Mathematical Modeling is often used not only to analyze the data from experiments and observations, but also to predict the outcomes beforehand by using simulations and other methods.

One of the most important concepts in synthetic biology is the DBTL cycle, an engineering cycle that repeats the four steps of Design, Build, Test, and Learn to improve a project.

In Mathematical Modeling, a model is built and simulated, and the results are examined to provide feedback to the design process.

Compared to wet experiments, Mathematical Modeling is easy to prepare, and results can be obtained repeatedly under different conditions.

These features allow Mathematical Modeling to accelerate the DBTL cycle and boost project improvement.

Furthermore, recent advances in computers and AI allow modeling to handle more complex systems than ever before, making mathematical modeling increasingly indispensable for synthetic biology.

Brief Summary of Our Project

In this project, we aimed to construct a rapid detection and secretion system in mammalian cells without transcription and translation. The SWIFT system consists of several subsystems, and we divided it into three main modules for simplification and compatibility.

Tips: Advantages of Modularization

A module is a unit that represents a part of the problem, and each module has inputs and outputs. Appropriate modularization provides benefits such as clarifying the correlations within the system, facilitating modeling, and making it easier to improve each module. As shown by attributing individual functions to specific DNA fragments in synthetic biology, it is wise to apply modularity in a system composed of genetic circuits.

SWIFT is composed of three systems: the MESA, Secretion, and Amplification Systems. Over several engineering cycles, we settled on these three modules as the components of the project. This page mainly outlines the modeling of the three modules in the last cycle. The contributions of the Dry Lab in each cycle are described in the engineering cycle.

The following is an overview of the Dry Lab's main relationships with the project.

An overview of Dry Lab's main relationships with the project

(1) Wet / Measurements ⇒ MESA Module

The Wet Lab measurements revealed that MESA leaks in the absence of ligand, so we extended the previous model to account for this leak. The extended model was fitted to the measurements and confirmed to be consistent with the experimental data.

Figure 1-a

Figure 1-b

Figure 1: Relationship between ligand concentration and fluorescence intensity
(Theoretical curve (blue), experimental value (red))

(2) MESA Module ⇒ Wet / Measurements

First, we used TMHMM to evaluate the probability that our MESA design spans the membrane, and fed the result back into the design to increase its predicted transmembrane probability.¹ Then, based on the ODE model for MESA, sensitivity analysis and other analyses were conducted to reduce the leak of the MESA we used. As a result, we gave feedback to the Wet Lab that it is important to reduce the receptor expression rate $μ$, decrease the binding rate constant $k_{1on}$, and increase the dissociation constant $K_{d}$. In addition, we concluded that it is important to increase the activity $k_{cat}$ of the protease to improve the rapidity of MESA detection. See MESA Module for more information.

(3) MESA ⇒ Secretion (Need for Amplification Module)

MESA has been studied as a system that releases a TF (transcription factor) into the cytoplasm for transcriptional control. In SWIFT, however, the protein released by MESA is a protease instead of a TF, and the protease functions as the signal (input) of a secretory control system to improve rapidity (see Description for details). Dry Lab analysis suggested that the amount of protease output from MESA was not sufficient. Therefore, we added a new Protease Amplification Module to the project design and verified its effectiveness in the Dry Lab. (For details of the analysis, please refer to the Secretion Module.)

Figure 2-a

Figure 2-b

Figure 2: Comparison of the concentration of S1 (POI in ER and cis-GA) between transcription control and secretion control.
(Left) Before Amplification, (Right) After Amplification.

(4) Wet / Measurements ⇒ Secretion Module

Using the measurements from a previous study, we tested the validity of two models we built, one for transcriptional control by TF and the other for secretory control by protease.² The correlation coefficients between the developed models and the measurements were 0.95 for TF and 0.90 for secretion, which supports the validity of the models. (See Secretion for more information.)
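As an illustration of how such a comparison can be scored, the sketch below computes a Pearson correlation coefficient between measured and simulated values. We assume here that a Pearson coefficient is meant; the function name `pearson_r` and the sample numbers are placeholders for illustration, not project data.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (at least 2 points)")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant data")
    return cov / math.sqrt(var_x * var_y)


# Placeholder values only; these are NOT measurements from the project.
measured = [0.0, 1.0, 2.0, 3.0]
simulated = [0.1, 0.9, 2.2, 2.8]
print(pearson_r(measured, simulated))
```

A value close to 1 means the simulated curve rises and falls together with the measurements; it does not by itself show that the absolute values agree.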

Figure 3-a

Figure 3-b

Figure 3: Comparison of experimental data and simulated data.
(Left) Secretion, (Right) Transcription

(5) Secretion Module ⇒ Wet / Measurements

Analysis of the model showed that amplifying the amount of protease released from MESA by about 100 times, together with increasing the amount of secretory protein stored and the transcription rate, makes it possible to achieve a rapidity that cannot be achieved by transcriptional control. Therefore, we proposed the addition of a Protease Amplification Module and gave feedback to Project Design with suggestions for improvement, such as increasing the expression level of COP I and using cells with high secretory capacity.

We also investigated the relationship between the concentration of protease released from MESA and the amount of secreted protease, and found that SWIFT's secretory control system is characterized by a sharp increase in secretion at concentrations above a threshold value. Therefore, it is possible to make the system more sophisticated by changing $k_{cat}$ according to the amount of leakage.

(6) Amplification Module ⇒ Wet / Measurements

In the Amplification Module, since the Wet experiment could not be conducted in time, the ODE model was used in the Dry Lab to verify whether the designed protease is amplified. In addition, the disorder of the linker structure in the Protease-Linker-AI Domain was evaluated using AlphaFold2 to increase the amplification of the protease and reduce leakage.³⁴ For details, please refer to the Amplification Module.

Figure 4: Comparison graphs of pLDDT at each residue of the TVMV^Thr-AI and mutated TVMV^Thr-AI sequences.

Figure 5: Comparison of predicted three-dimensional structures using LocalColabFold.

(7) Software ⇒ Wet / Measurements

SWIFT uses a total of three proteases across its three modules (MESA, Secretion, and Amplification). The analysis of each module showed that protease parameters such as $k_{cat}$ and $K_{m}$ can be used to adjust SWIFT's performance. We were keenly aware of the lack of software that could quickly find proteases with suitable parameters during system design. Therefore, we created a new software program, "Proteameter", that allows users to quickly narrow down candidate proteases by specifying a range of parameters. This software can also be used by future iGEMers who work with proteases. (See Software page for more information.)

MESA Module

MESA is a system that detects soluble Ligand and releases POI.

Figure 6: Role of MESA Module in SWIFT. MESA detects soluble Ligand (Input) and releases Protease (Output). The released Protease is used as Input for the Amplification Module.

To begin with, in order for our designed MESA to work, the Dry Lab used TMHMM to increase its predicted transmembrane probability (see Engineering MESA cycle 1 for more information)⁵.

Next, in order for SWIFT to detect ligand and secrete proteins quickly and with less leakage, MESA must have (1) a low amount of leakage protease and (2) a rapid process from ligand detection to protease output. MESA modeling was conducted to promote understanding of the system for these improvements.

We performed mapping of reaction pathways and constructed a new ODE model. The constructed model was validated to ensure that it shows biologically stable behavior and that it is consistent with the measurement results. Based on the validated models, analyses such as sensitivity analysis were conducted.

The results showed that (1) a decrease in absolute expression level together with a weaker binding affinity between ligand and the first receptor (a larger dissociation constant) and (2) an increase in $k_{cat}$ of the protease in MESA were effective in improving the system. Overall, the modeling of the MESA Module has aided the design of our project, clarified the behavior of the MESA system, and provided clear guidance for further improvements in engineering. (See Conclusion for more details)

Secretion Module

In the previous MESA, transcription factors such as tTA are released from MESA to promote transcription.⁶ However, the response based on transcriptional control of secretory proteins lacks speed because it requires transcription and translation, so we devised a more rapid system. By attaching an endoplasmic reticulum retention signal called KKYL to a secreted protein and cleaving KKYL with protease, secretion can be achieved without transcription or translation.² In our project, we have established a secretion system without transcription and translation by releasing Protease from MESA and cleaving KKYL with this protease.

We developed two ODE models, one with tTA as input and one with protease as input, for the cases in which tTA and protease are placed downstream of the MESA system, respectively. We compared the rate from the release of a substance downstream of MESA to the secretion of the target substance, and analyzed the factors involved in the rapid secretion of the system. Since the experiments indicated the possibility of leakage in the MESA system, we also examined the relationship between protease concentration and secretion to determine its resistance to leakage.

Simulations based on this model showed that our system can achieve a rapidity that cannot be achieved by previous TF-based MESA systems by increasing the amount of secreted protein stored in the endoplasmic reticulum, the transcription rate, and the concentration of protease released from MESA. Based on the relationship between protease concentration and the amount of secreted protein, we proposed that the secretion system can be made resistant to leakage by choosing the type of protease ($k_{cat}$) according to the amount of MESA leakage. If MESA leakage is high, using a protease with a small $k_{cat}$ will reduce the leakage of secreted proteins.

Amplification Module

One of the things we aimed for in SWIFT is versatility and flexibility. We worked on modeling to enable the design of SWIFT that meets user needs, such as increasing the absolute expression level of target substances and improving the S/N ratio, by freely selecting Protease released by MESA and the reaction mechanism between Proteases.

The Amplification System used this time is one in which Protease (P1) released by MESA activates Protease (P2) prepared in advance inside the cell by cleaving the pair of P2 and its inhibitor domain. Once P1 cuts the linker of P2, it can cut other linkers without becoming inactive, enabling amplification as a whole⁷.

Figure 7: Protease 1 (P1) serves as the input, and it yields Protease 2 (P2) as the output.

First, we compared the expression levels of target substances with and without introducing the Amplification System using an ODE model to confirm the superiority of the Amplification System and to ensure that the absolute expression level is sufficient.
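The amplification idea can be illustrated with a minimal sketch. It assumes, purely for illustration, that P1 stays constant, that caged P2 (P2 bound to its inhibitor domain) is uncaged by simple mass action with an effective rate constant `k_eff`, and that nothing is degraded. The function names and all numerical values are hypothetical and are not taken from our ODE model.

```python
import math


def active_p2(t_min, p1, caged_p2_0, k_eff):
    """Active P2 (M) after t_min minutes.

    Assumes dS/dt = -k_eff * p1 * S for the caged pool S with constant p1,
    so S(t) = S0 * exp(-k_eff * p1 * t) and active P2 = S0 - S(t).
    """
    return caged_p2_0 * (1.0 - math.exp(-k_eff * p1 * t_min))


def gain(t_min, p1, caged_p2_0, k_eff):
    """Ratio of active P2 (output) to P1 (input)."""
    return active_p2(t_min, p1, caged_p2_0, k_eff) / p1


# Hypothetical values: 1 nM P1, 100 nM caged P2, k_eff = 1e7 (M*min)^-1.
for t in (10.0, 60.0, 600.0):
    print(t, gain(t, p1=1e-9, caged_p2_0=1e-7, k_eff=1e7))
```

Because P1 is not consumed, the gain in this sketch is limited only by the size of the caged P2 pool relative to P1 and by the time available for cleavage.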

Furthermore, we utilized three-dimensional structure simulation to qualitatively evaluate the binding energy between P2 and the Inhibitor Domain and the stability of linkers in the above system. This allows us to change the entire system in the direction desired by users. Specifically, by increasing the binding energy and improving linker stability, it is possible to suppress leakage at the expense of absolute expression level, and vice versa.

When creating the model, we felt the lack of a protease database, so we constructed a new one, ProteaseDB. It allows proteases to be narrowed down by conditions such as orthogonality, and it makes constants that are important for modeling easy to look up. Therefore, it will be helpful for future SWIFT designers and the iGEM Community. In addition, we incorporated the Protease Amplification System into SWIFT to make SWIFT useful in more combinations, and verified the results by modeling.⁷

Conclusion

To accurately define the functions required for SWIFT, we initiated a process of modularization and refinement. Initially, the project comprised two modules: MESA and Secretion. The necessity to augment the project design became evident through the analysis of the Secretion Module in the Dry Lab, suggesting the addition of the Amplification Module. This enhancement fostered better collaboration among the individual modules and propelled SWIFT's practical functionality to a new level.

We constructed ODE models for each of the modules - MESA, Amplification, and Secretion. The models' validity was verified against actual measurement results for MESA and Secretion. For the Amplification Module, the model evaluated and verified the feasibility of protease amplification. These model analyses provided us with a refined understanding of each module's behavior and offered suggestions for improvements regarding parameters that could be manipulated in the Wet Lab.

Additionally, during the design process, we recognized the need to quickly and efficiently narrow down parameters for the protease, specifically $k_{cat}$ and $K_{m}$. To address this need, we developed the software "Proteameter." Given a range of $k_{cat}$ and $K_{m}$ values, Proteameter generates a list of candidate proteases that fall within the specified parameter range. This software not only serves present iGEMers but can also accept the registration of new proteases, saving time in the quest to identify proteases with the desired parameters.
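The kind of range query described here can be sketched as follows. This is not Proteameter's actual code or data format: the `Protease` record, the function `filter_proteases`, and the entries in `catalog` are hypothetical placeholders with made-up constants.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Protease:
    name: str
    kcat_per_min: float  # turnover number, min^-1
    km_molar: float      # Michaelis constant, M


def filter_proteases(records, kcat_range, km_range):
    """Return the records whose kcat and Km both lie inside the given closed ranges."""
    kcat_lo, kcat_hi = kcat_range
    km_lo, km_hi = km_range
    return [
        p for p in records
        if kcat_lo <= p.kcat_per_min <= kcat_hi and km_lo <= p.km_molar <= km_hi
    ]


# Placeholder entries; the names and constants are invented for illustration.
catalog = [
    Protease("protease_A", 0.2, 5e-5),
    Protease("protease_B", 5.0, 2e-4),
    Protease("protease_C", 50.0, 1e-6),
]

hits = filter_proteases(catalog, kcat_range=(1.0, 100.0), km_range=(1e-7, 1e-3))
print([p.name for p in hits])
```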

Through the construction of concise and well-represented models, close collaboration with the Wet Lab, and dialogues with experts, we have successfully established SWIFT as a rapid and scalable detection and secretion system.

Our work involves identifying the function of each genetic part and integrating the parts into modules for the whole system. The optimization of module-specific functionalities, and the enhancement of interactions between modules, have collectively improved the system's practicality, as elaborated and illustrated above. We hope that our persistent and steady approach throughout the project will provide valuable insights to iGEM Community teams through our wiki.

MESA

Purpose

MESA is a system that detects a specific soluble ligand and releases the protein of interest (POI). As a detection module, it can express synthetic receptors corresponding to the soluble ligand to be detected, allowing detection of a wide variety of ligands (see Description for details).

Figure 1: Role of MESA module in SWIFT. MESA detects the desired soluble ligand (Input) and releases Protease (Output). The released Protease is used as Input for the Secretion Module.

In order for the MESA Module to work better in SWIFT, it needs to be improved in two respects: (1) fewer leaks and (2) a quicker system response. Although MESA is a composite part using existing proteases and ectodomains, no model of MESA as a whole has been available. Therefore, we first mapped the reaction pathway of MESA and constructed a new ODE Model.

Method and Modeling

Mapping Reaction

As mentioned earlier, there is no Modeling for MESA, so we first mapped the reaction pathway. A schematic of the reaction pathway is shown below:

Figure 2: MESA reaction pathway and wordlist

The reaction in this figure can be divided into three parts:

(1) Receptor dimerization by binding of ligand to the receptor.

(2) The dimerization brings the TC and PC chains into physical proximity, and PR cleaves CS.

(3) The target substance is released into the cytoplasm by cleavage.

Based on the above information, we formulated the reaction equation. MESA can be utilized when the ectodomain forms either a heterodimer or a homodimer. In the homodimeric case, the PC and TC chains have the same binding strength. In the heterodimeric case, however, the PC and TC chains bind the ligand with different affinities, which results in a defined binding order. In this modeling, we deal with the heterodimeric case because the Wet Lab used rapamycin.

Simulation Model

Based on the reaction pathway mapped above, an ODE model was constructed in MATLAB. The simulations were performed and the results were visualized using MATLAB R2023a and SimBiology Model Analyzer 6.4.1.

A simple model based on the heterodimerization reaction was designed using rapamycin as the experimental system, and parameters were set based on literature values. In MESA, the receptors on the membrane, the TC and PC chains, detect the ligand, and the cleavage sequence is then cleaved by the protease beneath the membrane. Cleavage of the cleavage site by the protease is usually regarded as a type of enzymatic reaction and is described in the figure below.

MESA can be viewed as this enzymatic reaction in which the association and dissociation of protease (enzyme) and cleavage sequence (substrate) are mediated by ligand binding. Based on this idea, we performed modeling. A schematic diagram of the MESA reaction as compared to a normal enzyme reaction is shown below.

Figure 3: MESA equation and reaction schematic
In a typical enzymatic reaction, Enzyme and Substrate associate with rate constant $k_{on}$ and dissociate with rate constant $k_{off}$. The assembled complex $ES$ is converted at a rate of $k_{cat}$, turning the substrate into the target protein P. In MESA, on the other hand, the association and dissociation of Protease (Enzyme) and CS (Substrate) are mediated by the extracellular ligand and the receptors on the plasma membrane. Taking rapamycin as an example, which strongly favors heterodimerization, the ligand (rapamycin) and the first receptor (FKBP) first associate and dissociate with $k_{1on}$, $k_{1off}$. This complex and the second receptor (FRB) then associate and dissociate with $k_{2on}$, $k_{2off}$.¹ Finally, the trimer undergoes cleavage under the plasma membrane at a protease-specific constant $k_{cat}$, thus completing the MESA reaction.

The dissociation constant between rapamycin and FKBP is 0.2 nM, while the dissociation constant between rapamycin and FRB is very large, 26 μM¹. Therefore, we constructed the above model under the assumption that the binding between rapamycin and FRB is negligible.
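As a rough check of this assumption, the equilibrium occupancy of each receptor by rapamycin alone can be estimated with a simple 1:1 binding isotherm. The isotherm and the symbol $\theta$ are our illustrative simplification and are not part of the ODE model; the ligand concentration of 5 nM is the one used for the time-course simulation in the Results.

$$ \theta = \frac{[L]}{[L]+K_{d}} $$

$$ \theta_{FKBP} = \frac{5\ \mathrm{nM}}{5\ \mathrm{nM}+0.2\ \mathrm{nM}} \approx 0.96, \qquad \theta_{FRB} = \frac{0.005\ \mathrm{\mu M}}{0.005\ \mathrm{\mu M}+26\ \mathrm{\mu M}} \approx 1.9\times10^{-4} $$

Under this simplification, almost all FKBP would be occupied at 5 nM rapamycin while direct occupancy of FRB would stay below 0.1%, which is in line with neglecting the direct rapamycin-FRB binding.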

Tips: Michaelis-Menten Formula

The Michaelis-Menten equation is usually used for enzymatic reactions, assuming that there is an excess of substrate and that the dissociation of the complex is the rate-limiting reaction.

However, in MESA, the binding and dissociation reactions between ligand and receptor are not necessarily faster than the dissociation of the PR complex. Therefore, the Michaelis-Menten equation is not used in the modeling that follows.
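For reference, the standard quasi-steady-state form of the Michaelis-Menten equation is given below. The symbols $v$ (reaction rate) and $[E]_{0}$ (total enzyme concentration) are introduced here only for this note.

$$ v = \frac{k_{cat}[E]_{0}[S]}{K_{m}+[S]}, \qquad K_{m} = \frac{k_{off}+k_{cat}}{k_{on}} $$

This form relies on the complex $ES$ reaching a quasi-steady state quickly. When binding and dissociation are not fast relative to cleavage, as may be the case for ligand-mediated association in MESA, each step is better written out explicitly with mass action terms.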

Figure 4: MESA equation
In the absence of ligand, no reaction occurs and the receptor production rate $μ$ is balanced by the product of the degradation rate $δ$ and the receptor concentration $[R]$; in the presence of ligand, the reaction described in Figure 3 occurs and follows the equations shown above.

Assumption

The above model is based on mass action kinetics, under several assumptions. The diffusion of proteins is assumed to be sufficiently rapid that the concentrations of the various substances are uniform inside and outside the cell. The ligand concentration is treated as constant, because the ligand is abundant outside the cell compared with the intracellular species. We do not consider the degradation of the ligand-receptor complex. We assume that the non-reactive Target Chain R2 is rapidly degraded after the POI is cleaved, and that ligand and receptors are converted into the target substance $H$ with rate constant $k_{cat}$ once they form a trimer. Cleavage by protease that has been released into the cytoplasm, for example after separating from the protease chain, is not considered because its amount is very small. The association of target chains and protease chains with each other is not considered.
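The reaction scheme and assumptions above can be written down as a small mass action simulation. The sketch below is an illustrative reconstruction in Python, not our MATLAB/SimBiology model: the state layout, the fixed-step Runge-Kutta integrator, the degradation of $H$ with rate $δ$, and every parameter value are assumptions, except that `k1off / k1on` is set to the 0.2 nM dissociation constant quoted above. Concentrations are in M and time is in minutes.

```python
# Hypothetical parameter values (M and min); only k1off / k1on = 0.2 nM comes from the text.
PARAMS = {
    "mu": 1e-9,     # receptor production rate, M/min
    "delta": 0.01,  # degradation rate, 1/min
    "k1on": 1e8,    # L + R1 association, 1/(M*min)
    "k1off": 0.02,  # L.R1 dissociation, 1/min
    "k2on": 1e7,    # L.R1 + R2 association, 1/(M*min)
    "k2off": 0.1,   # trimer dissociation, 1/min
    "kcat": 1.0,    # cleavage of the trimer, 1/min
}


def mesa_rhs(state, p, ligand):
    """Time derivatives of (R1, L.R1, R2, trimer, H) at constant ligand concentration."""
    r1, c1, r2, c2, h = state
    bind1 = p["k1on"] * ligand * r1 - p["k1off"] * c1  # L + R1 <-> L.R1
    bind2 = p["k2on"] * c1 * r2 - p["k2off"] * c2      # L.R1 + R2 <-> trimer
    cleave = p["kcat"] * c2                            # trimer -> H
    return (
        p["mu"] - p["delta"] * r1 - bind1,
        bind1 - bind2,
        p["mu"] - p["delta"] * r2 - bind2,
        bind2 - cleave,
        cleave - p["delta"] * h,  # degradation of H is an assumption of this sketch
    )


def rk4_step(state, p, ligand, dt):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(base, slope, scale):
        return tuple(b + scale * s for b, s in zip(base, slope))

    k1 = mesa_rhs(state, p, ligand)
    k2 = mesa_rhs(shifted(state, k1, 0.5 * dt), p, ligand)
    k3 = mesa_rhs(shifted(state, k2, 0.5 * dt), p, ligand)
    k4 = mesa_rhs(shifted(state, k3, dt), p, ligand)
    return tuple(
        s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def simulate(p, ligand, t_end=300.0, dt=0.05):
    """Integrate from the ligand-free steady state R = mu/delta; returns the final state."""
    r0 = p["mu"] / p["delta"]
    state = (r0, 0.0, r0, 0.0, 0.0)
    for _ in range(int(round(t_end / dt))):
        state = rk4_step(state, p, ligand, dt)
    return state


print("H at L = 5e-9 M:", simulate(PARAMS, 5e-9)[4])
```

Starting from the ligand-free steady state $[R]=μ/δ$ keeps the receptors constant until ligand is added, so in this sketch any $H$ that appears is due to ligand binding alone.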

Initial Values

Results

Behavioral Stability

Figure 5: Time dependence of $R_{1}$ , $R_{2}$ , $H$ at $L$ =5e-9 M
The $R_{1}$ , $R_{2}$ react to produce the target substance $H$ . The response of the target substance is rapid, reaching a half response in about 10 minutes. Each concentration converges and the model shows biologically valid behavior.

Model Validation and Development by Wet Measurement

The Wet Lab results showed that the MESA leak is about 50% even in the absence of ligand. Therefore, we extended the above model by adding a term in which receptors $R_{1}$ and $R_{2}$ collide by chance at a rate of $k_{12}$. Since the ligand concentration in the above model was relative, we also added a term that converts the ligand concentration used in the wet measurement into the ligand concentration of the model. These parameters were estimated from the wet measurements and determined to be $k_{12}$ =5.20e6 (M*min)^-1, $A$ =2.00e-2.

Figure 6: Equations of the constructed model

A term for the conversion of ligand concentration and the term $k_{12}R_{1}R_{2}$ for the accidental collision of $R_{1}$ and $R_{2}$ have been added. $[L]_{ex}$ represents the sample concentration of rapamycin in the wet lab experiment.

The model was developed and fitted as described above.

Figure 7: Relationship between ligand concentration and fluorescence intensity
(Theoretical curve (blue), experimental value (red))

From these results, the model was refined to be more consistent with the Wet Lab measurements. Based on the wet lab measurements above, the MESA Module should be improved in the following three areas so that it works better in SWIFT.

(1) Suppression of MESA leakage in the presence of trace amounts of ligand
(2) Enhanced rapidity
(3) Reduction of leak in the absence of ligand

For (3), the main cause of leakage is thought to be the accidental collision of receptors with each other due to the physical proximity of the protease chain and the target chain, which is cleaved by the protease. Therefore, it can be proposed that reducing the receptor density by decreasing the receptor expression level and strengthening the steric hindrance of the ligand are effective.

However, it is difficult to reduce this leakage by changing the parameters that can be adjusted in MESA.

Therefore, in the Dry Lab, we focused on (1) and (2).

Analysis

In the following analysis, $k_{12}=0$ and $A=0$ were used to isolate the response in the presence of ligand from the ligand-independent leak.

Figure 8: Relationship between ligand concentration in MESA and the concentration of produced target substance

The figure above shows that even in the presence of a small amount of ligand (L=1e-10 M), the response of MESA is about half of the peak response. The graph also indicates that at L=1e-8 M the MESA response approaches a plateau.

Therefore, in the following discussion, we take L=1e-10 M to represent a trace amount of ligand and L=1e-8 M to represent a sufficient amount of ligand.

In order to increase the sensitivity of MESA, it is necessary to improve the signal-to-noise ratio by reducing only the amount of MESA response in the presence of a small amount of ligand.

Therefore, we performed a sensitivity analysis with the goal of reducing the response only when ligand is present in trace amounts.

Figure 9: Sensitivity analysis for each parameter at L=1e-10 M (left) and L=1e-8 M (right)

The results of the sensitivity analysis show that the sensitivities to $k_{1on}$ and $k_{1off}$ differ markedly between the ligand-saturated condition and the trace-ligand condition. This result suggests that the S/N ratio of MESA can be improved by changing the binding affinity between rapamycin and FKBP in experiments.
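A local sensitivity of this kind can be estimated by perturbing one parameter at a time. The sketch below is a generic illustration, not the SimBiology analysis we ran: it uses the equilibrium occupancy of the first receptor, $L/(L+k_{1off}/k_{1on})$, as a stand-in for the model output, and the names `occupancy` and `log_sensitivity` as well as the value of `k1on` are hypothetical (only the ratio `k1off / k1on` = 0.2 nM is taken from the text).

```python
import math


def occupancy(params, ligand):
    """Stand-in output: equilibrium fraction of the first receptor bound to ligand."""
    kd = params["k1off"] / params["k1on"]
    return ligand / (ligand + kd)


def log_sensitivity(model, params, name, ligand, rel_step=1e-4):
    """Central-difference estimate of d ln(output) / d ln(parameter)."""
    up = dict(params)
    up[name] = params[name] * (1.0 + rel_step)
    down = dict(params)
    down[name] = params[name] * (1.0 - rel_step)
    d_ln_out = math.log(model(up, ligand)) - math.log(model(down, ligand))
    d_ln_par = math.log(1.0 + rel_step) - math.log(1.0 - rel_step)
    return d_ln_out / d_ln_par


params = {"k1on": 1e8, "k1off": 0.02}  # k1off / k1on = 0.2 nM
for ligand in (1e-10, 1e-8):
    for name in ("k1on", "k1off"):
        print(ligand, name, log_sensitivity(occupancy, params, name, ligand))
```

In this simplified stand-in, the output is far more sensitive to $k_{1on}$ and $k_{1off}$ at L=1e-10 M than at L=1e-8 M, because at saturating ligand nearly all receptors are already bound.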

(1) To Suppress the Leakage of MESA in the Presence of Trace Ligand

Figure 10: Concentration of products when $k_{1on}$ (M×min)^-1 is varied in the presence of trace ligand $L$ =1e-10 M (left) and excess ligand $L$ =1e-8 M (right).

Insight

Based on the sensitivity analysis and the results of the above analysis, it is effective to decrease $k_{1on}$ and increase the dissociation constant ( $K_{1d}=k_{1off}/k_{1on}$ ) between Rapamycin and FKBP in order to improve the signal-to-noise ratio.
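The effect of $K_{1d}$ on the signal-to-noise ratio can be illustrated with a deliberately simplified calculation. It assumes that the response is proportional to the equilibrium occupancy of FKBP by rapamycin, which is our simplification rather than the ODE model, and it defines S/N as the response at L=1e-8 M divided by the response at L=1e-10 M. The function names are hypothetical.

```python
def occupancy(ligand, kd):
    """Equilibrium fraction of receptor bound, L / (L + Kd)."""
    return ligand / (ligand + kd)


def signal_to_noise(kd, high=1e-8, low=1e-10):
    """Ratio of occupancy at sufficient ligand to occupancy at trace ligand."""
    return occupancy(high, kd) / occupancy(low, kd)


# 0.2 nM is the rapamycin-FKBP value quoted in the text; larger values are hypothetical.
for kd in (2e-10, 2e-9, 2e-8):
    print(kd, signal_to_noise(kd), occupancy(1e-8, kd))
```

Under this simplification, a larger $K_{1d}$ raises the ratio toward its upper bound of 100 (the ratio of the two ligand concentrations), but it also lowers the absolute response at L=1e-8 M, so the two have to be balanced.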

The expression rate $μ$ is also a sensitive parameter in the sensitivity analysis. However, $μ$ is sensitive regardless of the ligand concentration, and although increasing the expression level of MESA increases the output of MESA, it also increases the leak in the absence of ligand (as shown by the wet measurement), so increasing $μ$ is not a recommended option.

On the other hand, reducing $μ$ is effective because it decreases the receptor density and reduces the leak due to accidental dimerization in the absence of ligand. However, when receptor expression is reduced, the absolute amount of protease released by MESA is also reduced. In addition, the Secretion modeling suggested that the output of MESA alone is not enough to produce sufficient secretion in the Secretion Module described below.

Therefore, the Dry Lab suggested the need for the Protease Amplification Module (described below) to ensure sufficient amount of protease for secretion while reducing the expression of receptors and suppressing leakage, and improvements were made in the project.

(2) Enhanced Rapidity

To analyze the rapidity of MESA, we varied the protease cleavage efficiency $k_{cat}$ , which was moderately sensitive in the sensitivity analysis, and the binding affinity $k_{2on}$ , $k_{2off}$ between the FKBP/Rapamycin complex and FRB.

Figure 11: Amount of target substance produced when $k_{cat}$ (min^-1) of the protease is varied

Figure 12: Amount of target substance produced when $k_{2on}$ (M×min)^-1 and $k_{2off}$ (min^-1) between the FKBP/rapamycin complex and FRB are varied

$k_{2on}$ is a parameter related to binding affinity and depends on the sequence of the receptor, whereas $k_{cat}$ can be improved by changing the protease or mutating CS, which is simpler work in the wet lab. Therefore, increasing $k_{cat}$ is generally preferable. Our database "Proteameter" can be used to select proteases with a higher $k_{cat}$.

Insight

For rapidity it is desirable to increase the activity $k_{cat}$ of the protease (or enhance the binding affinity of the second bond).