
0 Overview
In order to further explore the intrinsic mechanism of the genetic circuits designed in this project, to provide a reasonable analysis and prediction of the complex response system, and to be able to provide some feedback and suggestions to the experiments, we have built a mathematical model and resorted to a number of software tools to guide the experiments in obtaining the data used to fit the model and to optimize the model further. This can provide more inspirations for our project.
In this module, our ultimate goal is to verify the feasibility of the DNA damage effect evaluation system and give the optimization direction of the genetic circuit from the perspective of gene components. In order to achieve this purpose, our main work is divided into two parts: (1) In the overall system, based on Hill equation to establish the sensor and amplifier model of the DNA damage detection system, to obtain the amplification and dynamic range of the system, and explore the optimization direction from the values of the parameters obtained from the fitting; (2) In the design of the circuit parts, combining with the results in (1), we analyze the sequences of the promoter RecA and RBS , with the help of a web tool. And then verify that the optimization of the sequences can improve the expression and amplification coefficients of the system in experiments.







1 System: The Analysis of DNA Damage Effect Evaluation System
1.1 Purpose
Hill equation, which is generally used to describe the ratio of macromolecules saturated by ligand molecules, can be used to determine the degree of receptors binding to an enzyme or receptor, and can well describe the process in which damaged DNA fragments bind to a protein to activate the expression of downstream genes in the whole-cell DNA damage detection system designed in our project. We quantitatively analyzes the system by using NA to damage DNA, measuring the expression intensity data of downstream fluorescent reporter genes, and establishing the relationship between the inputs and fluorescence outputs of different levels of damage.
The modeling process includes: (1) How the binding of LexA inhibits the transcription; (2) How the damaged DNA activate the inhibited promoters after adding the inducer; (3) Establish the relationship between the input and output of the sensor; (4) The process of amplification.
1.2 Method and Model
The modeling process of the sensors and amplifiers of this system can be divided into the following four processes[1]:
(1) Binding of LexA Inhibit Transcription
Whether or not damage inducers are added, repressor protein LexA ( denoted as
Set the rate constants for the collision binding of the repressor protein and the site
When the process of LexA binding is at steady state,
In the equation above,
(2) The Damaged DNA Binds to RecA to Activate the Inhibited Promoter
Add inducers to cause DNA damage. The broken DNA fragments are recorded as







The unbound RecA protein is denoted as
In the equation above,
When the reaction reaches a steady state,
This is the Hill equation that describes how damaged DNA binds to the protein RecA. Combined with process (1), the above equation can be understood as the proportion of activated RecA proteins in our system.
(3) The Relationship Between the Input and Output of the Sensor
The activated RecA protein can cause the repressor protein LexA to self-decomposition, thereby eliminating the inhibition of SOS self-repair responses. The equation can represent the activation rate of the promoters whose transcription is inhibited by LexA after the addition of an injuring agent. In order to quantify the expression of the reporter gene, assume the promoter have a maximum transcription rate of
In the equation above,
(4) The HrpRS Complex Binds HrpL to Initiate Transcription and Implement Signal Amplification







The HrpRS complex
In the equation above,
In the equation above,
In summary, we describe the input-output relationship of the DNA damage detection sensor by Eq.
The established assumptions include:
The measured data as well as the analysis are carried out in the steady state of the system;
Generally the activity can only be fully stimulated when the number of ligand-protein binding reaches the corresponding
or , so the treatment is simplified by disregarding the case where the binding process does not reach the corresponding or .







1.3 Results
1.3.1 Experimental Data Acquisition
We choose NA as the DNA damage inducer to characterize the effectiveness of the DNA damage detection system. Add a series of DNA damage agents in concentration gradients of 0.3, 0.625, 1.25, 1.5, 5, and 10 μ mol/L to the cultured bacterial fluids, and we measure the absorbance OD values of the samples after full reaction. The experiment data are processed using the following formula:
Where
1.3.2 Fitting and Analysis of the System
We measure the expression of wild strains (without Hrp amplification lines) induced by different concentrations of NA. And Eq. (8) is fitted by nonlinear fitting to obtain the parameters of the sensor input-output model:







This results in a DNA damage detection sensor input-output model. The model also quantitatively characterizes the RecA promoter transcription level in the designed circuit. Combined with the previous analysis, the parameter
The expression of strains containing Hrp amplification lines at the same concentration gradient are measured as the output of the amplifier. Parameters are obtained after nonlinear fitting:







According to the relational equation of the amplifier input and output expressions obtained from the analysis of process (4) in 1.2, with an
The amplifier designed for our system has some discrepancy with the results in Ref. By comparing the values of the parameters fitted in the equations on both sides, it is noted that there is a gap in the value of the parameter







To further verify the reliability of the model, we used the above model to calculate the output of the amplifier at different times and at different inducer concentrations then compare them with the results of the experiments. The Spearman correlation coefficient of between the model predicted and experimentally characterized responses is 0.88333, indicating a strong correlation between the two with a
1.3.3 Continuous Monitoring of DNA Damage Effect Evaluation System
In addition, in order to further explore the output of this system over time, we continuously monitor the wild and Hrp strains after the addition of the inducer for 3 hours, and the OD values and fluorescence are obtained every half hour. The concentrations of the inducer are 1, 2, 3, 4, and 5 μmol/L, and three sets of parallel experiments are performed at each concentration to take the average value.







This shows that the DNA damage detection system reaches relatively stable results after 2-2.5h. And it is very obvious that Hrp strain (i.e., the strain containing the complete DNA damage effect evaluation system) are able to achieve significant results in a much shorter period of time.
2 Parts: Optimization of Genetic Circuit Components
2.1 Overview
In order to further optimize the amplification effect of the system, we analyzed the RecA promoter, which is in the SOS response family of DNA damage, through web tools from the perspective of promoter engineering and RBS transformation, and found HrpR and HrpS in the igem.parts library. The RBS class related to HrpR and HrpS is screened according to its translation initiation rate to achieve the effect of reducing the background noise and improving the sensitivity.
The web tool we used for filtering is RBS calculator from https://salislab.net/software/.
2.2 Analysis of RecA Promoter
2.2.1 Theorem
Stress injury response ( SOS Response ) is a stress response of the organism itself. When bacterial DNA is damaged to a certain extent, it will repair the activity of related genes by activating SOS response. In the genome of E.coli, many genes were found to be involved in the damage repair process of SOS response. Under normal circumstances, the activity of DNA damage repair genes is inhibited by LexA repressor protein. When SOS response occurs, the activity of RecA protein is activated, which triggers the self-cleavage of LexA protein, and then initiates the transcriptional activity of downstream genes. Therefore, the RecA promoter plays an important role in the initiation of biosensors.
In order to further analyze the RecA promoter, we analyzed the RecA promoter based on the Promoter Caculator in online software https://www.denovodna.com/software/.
Promoter Caculator is an online software for predicting transcription rate, which performed large-scale parallel in vitro experiments on the designed promoter sequence with the designed barcode to systematically measure the interaction of control site-specific transcription on the σ70 promoter. Based on these data, Promoter Caculator developed a statistical thermodynamic model to calculate how RNAP / σ70 interacts with any DNA to predict the transcription initiation rate at each location. The model has only 346 interaction energy parameters, but can accurately predict the transcription rate of 22,132 bacterial promoters with different sequences. [3]
The Promoter Caculator established a free energy model to calculate
The free energy model is :
2.2.2 Design
We analyzed the RecA promoter fragment used in the experiment, and predicted the different speed records of each site to start transcription, so as to better understand the RecA promoter.







2.2.3 Discussion
In the 342-346 position of the promoter sequence, we can see a peak of transcription rate, with an average value of about 13442 ( au ), and we further look at the free energy of this part. It can also be seen that the







2.3 RBS Screening Based on Translation Initiation Rate
2.3.1 Theorem
In the four stages of prokaryote's translation, translation initiation is the rate-limiting step in most cases. Among them, the translation initiation rate is mainly affected by factors such as the RBS sequence on the mRNA, the initiation codon, and the secondary structure before translation initiation, which has an important influence on protein expression.
In order to further improve the expression level of HrpR protein and HrpS protein in the line amplifier, so as to improve the line response sensitivity, we predict the translation initiation rate based on the online software RBS calculator ( https://www.denovodna.com/software/ ).
RBS Calculator is a nucleic acid analysis and design software designed and developed by the University of Pennsylvania. It has the ability to predict the translation initiation rate ( TIR ) of protein coding sequence, which can be directly run online on the website.







RBS Calculator has a powerful analysis and design function: after inputting the protein coding sequence and the host type, the total free energy
In addition, we note that ribosomes and mRNA transcripts are in a state of dynamic equilibrium during the exponential growth phase of the host cell. Based on this, we further assume that the translation initiation rate r is proportional to the number of ribosome-mRNA complexes, and derive:
2.3.2 Design
We obtained a series of RBS sequences Community RBS Collection in the iGEM library, which are suitable for general protein expression in E.coli or other prokaryotes ( including HrpRS protein used in DNA damage detection system ), and cover a series of translation initiation rates. We connected the above sequences with the protein coding sequences of HrpR and HrpS, and predicted the corresponding transcription initiation rate through the RBS Calculator, so as to screen out the RBS sequences of HrpR and HrpS proteins that are most suitable for our application for experiments.
As shown in Table 3., when the encoded protein is HrpR, the starting rate corresponding to each RBS is as follows:







When the encoded protein is HrpS, the starting rate of each RBS is as follows:







2.3.3 Discussion
In order to verify the reliability of the model for scoring the effect of RBS transformation, we perform experimental characterization of the above RBS sequences. Experiments are conducted with HrpR protein as the target, and four gene lines after RBS modification are constructed preferentially, and transformation and characterization experiments are conducted with the expectation that the change in transcription rate due to RBS modification will be reflected by the change in the average fluorescent protein expression over the same period of time. Currently, we are conducting characterization experiments.
3 References
[1] Alon, U. An Introduction To Systems Biology: Design Principles Of Biological Circuits.
[2] Wang B, Barahona M, Buck M. Engineering modular and tunable genetic amplifiers for scaling transcriptional signals in cascaded gene networks. Nucleic Acids Res. 2014 Aug;42(14):9484-92. doi: 10.1093/nar/gku593. Epub 2014 Jul 16. PMID: 25030903; PMCID: PMC4132719.
[3] LaFleur, T.L., Hossain, A. & Salis, H.M. Automated model-predictive design of synthetic promoters to control transcriptional profiles in bacteria. Nat Commun 13, 5159 (2022). https://doi.org/10.1038/s41467-022-32829-5
[4] Salis, H., Mirsky, E. & Voigt, C. Automated design of synthetic ribosome binding sites to control protein expression. Nat Biotechnol 27, 946–950 (2009). https://doi.org/10.1038/nbt.1568