Model

Introduction

This year, we used e-pili, a special bio-nanomaterial derived from Geobacter metallireducens, as the "variable resistor" in our disease biomarker detection system. By measuring how the conductivity of the e-pili changes at different concentrations of the target substance and comparing it with a standard curve, we can quantify the target substance (To learn more: design).

In our system, the conductivity of the e-pili is the most important parameter. Higher conductivity allows our detection system to become more stable and accurate. Additionally, higher conductivity may also imply more possibilities for future applications of this bio-nanomaterial.

In this section, we use optimization methods from mathematical modeling to optimize the structure of the pili and improve their conductivity. Combined with an innovative screening method based on a microbial fuel cell system, this may eventually yield pili with higher conductivity.

1. Lower current thermal effect

Compared to inorganic conductive nanomaterials, an important point to note about e-pili is that they are proteins. This means that we need to consider whether they can function stably in the working environment of traditional inorganic materials.

In our system, the e-pili act as variable resistors. When current passes through the e-pili, work is done, generating heat in proportion to their resistance. Since our system measures the conductivity changes of the nanowires over a period of time, the e-pili are constantly exposed to Joule heating. Because they are protein-based materials, long-term exposure to high temperatures may cause denaturation (using the DeepSTABp tool, we estimated the melting temperature of these nanowires to be 43 °C). This could affect the stability and accuracy of the system. Higher conductivity, that is, lower resistance, reduces this heating.

2. Smoother resistance variation curve

We have found that the resistance of e-pili with higher conductivity changes more smoothly as pH changes. The range over which the conductivity changes within the measured range is also greater, which may help improve the detection threshold. For example, compared to the wild type, W51W57 (a mutant of Geobacter sulfurreducens with higher conductivity) shows a stronger linear relationship between its conductivity and the external pH as the pH varies from 2.0 to 10.5 [1]. A stronger linear relationship means that we can better establish models and quantify and analyze experimental results.

3. Further unlocking its application potential

In addition to the lack of mature large-scale production techniques, one of the factors that currently limits the application of these conductive biological nanomaterials is that their conductivity is still at a disadvantage compared to traditional inorganic nanomaterials such as carbon nanotubes and silicon nanotubes [1]. However, by improving the conductivity of e-pili, we can further unlock the application potential of this green biogenic nanomaterial.

After clarifying our objectives, we extensively reviewed existing literature but found no studies related to optimizing the conductivity of e-pili.

An idea came to our minds: since no one has done it before, let's give it a try.

Theoretical basis

Within e-pili, there is a high density of aromatic amino acids (Fig1). Current research suggests that the conductivity of e-pili mainly originates from the overlapping π-π orbitals of the aromatic rings of aromatic amino acid residues, which are brought close together by the unique structure of the pili [2]. Numerous studies based on this theoretical foundation have made significant progress [1,3,4].

Further consideration suggested that not only the abundance of aromatic amino acids, but also their position and spacing in the pilin could be important [5]. For example, there is a stretch of 53 amino acids without an aromatic amino acid within the G. uraniireducens pilin. This aromatic-free gap might prevent close-packing of aromatic amino acids within the assembled pili. Heterologous expression in G. sulfurreducens of other type IV pilins with high aromatic amino acid abundance and smaller aromatic-free gaps yielded e-pili that appeared to be as conductive as wild-type G. sulfurreducens e-pili and functioned well in extracellular electron transfer [5].

This means that if each aromatic amino acid in a sequence within the e-pili can form π-π interactions with adjacent aromatic amino acids, an effective conductive pathway is likely to form. At this point, the distance between aromatic amino acids and the rate of electron transfer exhibit a negative correlation.

However, when the external pH changes or other molecules are adsorbed onto the surface of e-pili, the orientation and spacing of the aromatic rings of the aromatic amino acids on the surface may change due to alterations in surface charge distribution. Consequently, this leads to changes in the conductivity of e-pili.

Fig1. Schematic illustration of the principle of conductivity changes in e-pili.

Conductivity optimization model

To establish an optimization model for the conductivity of e-pili, we need to consider three basic elements of optimization separately: Decision variables, Objective function, and Constraints.

Decision variables

The decision variables are factors that decision-makers can control.
In this model, the variables we can control are the amino acid sequences of the e-pili. By changing the types and positions of amino acids in the sequence, we can attempt to alter the overall conductivity of the pili. The most critical indicator affecting pili conductivity is the spacing of Π residues (aromatic amino acids capable of producing Π-Π interactions). There are two methods to modify this spacing:

1. Altering the quantity and position of aromatic amino acids within e-pili.

Fig2. Alterations in the quantity and position of aromatic amino acids within the e-pili.

The quantity and positioning of aromatic amino acids within the e-pili are important factors determining their conductivity. For example, compared with wild-type G. sulfurreducens e-pili, the mutant variant Y27A exhibits less than one-fifth of the conductivity. This is because in Y27A, tyrosine, an aromatic amino acid, is replaced by alanine, a non-aromatic amino acid, which increases the spacing between the Π residues in that region. This alteration impedes the transfer of electrons, resulting in reduced conductivity.

2. Modifying the type of amino acids to alter the overall hydrophilicity and hydrophobicity of the e-pili:

Fig3. Impact of amino acid type alteration on the electrical conductivity of e-pili.

In addition, the hydrophobicity of amino acids in the e-pili can also influence the overall structure and the spacing of Π residues within the e-pili. For instance, in the mutant W51W57, replacing the phenylalanine and tyrosine at the C-terminus of the pilin with tryptophan enhances the hydrophobicity of the pili and reduces their diameter. This results in a more compact structure and a smaller spacing between internal Π residues. As a result, the conductivity of W51W57 has been shown to be approximately 2000 times higher than that of the wild type in some studies [6].

Due to the complexity of the structural model for type IV pili assembly and the lack of suitable methods to predict the impact of hydrophobicity changes in individual pili subunits on the overall assembled pili structure, our focus in this model is primarily on method one: understanding the impact of introducing new aromatic amino acids or altering the positions of existing aromatic amino acids on the pili conductivity.

Furthermore, it's worth noting that the assembly of e-pili requires a complex and conserved type IV pilus assembly system. This assembly system has certain requirements for the conservation of amino acid sequences within the pili[7]. Modifying too many amino acids may not only result in the collapse of the original pili structure but also prevent proper assembly within cells.

This implies that our algorithm needs to find the optimal modification strategy under the constraint of minimizing the number of modifications required.

Objective function

The objective function is a function used to describe the goals pursued by decision-makers.

In our model, our ultimate goal is to obtain e-pili with higher conductivity. The conductivity of the e-pili is closely related to the distances between the centroids of the aromatic rings of the aromatic amino acid residues.

Based on this theory, we have designed the following two objective functions:

1. Average distance between the centroids of the aromatic rings of Π residues on the shortest electron transfer chain in e-pili:

Fig4. Schematic representation of the shortest electron transfer chain.

This objective function is used to describe the maximum conductivity of conductive pili under ideal conditions.

We can assume that in an ideal state, electrons will move along the shortest propagation path composed of Π residues in e-pili. At this point, the conductive pili have the highest conductivity possible within this structure.

This function describes the upper limit of the conductivity of pili under given structural conditions. Optimization based on this objective function may help us obtain pili with higher conductivity.

2. Average length between neighboring nodes of the Minimum Spanning Tree for all Π residues.

Fig5. Schematic illustration of the minimum aromatic amino acid node spanning tree.

This objective function is used to describe the overall conductivity of e-pili under the influence of external environmental factors.

Under the influence of environmental factors such as pH changes or surface adsorption, other aromatic amino acids in e-pili, beyond the original shortest electron propagation path, may also participate in the electron transfer process as the structure of the e-pili changes.

This objective function describes the spacing between all Π residues inside the pili, reflecting the overall conductivity of the pili under different environmental influences. Optimization based on this objective function may improve the conductivity performance of our pili under different environmental conditions and make the conductivity curve smoother.

Constraints

Constraints are the limitations that decision variables must satisfy.

In this model, the constraints are the possible coordinates of the centroid of aromatic rings in aromatic amino acids. All decision variables (the added aromatic amino acids) can only occur at specific positions determined by the structure of e-pili themselves.

Since we only focus on the changes in conductivity caused by the quantity and position of aromatic amino acids, we can initially assume that adding or modifying a small number of aromatic amino acids has minimal impact on the overall structure of the e-pili. This means that we do not need to rebuild the molecular dynamics model for each iteration. Instead, we can use the spatial coordinate information of the original structure to roughly determine the range of the centroid coordinates for the new aromatic rings.

In our model, we approximate the centroid coordinates of the newly added aromatic ring by using the average coordinates of all carbon atoms in the side chain at the modified position.
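The approximation above can be sketched as follows. This is a minimal illustration, not the team's code: the function name, the `(element, x, y, z)` tuple layout and the coordinates are all hypothetical.

```python
from statistics import fmean


def approximate_ring_centroid(side_chain_atoms):
    """Approximate the centroid of a new aromatic ring by the mean position
    of the carbon atoms in the side chain currently at that position.

    side_chain_atoms: list of (element, x, y, z) tuples (hypothetical layout).
    """
    carbons = [(x, y, z) for element, x, y, z in side_chain_atoms if element == "C"]
    if not carbons:
        raise ValueError("side chain has no carbon atoms")
    # zip(*carbons) regroups the coordinates per axis: all x, all y, all z.
    return tuple(fmean(axis) for axis in zip(*carbons))


# Made-up side chain with two carbons and one oxygen (coordinates in Å).
example_side_chain = [("C", 1.0, 2.0, 3.0), ("O", 1.5, 3.0, 3.5), ("C", 3.0, 2.0, 5.0)]
print(approximate_ring_centroid(example_side_chain))  # (2.0, 2.0, 4.0)
```

Residues without side-chain carbons (glycine) cannot be handled this way, which is why the sketch raises an error for them.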

Since the purpose of constructing this model is to discover potential mutation sites, we can combine the semi-rational directed evolution method to screen and obtain the actual optimal results. Therefore, the accuracy requirement for this model is not as strict as that of rational protein design.

Model Overview

Molecular Dynamics Simulations

Common methods for obtaining pili protein structures include nuclear magnetic resonance (NMR), X-ray crystallography and homology modeling. These methods suffer from problems such as static snapshots, signal interference and atomic position clashes. In addition, applying helical symmetric assembly directly to pili monomers easily overlooks the flexibility of the subunits. Therefore, we first used Amber23 for molecular dynamics (MD) optimization of the subunits, and then used Rosetta for helical symmetric assembly of the optimized subunits with fuzzy constraints. To eliminate the effects of rigid docking, we ran an MD simulation (Amber23) again on the assembled pili to obtain the best pili polymer structure. Finally, PyMOL was used to further verify the obtained pili structure, including the distances and the electron transport chain. Using MD to optimize the assembly process has a clear advantage: the protein is simulated in a realistic biological environment and dynamic effects are taken into account. MD simulation can thus be used both to optimize the structure of pili proteins and to annotate structural details.

Optimization Design

The determination of the e-pili structure implies the establishment of constraints in our optimization model. Next, we will design corresponding optimization functions for the optimization objectives we previously proposed and use algorithms to optimize each of them separately.

Graph structure construction

We create a data class for pi-residues to store all their information, taking the average of the x, y, and z coordinates of all the carbon atoms in each pi-residue as its centroid in three-dimensional space. We then construct a graph with each pi-residue as a vertex and derive the adjacency and distance matrices from the centroid coordinates. We set a cutoff k so that only pi-residues separated by a distance less than k are considered capable of transferring electrons; for all other pairs, we set the adjacency value to 0 and the distance to infinity.
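A minimal sketch of this graph construction is shown below. The function name `build_graph`, the plain nested-list matrices and the example coordinates are illustrative assumptions; setting the diagonal distance to 0 is also a choice made here, not something stated in the text.

```python
import math


def build_graph(centroids, k):
    """Build adjacency and distance matrices from pi-residue centroids.

    Pairs closer than the cutoff k are connected; every other pair gets
    adjacency 0 and distance infinity, as described above.
    """
    n = len(centroids)
    adjacency = [[0] * n for _ in range(n)]
    distance = [[math.inf] * n for _ in range(n)]
    for i in range(n):
        distance[i][i] = 0.0  # assumption: a node is at distance 0 from itself
        for j in range(i + 1, n):
            d = math.dist(centroids[i], centroids[j])
            if d < k:
                adjacency[i][j] = adjacency[j][i] = 1
                distance[i][j] = distance[j][i] = d
    return adjacency, distance


# Made-up centroids (Å): the first two are 5 Å apart, the third is far away.
example_centroids = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0), (3.0, 4.0, 12.0)]
example_adjacency, example_distance = build_graph(example_centroids, k=10.0)
print(example_adjacency)
```

With a cutoff of 10 Å, only the first pair is connected in this toy input; the third centroid stays isolated.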

For Objective function One

Our first optimization target is the average distance between the centroids of Π-residue aromatic rings on the shortest electron transfer chain in e-pili. This optimization target is used to describe the highest electrical conductivity of conductive pili under ideal conditions. To obtain the shortest electron transfer chain within the e-pili, we can view the network composed of Π-residue (aromatic amino acids capable of forming π-π interactions) nodes as a map and use an algorithm for identifying the shortest paths between nodes in the map to explore possible electron transfer pathways within a series of models.

The Dijkstra algorithm is a commonly used graph search and shortest path algorithm in computer science and mathematics. This algorithm aims to find the shortest paths between a starting node (or vertex) and all other nodes in a weighted graph, where each edge has an associated numerical weight or cost.

Based on the Dijkstra algorithm, we have designed an algorithm that can search for possible shortest electron transfer chains within the e-pili. This algorithm consists of three steps: (1) identifying Π-residues; (2) extracting the coordinates of Π-residues; (3) calculating the distances between Π-residues, then searching and comparing to determine the shortest path.

Inputs:

Source node, Target node, Distance matrix, Adjacency matrix

Initialization:

Distance array: An array of distances from the source node, initialized as follows:

Set the distance from the source node to itself (dist(source)) to 0.

Set the distance from all other nodes (v) to infinity (∞) for nodes v not equal to the source node.

Q: A queue of all nodes in the graph.

S: An empty set to indicate which nodes the algorithm has visited.

Proceeding:

While queue Q is not empty, follow these steps:

Step1:

Pop the node v from the queue Q that is not already in set S and has the smallest dist(v) value.

Step2:

Add node v to set S to indicate that it has been visited.

Step3:

Update the dist values of adjacent nodes of the current node v as follows:

For each adjacent node u that has not yet been visited, if dist(v) + Distance_matrix[u, v] < dist(u), then update dist(u) to the new minimal value.

The algorithm stops after the target node is visited.
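The steps above can be sketched in Python as follows. This is an illustrative implementation under assumptions, not the team's code: it uses a binary heap in place of the queue Q, keeps a predecessor list so the chain itself can be recovered, and the names `shortest_chain` and `mean_step` and the example matrix are hypothetical.

```python
import heapq
import math


def shortest_chain(distance, source, target):
    """Dijkstra search on a distance matrix (math.inf marks 'no edge')."""
    n = len(distance)
    dist = [math.inf] * n
    prev = [None] * n
    dist[source] = 0.0
    visited = set()
    heap = [(0.0, source)]
    while heap:
        d_v, v = heapq.heappop(heap)      # Step 1: closest unvisited node
        if v in visited:
            continue                      # stale heap entry
        visited.add(v)                    # Step 2: mark as visited
        if v == target:
            break                         # stop once the target is visited
        for u in range(n):                # Step 3: relax neighbours of v
            w = distance[v][u]
            if u in visited or math.isinf(w):
                continue
            if d_v + w < dist[u]:
                dist[u] = d_v + w
                prev[u] = v
                heapq.heappush(heap, (dist[u], u))
    if math.isinf(dist[target]):
        return None, math.inf             # target not reachable
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    path.reverse()
    return path, dist[target]


def mean_step(path, total):
    """Objective one: average centroid distance along the chain."""
    return total / (len(path) - 1) if len(path) > 1 else 0.0


INF = math.inf
example_matrix = [            # made-up distances in Å
    [0.0, 4.0, 10.0, INF],
    [4.0, 0.0, 5.0, INF],
    [10.0, 5.0, 0.0, 3.0],
    [INF, INF, 3.0, 0.0],
]
example_path, example_total = shortest_chain(example_matrix, 0, 3)
print(example_path, example_total, mean_step(example_path, example_total))
# [0, 1, 2, 3] 12.0 4.0
```

In the toy matrix the direct edge from node 0 to node 2 (10 Å) loses to the two-step route through node 1 (4 Å + 5 Å), so the chain passes through all four nodes.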

For Objective function Two

Our second optimization objective is the average distance between adjacent nodes in the minimum spanning tree formed by all Π-residue nodes. This optimization objective is used to describe the comprehensive conductivity of e-pili under the influence of the external environment. Similar to the approach for the first optimization objective, we only need to find the minimum spanning tree within the map composed of Π-residue nodes; we then obtain the required value by summing the weights of the edges in this tree and dividing the sum by the number of edges, which is the number of nodes minus one.

Prim's Algorithm

We use Prim's algorithm here to solve this problem. Prim's algorithm is a well-known algorithm used in computer science and graph theory to find the minimum spanning tree of a connected, undirected graph with weighted edges. The algorithm starts with an initial node and then grows the minimum spanning tree one edge at a time until all nodes are included while minimizing the total edge weight.

Inputs:

Distance matrix

Initialization:

An arbitrary starting node.

MST: Initialize an empty set to represent the Minimum Spanning Tree (MST).

Key array: Create an array to keep track of the minimum edge weight from each node to the MST. Initialize all values with infinity except for the starting node, which is set to 0.

Visited array: Create an array to keep track of visited nodes. Initialize all values to false.

Proceeding:

While the MST set does not include all nodes, follow these steps:

Step1:

Select a node (u) that is not yet in the MST but has the minimum key value.

Step2:

Add node (u) to the MST set.

Step3:

Update key values: For each node v adjacent to the newly added node u, if Distance_matrix[u,v] < Key(v), update the Key(v) with the new weight.

Note:

As a tree structure satisfies:

Number of edges = Number of nodes - 1

The final objective function is obtained from:

(Total length of the MST) / (Number of nodes - 1).
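A compact sketch of this procedure, written directly against a distance matrix, might look like the following. The function name `mst_mean_edge` and the example matrix are hypothetical, and the simple O(n²) node selection is chosen for readability rather than speed.

```python
import math


def mst_mean_edge(distance):
    """Prim's algorithm on a distance matrix; returns the MST's mean edge length."""
    n = len(distance)
    if n < 2:
        raise ValueError("need at least two nodes")
    key = [math.inf] * n        # cheapest known edge from each node to the tree
    in_mst = [False] * n        # visited array
    key[0] = 0.0                # arbitrary starting node
    total = 0.0
    for _ in range(n):
        # Step 1: pick the node outside the tree with the smallest key.
        u = min((v for v in range(n) if not in_mst[v]), key=lambda v: key[v])
        if math.isinf(key[u]):
            raise ValueError("graph is not connected")
        # Step 2: add it to the tree.
        in_mst[u] = True
        total += key[u]
        # Step 3: update the keys of nodes adjacent to u.
        for v in range(n):
            if not in_mst[v] and distance[u][v] < key[v]:
                key[v] = distance[u][v]
    # A tree with n nodes has n - 1 edges.
    return total / (n - 1)


INF = math.inf
example_matrix = [            # made-up distances in Å
    [0.0, 4.0, 10.0, INF],
    [4.0, 0.0, 5.0, INF],
    [10.0, 5.0, 0.0, 3.0],
    [INF, INF, 3.0, 0.0],
]
print(mst_mean_edge(example_matrix))  # 4.0
```

The toy tree uses the edges of length 4, 5 and 3 Å, giving a total of 12 Å over three edges. If the cutoff k leaves the graph disconnected, no spanning tree exists, so the sketch raises an error instead of returning a value.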

Particle Swarm Optimization

Particle Swarm Optimization (PSO) is an optimization algorithm based on swarm intelligence, which simulates the behavior of biological swarms, such as flocks of birds or schools of fish, in their search for food or resources.

PSO can be described as follows: Suppose the search space is L-dimensional and there are N particles in the population. The ith particle in the population can be represented as an L-dimensional vector $X_i=(x_{i1},x_{i2},\cdots , x_{iL})$, and the best position it has experienced is denoted as $P_{best}=(p_{1},p_{2},\cdots , p_{L})$. Each position of the particle represents a potential solution to the problem, which is substituted into the objective function to obtain its fitness value. The best position found so far by the whole population is denoted as $G_{best}=(g_{1},g_{2},\cdots , g_{L})$.

Next, we apply the following update to each particle in the population: $$V_i^{(t+1)} = \omega V_i^{(t)} + c_1r_1(P_{best}^{(t)} - X_i^{(t)}) + c_2r_2(G_{best}^{(t)} - X_i^{(t)})$$ $$X_i^{(t+1)} = X_i^{(t)}+V_i^{(t+1)}$$

The above formula consists of three parts. The first part is the inertia term: the current velocity of the particle scaled by the positive inertia weight $\omega$. The second part is the cognitive component, which reflects the particle's knowledge of its own best position ($c_1$ is the self-recognition factor). The third part is the social component, which reflects the sharing of information between particles across generations ($c_2$ is the social-recognition factor). $r_1$ and $r_2$ are random numbers drawn uniformly from [0, 1].
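One update step for a single particle can be sketched as below. The function name `pso_step` is hypothetical, and drawing one scalar $r_1$ and one scalar $r_2$ per particle follows the formula as written (many implementations draw them per dimension instead).

```python
import random


def pso_step(x, v, p_best, g_best, omega, c1, c2, rng=random):
    """Return the new (position, velocity) of one particle."""
    r1, r2 = rng.random(), rng.random()   # uniform random numbers in [0, 1)
    new_v = [
        omega * vi + c1 * r1 * (pb - xi) + c2 * r2 * (gb - xi)
        for xi, vi, pb, gb in zip(x, v, p_best, g_best)
    ]
    new_x = [xi + nvi for xi, nvi in zip(x, new_v)]
    return new_x, new_v


# A particle sitting on both its personal best and the global best only
# keeps the inertia term, so the update is deterministic here.
example_x, example_v = pso_step(
    x=[0.5, 0.5], v=[0.1, -0.2],
    p_best=[0.5, 0.5], g_best=[0.5, 0.5],
    omega=0.5, c1=2.0, c2=2.0,
)
print(example_x, example_v)
```

The sketch does not clip positions to a search interval; a bounded search would need that as an extra step.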

Discrete Mapping

In the optimization process, we defined the independent variable as the sequence numbers of the regular residues in a monomer that are to be changed to pi-residues. Suppose there are $N$ non-pi residues in each monomer and $d$ of them are to be changed to pi-residues; then our independent variable $X$ can be denoted as \[X=(x_1,x_2,\cdots, x_d), x_i \in \{1,2,\cdots, N\}\] Before performing the optimization, we sorted the residues of each monomer according to their L1 norm from the origin, so that residues with similar sequence numbers are also close to each other in space and the sequence numbers are consistent with the geometrical positions. However, the adaptive PSO (APSO) we use still handles only continuous problems, so we need to discretize the continuous particles $Y=(y_1,y_2,\cdots, y_d), y_i \in [0,1]$ to obtain the sequence numbers of residues in each monomer. We keep the optimization continuous on the interval [0, 1] and divide [0, 1] into $N$ equally spaced sub-intervals of length $\frac{1}{N}$. We can then map $Y$ to $X$ component by component by letting $x_i=[y_i \times N]$, that is, keeping the integer part of $Ny_i$.
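The mapping from continuous particles to residue indices can be sketched as follows. Two choices here are assumptions rather than something stated in the text: indices are 0-based (positions in the list of non-aromatic residues), and the boundary value $y_i = 1$ is clipped into the last sub-interval. The function name is hypothetical.

```python
def to_residue_indices(y, n_sites):
    """Map each y_i in [0, 1] to the index of the sub-interval it falls in."""
    return [min(int(yi * n_sites), n_sites - 1) for yi in y]


# With N = 50 candidate sites, each sub-interval has length 1/50.
example_indices = to_residue_indices([0.0, 0.27, 0.999, 1.0], n_sites=50)
print(example_indices)  # [0, 13, 49, 49]
```

Two components of a particle can map to the same site; how such duplicates are handled is not specified in the text, so the sketch leaves them as they are.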

Adaptive PSO (APSO) makes three main improvements over PSO:

1. Evolutionary State Estimation

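ESE calculates the distributional information of the population of each generation in the space of independent variables. We first calculate the average Euclidean distance $d_i$ from particle $i$ to all the other particles, for a population of $N$ particles in an $L$-dimensional search space: \[d_i=\frac{1}{N-1}\sum_{j=1,j\neq i}^{N}\sqrt{\sum_{k=1}^{L}(x_{ik}-x_{jk})^2}\]

Accordingly, we define the Evolutionary Factor $f$ that measures the optimizing state of the population, where $d_g$ is the $d_i$ of the globally best particle and $d_{max}$ and $d_{min}$ are the largest and smallest $d_i$ in the population: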

\[f=\frac{d_g-d_{min}}{d_{max}-d_{min}} \in [0,1]\]

The Evolutionary factor we calculated above can help to decide which evolutionary state the population is in. The different evolutionary states of the population affect our different adaptive control strategies for the parameters (here we are referring to $\omega, c_1, c_2$). We apply the obtained $f$ to define four different evolutionary states $S_1,S_2,S_3,S_4$ and their fuzzy membership function, representing the states of exploration, exploitation, convergence, and jumping out respectively.
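The evolutionary factor can be computed as in the sketch below. It takes $d_g$ to be the mean distance of the globally best particle and $d_{min}$, $d_{max}$ to be the smallest and largest mean distances in the population, which is the usual APSO convention and is assumed here. The function name and the toy positions are hypothetical, and returning 0 for a fully collapsed swarm is a choice made to avoid dividing by zero.

```python
import math


def evolutionary_factor(positions, best_index):
    """f = (d_g - d_min) / (d_max - d_min), computed from mean pairwise distances."""
    n = len(positions)
    mean_dist = [
        sum(math.dist(positions[i], positions[j]) for j in range(n) if j != i) / (n - 1)
        for i in range(n)
    ]
    d_g = mean_dist[best_index]
    d_min, d_max = min(mean_dist), max(mean_dist)
    if d_max == d_min:
        return 0.0  # assumption: all particles equally spread, treat as converged
    return (d_g - d_min) / (d_max - d_min)


# Toy one-dimensional swarm: three particles clustered, one far away.
example_swarm = [(0.0,), (1.0,), (2.0,), (10.0,)]
print(evolutionary_factor(example_swarm, best_index=1))  # best inside the cluster
print(evolutionary_factor(example_swarm, best_index=3))  # best far from the cluster
```

When the best particle sits inside the cluster, f is small; when the best particle is the outlier, f is 1. The fuzzy membership functions that turn f into one of the four states are not reproduced here.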

2. Self-adaptive parameter control strategy

We use a sigmoid mapping to allow $ \omega $ to change with the Evolutionary factor $ f $:

\[ \omega(f)=\frac{1}{1+1.5e^{-2.6f}} \in [0.4,0.9] \]

We know $ \omega(f) $ is a monotonically increasing function. Therefore, a larger $ f $ in $ S_1 $ and $ S_4 $ will result in a larger value of $ \omega $, which facilitates a global search. On the contrary, $ S_2 $ and $ S_3 $ are more suitable for a local search.
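As a quick numerical check of the mapping above, the following sketch evaluates $\omega(f)$ at the two ends of the interval and in the middle. Only the function name is an addition.

```python
import math


def inertia_weight(f):
    """Sigmoid mapping from the evolutionary factor f in [0, 1] to omega."""
    return 1.0 / (1.0 + 1.5 * math.exp(-2.6 * f))


for f in (0.0, 0.5, 1.0):
    print(f, round(inertia_weight(f), 3))
```

At f = 0 the denominator is 1 + 1.5 = 2.5, giving exactly 0.4, and at f = 1 the value is just under 0.9, which matches the stated range.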

$ c_1 $ represents self-recognition factor, which pulls the particles to their own historical optimal position; $ c_2 $ represents social-recognition factor, which pushes the population to converge to the current global optimal region and helps fast convergence. These coefficients are initialized to 2.0 and are adaptively controlled according to the evolutionary state with the following strategies:

(1) $ S_1 $ - Increase $ c_1 $ and decrease $ c_2 $ in the exploration state, which helps particles to explore individually and achieve their own historical optimal positions, rather than crowding around the current best particles that may be associated with the local optimum.

(2) $ S_2 $ - Slightly increase $ c_1 $ and slightly decrease $ c_2 $ in the exploitation state.

(3) $ S_3 $ - Slightly increase $ c_1 $ and slightly increase $ c_2 $ in the convergence state.

(4) $ S_4 $ - Decrease $ c_1 $ and increase $ c_2 $ in the jumping-out state.
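The four strategies can be sketched as a small lookup table. The text does not give the step size or any bounds, so the numbers used here are assumptions labeled as such: a step `delta` drawn from [0.05, 0.1], half of that for a "slight" change, each coefficient kept within [1.5, 2.5], and the pair rescaled if their sum exceeds 4.0.

```python
import random

# Direction of change for (c1, c2) in each state; 0.5 marks a "slight" change.
STRATEGY = {
    "S1": (+1.0, -1.0),   # exploration
    "S2": (+0.5, -0.5),   # exploitation
    "S3": (+0.5, +0.5),   # convergence
    "S4": (-1.0, +1.0),   # jumping out
}


def adapt_coefficients(c1, c2, state, delta=None, rng=random):
    """Adjust c1 and c2 according to the evolutionary state."""
    if delta is None:
        delta = rng.uniform(0.05, 0.1)          # assumed step-size range
    s1, s2 = STRATEGY[state]
    c1 = min(max(c1 + s1 * delta, 1.5), 2.5)    # assumed bounds
    c2 = min(max(c2 + s2 * delta, 1.5), 2.5)
    total = c1 + c2
    if total > 4.0:                             # assumed cap on the sum
        c1, c2 = 4.0 * c1 / total, 4.0 * c2 / total
    return c1, c2


print(adapt_coefficients(2.0, 2.0, "S1", delta=0.1))
print(adapt_coefficients(2.0, 2.0, "S3", delta=0.1))
```

Starting from the initial value of 2.0 for both coefficients, the exploration state shifts weight from $c_2$ to $c_1$, while the convergence state raises both and is then pulled back by the cap on the sum.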

3. Elitist Learning Strategy

The jumping-out effect that the ELS mechanism applies to $ G_{best} $ is necessary for the global optimality of this algorithm. ELS randomly selects one dimension $d$ of $ G_{best} $, whose component is denoted $ g_d $, and perturbs it:

\[ g_d = g_d + (X_d^{max} - X_d^{min})\cdot Gaussian(\mu,\sigma^2) \]

APSO's adaptive tuning of parameters can improve the performance of the PSO algorithm in terms of both accelerated convergence and global search. In addition, its ELS strategy will maintain the diversity of the population in order to allow the algorithm to jump out of potential local optima.
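The ELS perturbation can be sketched as follows. The function name is hypothetical, and three details are assumptions: the Gaussian mean is taken as 0, `sigma` is passed in by the caller, and the perturbed component is clipped back into the search bounds.

```python
import random


def elitist_learning(g_best, lower, upper, sigma, rng=random):
    """Return a copy of g_best with one randomly chosen dimension perturbed."""
    candidate = list(g_best)
    d = rng.randrange(len(candidate))                     # random dimension
    candidate[d] += (upper[d] - lower[d]) * rng.gauss(0.0, sigma)
    candidate[d] = min(max(candidate[d], lower[d]), upper[d])  # assumed clipping
    return candidate


example_rng = random.Random(0)
example_g_best = [0.2, 0.5, 0.8]
example_candidate = elitist_learning(
    example_g_best, lower=[0.0] * 3, upper=[1.0] * 3, sigma=0.1, rng=example_rng
)
print(example_candidate)
```

A caller would typically evaluate the candidate with the objective function and keep it as the new $G_{best}$ only if it scores better.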

Optimization Results

We can observe that adding new aromatic amino acids can significantly optimize both our objective functions one and two.

In objective function one, we found that replacing the threonine at position 2 of the original amino acid sequence of the e-pili with an aromatic amino acid can reduce the average distance in objective one from 4.72 Å to 4.63 Å. Furthermore, we found that adding more aromatic amino acids on top of this does not further reduce this distance. For objective one, we can therefore consider that the electrical conductivity of this mutant has reached its optimal point.

To verify the accuracy of the results, we attempted to iterate through all possible spacing scenarios for adding 1, 2, and 3 pi-residues (Fig. 7) and found that only the modification at position 2 changes the original shortest electron transfer pathway, and adding more aromatic amino acids does not alter this result.
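Such an exhaustive check can be sketched with `itertools.combinations`. The function name and the `objective` callable are hypothetical; the toy objective below simply hard-codes the two values reported above (4.72 Å without and 4.63 Å with a change at list index 0) and stands in for the real structure-based calculation.

```python
from itertools import combinations


def exhaustive_search(n_sites, max_added, objective):
    """Evaluate every way of choosing 1..max_added sites and keep the best."""
    best_sites, best_value = (), objective(())
    for d in range(1, max_added + 1):
        for sites in combinations(range(n_sites), d):
            value = objective(sites)
            if value < best_value:            # strict: ties keep the smaller set
                best_sites, best_value = sites, value
    return best_sites, best_value


def toy_objective(sites):
    """Stand-in objective: only list index 0 shortens the chain."""
    return 4.63 if 0 in sites else 4.72


print(exhaustive_search(n_sites=5, max_added=3, objective=toy_objective))
# ((0,), 4.63)
```

Because the comparison is strict, larger sets that include index 0 but bring no further improvement do not replace the single-site result.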

In objective function two, the objective function describes the distribution density of aromatic rings within the e-pili, so adding only 1 or 2 pi-residues cannot yield an optimal result. Therefore, we considered adding three or more Π-residues, as shown in Fig. 6. Adding 5 aromatic amino acids can reduce the original spacing from 7.6 Å to less than 5.5 Å.

Fig.6 Optimization Results.

Fig.7 Traversal results after adding 1, 2, and 3 Π-residues (where the x-axis represents the index of the list composed of non-aromatic amino acids, for example, list[0] corresponds to the residue at position 2).

Model Evaluation

Based on the optimization results from the model described above, we reconstructed the structural models for the single-point mutant T2F and for the mutants carrying A31F, Q53F, and N59F to validate our conclusions:

In optimization objective one, we replaced the threonine at position 2 of the wild-type Geobacter metallireducens pilin amino acid sequence with phenylalanine. We show only the shortest electron transfer path in the pilus structure obtained by the Dijkstra algorithm (the original aromatic rings are marked in red, and the aromatic rings newly added in mutant T2F are marked in yellow). Several gaps appear in the electron transfer chain obtained by the algorithm because the pilin monomers that would provide the aromatic rings for these gaps are not in our model. We measured the distances between the aromatic amino acids on the electron transfer chains inside the wild-type pilus and the mutant pilus. In the wild type, the electron transfer chain is (F1p, Y27p+4, F24p+4): electrons are transferred from position 1 of the first monomer to positions 27 and 24 of the fifth monomer, and finally to position 1 of the second monomer to complete the cycle. The centroid distances between adjacent aromatic rings are (4.6 Å, 5.5 Å, 4.1 Å). In the mutant, according to our model prediction, the electron transfer chain would change to (F2p, Y27p+4, F24p+4). However, the centroid distances between the aromatic rings in the reconstructed electron transfer chain of this mutant, (7.1 Å, 5.5 Å, 9.5 Å), are clearly larger than those of the wild type.

Fig.8 The comparison in electron transport chain between the WT (yellow) and the T2F (orange)

This situation arose because, when determining the constraint conditions of the decision variables in the optimization model, to simplify the model, we used the average coordinates of all carbon atoms on the side chain of the modified position to approximately predict the centroid coordinates of the newly added aromatic ring. The error caused by simplifying the model ultimately led to incorrect optimization results.

This result suggests that objective function one cannot be improved further by adding new aromatic amino acids to the original amino acid sequence. Subsequent teams may try to optimize it by changing the type of aromatic amino acids to affect the overall hydrophilicity and hydrophobicity of the pilus.

Fig.9 The comparison in electron transport chain between the WT (yellow) and the T2F (orange)

In objective function two, we added 3, 4, and 5 phenylalanine residues, respectively, to the original structure at the positions calculated by our optimization algorithm. We can see that the introduction of aromatic amino acids makes the distribution of aromatic amino acids in the original pili structure considerably more uniform, causing the core of electron transfer to spread outward from the pili's central axis. This means that when the structure of conductive pili changes under the influence of different environmental factors, more aromatic rings are involved in the electron transfer process, potentially leading to an increase in the overall electrical conductivity of the pili.

Fig.10 Optimization results of objective function 2

Semi-rational Directed Evolution

To reduce the computational workload in modeling, a common practice is to simplify the model by introducing assumptions. For example, when determining the constraints for the optimization, we approximate the centroid coordinates of the newly added aromatic ring by the average coordinates of all carbon atoms in the side chain at the modified position. However, introducing assumptions means that our model cannot achieve the level of precision required in rational protein design. We can only identify several potential mutation sites, and we cannot guarantee that the final structure of the pili and the internal electron transfer will conform to our design.

This situation is quite common in protein structure optimization, and the most common solution to address this is to use a non-rational (high-throughput screening) approach to compensate for the shortcomings of rational design.

Directed evolution simulates the process of Darwinian evolution in the laboratory by creating a large number of mutants through random mutation and recombination. Specific selection pressures are applied for desired features, allowing the selection of proteins with desired characteristics, achieving molecular-level simulated evolution.

Among the various methods of directed evolution, semi-rational directed evolution is the most widely applied with the most successful cases. This method aims to mutate proteins based on some understanding of their physicochemical properties, three-dimensional structure, structure-activity relationships, catalytic mechanisms, etc. With the assistance of computers, it then uses a reasonable high-throughput screening method to quickly obtain the target mutants.

In our project, after identifying potential sites that may enhance the electrical conductivity of the pili, the next step is to construct a mutant library with these sites as hotspots and perform screening using the aforementioned methods. In our original plan, we intended to express and purify the mutants from the library one by one in engineered bacteria to measure their electrical conductivity. However, we quickly realized that this approach was too inefficient. Additionally, due to the relative conservation in the assembly of Type IV pili, a significant portion of mutant pili may fail to assemble correctly and lose their conductive function. Screening these pili would consume a considerable amount of effort.

We plan to leverage the original function of e-pili, using Geobacter sulfurreducens as a chassis organism and combining it with the improved H-type microbial fuel cell model developed by previous researchers [8,9]. This bioelectrochemical system will be used for the initial screening of pili with higher electrical conductivity by observing the maximum current generated by the strains in microbial fuel cells.

Fig11. The Microbial Fuel Cell Model

Future work

To validate our design, we still need to measure the electrical conductivity of individual pili. However, due to project timelines and the unavailability of the necessary equipment (low-noise nanoelectrode measurement platform and atomic force microscope [10]), it is regrettable that we won't be able to experimentally verify our modeling results this year.

We plan to continue our research after the competition ends, and we sincerely hope that future iGEM teams can refer to the construction approach of our model to design proteins that meet their project requirements.

Hardware Model

In our hardware design, to ensure the credibility of the data we obtain, we need to establish a model for assessment. As our system demands a high level of quantitative accuracy, we introduce corresponding control systems to assist operators in evaluating the stability and correctness of the system's operation, thus avoiding false positives or negatives.

In our system, since our hardware can read the change in material conductivity over a period of time, an intuitive approach is to use the curve of conductivity changes induced by known pH values of standard buffer solutions as a reference for system quality control and environmental factor correction.

Based on the above idea, we can easily establish the following mathematical model:

Fig12. The process flow chart of our verification model

Our model consists of three main steps:

1. Building a standard curve from the effect of solutions of known pH on the conductivity of the bacterial nanowires. Because the pH of the buffer solution added in advance is known, the curves obtained in different test batches should be relatively consistent. We can use the curve obtained in this step as the calibration standard.

2. Collecting data during the actual measurement phase. In this phase, multiple sampling points can be set for comparison with the standard curve.

3. Computing the sum of squared residuals between the sampling points and the standard curve, and assessing whether it falls within the specified confidence interval (at a 95% confidence level).

This approach helps ensure the reliability of the data and assists in quality control and environmental factor correction during the operation of the system.
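The three steps above can be sketched as follows. This is an illustrative reconstruction, not the contents of Hardware_model.py. It assumes a linear standard curve, roughly Gaussian measurement noise, and a chi-square criterion for the 95% check, and it ignores the uncertainty of the fitted parameters. All function names and numbers are hypothetical, with conductivity in arbitrary units.

```python
# 95% critical values of the chi-square distribution for 1..10 degrees of freedom.
CHI2_95 = {1: 3.841, 2: 5.991, 3: 7.815, 4: 9.488, 5: 11.070,
           6: 12.592, 7: 14.067, 8: 15.507, 9: 16.919, 10: 18.307}


def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def sum_squared_residuals(x, y, slope, intercept):
    """Sum of squared vertical distances between the points and the line."""
    return sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))


def run_is_consistent(x, y, slope, intercept, noise_variance):
    """Step 3: accept the run if SSR / variance is below the 95% chi-square
    critical value for len(x) sampling points (assumed criterion)."""
    ssr = sum_squared_residuals(x, y, slope, intercept)
    return ssr / noise_variance <= CHI2_95[len(x)]


# Step 1: standard curve from buffers of known pH (made-up readings).
cal_ph = [2.0, 4.0, 6.0, 8.0, 10.0]
cal_conductivity = [5.1, 8.9, 13.0, 17.1, 20.9]
slope, intercept = fit_line(cal_ph, cal_conductivity)
# Noise variance estimated from the calibration residuals (n - 2 dof).
noise_variance = sum_squared_residuals(
    cal_ph, cal_conductivity, slope, intercept) / (len(cal_ph) - 2)

# Step 2: sampling points collected during a measurement run (made-up).
run_ph = [3.0, 5.0, 7.0]
good_run = [7.1, 11.0, 14.9]
drifting_run = [8.0, 12.5, 16.5]

good_ok = run_is_consistent(run_ph, good_run, slope, intercept, noise_variance)
drift_ok = run_is_consistent(run_ph, drifting_run, slope, intercept, noise_variance)
print(good_ok, drift_ok)
```

A run that tracks the standard curve passes the check, while a run with a systematic offset is flagged so that the operator can recalibrate before trusting the measurement.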

If you are interested, you can click on the link to download the relevant code: Hardware_model.py

[1] Tan, Y., Adhikari, R.Y., Malvankar, N.S., Pi, S., Ward, J.E., Woodard, T.L., Nevin, K.P., Xia, Q., Tuominen, M.T. and Lovley, D.R. (2016), Synthetic Biological Protein Nanowires with High Conductivity. Small, 12: 4481-4485.

[2] Nikhil S. Malvankar, Madeline Vargas, Kelly Nevin, Pier-Luc Tremblay, Kenneth Evans-Lutterodt, Dmytro Nykypanchuk, Eric Martz, Mark T. Tuominen, and Derek R. Lovley. Structural Basis for Metallic-Like Conductivity in Microbial Nanowires. DOI: 10.1128/mbio.00084-15

[3] Toshiyuki Ueki, David J.F. Walker, Pier-Luc Tremblay, Kelly P. Nevin, Joy E. Ward, Trevor L. Woodard, Stephen S. Nonnenmann, and Derek R. Lovley. Decorating the Outer Surface of Microbially Produced Protein Nanowires with Peptides. ACS Synthetic Biology 2019, 8 (8), 1809-1817. DOI: 10.1021/acssynbio.9b00131

[4] Tan Y, Adhikari RY, Malvankar NS, Ward JE, Woodard TL, Nevin KP, Lovley DR. 2017. Expressing the Geobacter metallireducens PilA in Geobacter sulfurreducens yields pili with exceptional conductivity. mBio 8:e02203-16.

[5] Walker DJF, Adhikari RY, Holmes DE, Ward JE, Woodard TL, Nevin KP, Lovley DR. 2018. Electrically conductive pili from genes of phylogenetically diverse microorganisms. ISME J 12:48-58.

[6] Yang Tan, Ramesh Y. Adhikari, Nikhil S. Malvankar, Joy E. Ward, Trevor L. Woodard, Kelly P. Nevin, and Derek R. Lovley. Expressing the Geobacter metallireducens PilA in Geobacter sulfurreducens Yields Pili with Exceptional Conductivity.

[7] Campos M, Cisneros DA, Nivaskumar M, et al. The type II secretion system - a dynamic fiber assembly nanomachine. Res Microbiol. 2013, 164(6): 545-555.

[8] Nevin, K. P. et al. Anode biofilm transcriptomics reveals outer surface components essential for high density current production in Geobacter sulfurreducens fuel cells. PLoS One 4, e5628 (2009).

[9] Bond, D. R. & Lovley, D. R. Electricity Production by Geobacter sulfurreducens Attached to Electrodes. Appl. Environ. Microbiol. 69, 1548–1555 (2003).

[10] Ramesh Y. Adhikari, Nikhil S. Malvankar, Mark T. Tuominen, and Derek R. Lovley. Conductivity of individual Geobacter pili. DOI: 10.1039/C5RA28092C
