Engineering Success | TecMonterreyGDL

EryK-FRET Overview

Emerging contaminants in water pose a significant risk to long-term health (Pereira et al., 2015; Stuart et al., 2012). Current detection methods require specialized laboratory equipment and trained personnel, thus we aimed to develop a quick and easy fluorescence-based detection method for erythromycin. To generate a prototype, we went through iterations of the engineering design process to test and improve upon different aspects of the project such as transformation efficiency and protein overexpression yield.

For donor fluorescence to occur, the chromophores in the system must be close enough in space for FRET to be possible, allowing acceptor fluorescence to occur. With our ECFP-EryK-mVENUS fusion construct, if erythromycin is present, EryK will ideally undergo a conformational change that should alter the distance between the donor, ECFP, and the acceptor, mVENUS (Figure 1).

To achieve this, we started by first transforming Escherichia coli TOP10 as a maintenance strain and then E. coli BL21 with a plasmid encoding the 6xHis tagged ECFP-EryK-mVENUS fusion construct (Chan et al., 2013; Jeong et al., 2015). After expression, our construct will be purified which will produce a detectable fluorescent signal change when bound to erythromycin (Figure 2).

📐 Design

To assess the feasibility of our approach, protein structure predictions and molecular dynamics simulations were carried out to ensure the fluorophores of the fusion construct were within a functional FRET distance.

ECFP-EryK-mVENUS model

Using the ab initio structure prediction workflow and ColabFold (Mirdita et al., 2022), a predicted structure of the ECFP-EryK-mVENUS fusion construct was obtained (Figure 3). ColabFold produces 5 predictions, however only the one with the best predicted aligned error (PAE) will be considered, as we are mainly interested in the 3D arrangement of the three constituent proteins.

While the 3D fold of each constituent protein matches previously characterized structures (Montemiglio et al., 2013; Pletnev et al 2012; Park et al., 2016), the connector regions and geometrical relationship between each protein is predicted with low confidence. This is indicated by the PAE values (Figure 4).

It should be noted that ColabFold uses multiple sequence alignments and neural networks to predict a 3D structure, however it doesn't consider real-world biophysics and interactions with solvent, spontaneous changes between open and closed conformations, and other possible variables.

Molecular dynamics

To increase our confidence in the predicted structure of our construct, molecular dynamics (MD) simulations of the ColabFold model were performed to assess if the predicted structure obtained was maintained after biophysical interactions were introduced to the system.

To achieve this, a preliminary simulation was run with GROMACS using model 5 from Colabfold to assess whether the structure eventually converges over the course of the 10 ns simulation when including biophysical potentials. First, the topology was generated from the ColabFold model with the AMBER99SB-ILDN force field (Lindorff-Larsen et al., 2010) and a TIP3P (Jorgensen et al., 1983) water model. The model was put in a simulation box with a distance to the edge of 1.2 nm. The system was solvated in water and the charges were neutralized with Na+ ions. After this, the system was energy minimized to avoid problems that arise from disagreement between ColabFold's predicted structure and the energy minimum of our system according to the AMBER99SB-ILDN forcefield. To equilibrate the system, first a NVT simulation was run for 200 ps followed by a NPT simulation for 1 ns, both with 2 fs timesteps. The production MD was run for a total of 10 ns with 2 fs timesteps under NPT conditions (Figure 5). Simulations were performed according to previous work (Lemkul, J. A., 2019) and available documentation (Hess et al., 2008).

To ensure the reliability of the simulation, thermodynamic variables such as the temperature, pressure, density, and total energy of the system were monitored for convergence. It should also be noted that after 4 ns the root-mean-square deviation of the simulation stopped diverging from the energy-minimized ColabFold prediction. Interestingly, the most dynamic regions of our system (Figure 5) correspond to the regions that display the greatest conformational change when comparing X-ray structures of the open and closed conformations of EryK (Figure 6). A potential limitation of our in silico system is that the ionic strength may not match in vitro conditions thus altering the strength of electrostatic interactions (Zhou & Pang, 2018). Nevertheless, the simulation was promising and yields greater confidence in the predicted structure.

🏗️ Build

The original plasmid was made up of a pUC57 backbone and the insert for ECFP-EryK-mVENUS. As an initial step we digested and ligated the insert into a pBAD/HisB and pBAD/HisC backbone with XhoI and NcoI (New England Biolabs). Both the vector and the insert were digested with the enzymes (Figure 7A) and, after gel purification and ligation, transformed into E. coli TOP10 as a maintenance strain (Figure 7B).

Colony polymerase chain reaction (PCR) was performed to confirm successful transformation (Figure 8). Following this, miniprep was performed on 5 mL liquid cultures from the transformation palate, these were grown in LB and carbenicillin and incubated overnight at 37 ºC with agitation at 200 rpm. For miniprep the Wizard Plus SV Minipreps DNA Purification Systems (Promega) was used.

The subsequent PCR step was performed several times varying annealing temperatures, number of cycles, and elongation step duration. The backbone (Figure 9 A) and the insert (Figure 9 B) were successfully amplified. After this the fragments were ligated together using T4 DNA Ligase Reaction Buffer and T4 DNA Ligase (New England Biolabs).

The next step was to transform pET28b(+)/ECFP-EryK-mVENUS into E. coli BL21, an expression strain. This proved difficult as the process was repeated several times varying amounts of plasmid added and incubation times with no success. Some of the mistakes made include not using enough plasmid during the transformation, not letting the transformation reaction recover for enough time, or the temperature for heat shock being too high. Once optimized, the transformation was successful (Figure 10).

📊 Test and Optimization

Once the plasmid with the whole construct was successfully transformed into E. coli BL21, the cultures were induced with 0.4 mM isopropyl β-d-1-thiogalactopyranoside (IPTG) to stimulate protein overexpression (Studier, F. W., 2014). This was repeated several times with different temperatures and for different periods of time, the last attempt at 16 ºC for 12 hours yielded a faint band at the approximate expected molecular weight of our construct (~83 kDa) after sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) (Figure 11). Despite not seeing a clear band from the expression, we proceeded with the purification.

Purification was done through a Ni-NTA resin (Sigma-Aldrich) and all elutions were run on an SDS-PAGE gel to confirm the presence of our protein. Unfortunately, no protein was purified (Figure 12).

As we couldn't purify the complex, different tests were performed to assess viability of transformed E. coli in the presence of erythromycin. LB plates were made with different concentrations of erythromycin 0.2 μg mL^-1, 2.5 μg mL^-1, and 10 μg mL^-1. Liquid cultures were made of transformed E. coli BL21 and plated onto these petri dishes. This was done to determine if E. coli remained viable after exposure to the antibiotic. Once colony growth and fluorescence were observed (Figure 13), it was determined that there is a detectable level of fluorescence when exposed to erythromycin, however, it was not established whether fluorescence was directly related to erythromycin or was basal.

🦾 Future steps

Optimizing overexpression for subsequent protein purification is the next step. This will be done in the hopes of adding the purified protein directly to the sample with erythromycin to ensure proper detection and remove the risk of unwanted mutations and genetic variability caused by evolutionary pressure. Additionally, this will help calibrate the baseline of fluorescence with different concentrations of antibiotic to improve read accuracy and effectiveness.

In the future, and provided that ligand binding induces conformational changes in the sensor, EryK could be switched out for a different protein. This would allow detection of different emerging contaminants such as other antibiotics, per- and polyfluorinated substances, and heavy metals.

AtPCS as a new biopart overview

🤔 What's the deal with AtPCS?

Current FRET system design incorporating biopart BBa_K4447001 would allow us to determine the presence of erythromycin in water bodies, which represents a first step towards the detection of antibiotics, emerging contaminants of significant importance. An advantage of our FRET system is the potential to replace biopart BBa_K4447001 with other genes that are compatible with different compounds such as other antibiotics or heavy metals.

Pollution of water bodies with heavy metals is very threatening as they are persistent, non-biodegradable and toxic.This applies to any metal with a density greater than 5 g cm^-3 which is harmful even at low concentrations. Arsenic, cadmium, cupper, chromium, mercury, lead, nickel, and zinc are classified as heavy metals. These often originate from various sources such as volcanic activities, soil erosion, fertilizers, pesticides, industrial waste, and others. For each of the aforementioned metals, there are maximum tolerable levels that, when exceeded, can cause poisoning, brain damage, and even initiate cancer in humans (Jyoti et al., 2022).

Cadmium is classified as one of the five priority heavy metals for public health due to its high degree of toxicity. Various studies prove carcinogenic properties linked to cadmium and prolonged exposure to it both in humans and animals (Tchounwou et al., 2012). Additionally, cadmium occupies the second position of the metals with the highest toxicity in cells, according to Lin et al. (2016), surpassed only by mercury.

The aforementioned reasons led us to search for an enzyme that catalyzes a reaction involving this metal, with the aim of generating a new biopart and be able to incorporate it into the biosensor, as a step towards detecting heavy metals in water bodies. The enzyme chosen for the generation of the biopart was phytochelatin synthase (PCS) (EC 2.3.2.15), which catalyzes the synthesis of glutathione (GSH) polymers called phytochelatins (PCs), with cadmium being the main inducer in plants and other organisms (García-García et al., 2014; 2020).

⚡ Molecular dynamics

As with the structure for ECFP-EryK-mVENUS, a ColabFold model was obtained using the provided sequence for AtPCS (UniProt: Q9S7Z3) (Figure 14).

The sequence corresponds to a monomer and this was properly modeled by ColabFold with a high amount of confidence according to the PAE and PLDDT values (Figure 15).

MD simulations were performed to assess if the predicted structure obtained of the system was conserved after including biophysical potentials. A preliminary simulation was run with GROMACS using model 5 from ColabFold. The topology was generated with the same force field (AMBER99SB-ILDN) and water model (TIP3P) that was used for the simulation of the other construct (Lindorff-Larsen et al., 2010; Jorgensen et al., 1983) . The simulation was run in the same way and with the same parameters as the one for ECFP-EryK-mVENUS (Figure 16).

Contrary to the ECFP-EryK-mVENUS model, there are no published experimental structures of AtPCS1. The simulation shows that the protein structure does not significantly diverge from the ColabFold model and, while it doesn't encompass our entire construct, it is still informative as the fluorophores will not induce conformational changes. On the other hand, it would be necessary to run a simulation with the presence of the ion to determine whether the enzyme would undergo a conformational change when catalyzing the reaction.

🧪 Experimental design and future perspectives

Clone AtPCS into expression vector pET-28b(+)

The goal of this was to construct the vector that would allow the expression of AtPCS in E. coli. In order to achieve this, the following workflow was necessary:

Amplification of AtPCS by PCR.
Confirmation of the correct amplification and integrity of the DNA by agarose gel electrophoresis and subsequent purification.
Digestion reaction of AtPCS and pET-28b(+) with EcoRI and NdeI (New England Biolabs) to create complementary sticky ends.
Ligation reaction with T4 DNA ligase (New England Biolabs) to generate the complete expression vector.
Transformation of competent E. coli TOP10 cells by heat shock.
Selection of transformants by cultivating with kanamycin.
Confirmation of plasmid length by colony PCR and enzymatic digestion.

This goal was successfully achieved as shown in Figure 17, where a band corresponding to the length of the AtPCS gene was identified in sample D1 from colony 1 after a digestion assay with enzymes NdeI and EcoRI HF.

Induce protein overexpression to confirm the production of the enzyme in E. coli

The goal of this step was to confirm that E. coli was able to synthesize the AtPCS enzyme. To achieve this, the following workflow was carried out:

Transformation of competent E. coli BL21 cells with AtPCS-pET-28b(+).
Induction with 0.4 mM IPTG.
Cell lysis by sonication.
SDS-PAGE to confirm the presence of the expected enzyme.

This goal was partially achieved, as several attempts were made to induce expression of AtPCS. Induction was attempted with different temperatures and for different periods of time, the last attempt at 16 ºC overnight produced a very faint band (like a stain) on the SDS-PAGE gel, as shown in Figure 18.

Multiple attempts were carried out to overexpress the protein, varying the induction conditions (incubation time and temperature) in an attempt to optimize them; however, no band was observed on the gel after performing the purification process using the colony 1 culture. After setting a 5 mL pre-culture from colony 4, we proceeded with inoculation, cell harvesting and sonication. This was done to continue with a final attempt of purification through a Ni-NTA resin (ThermoFisher). An SDS-PAGE was run to confirm the presence of our protein. Unfortunately, no protein was purified (Figure 19).

Optimization of induction conditions are needed for confirmation of AtPCS enzyme production. To complete the incorporation of the enzyme into the FRET system, the plans for future activities are as follows:

1. Purify AtPCS and characterize its activity.

This is meant to indicate whether AtPCS produced by E. coli is able to interact with cadmium, which is necessary for its incorporation into the FRET system, as it would allow the detection of this heavy metal. Furthermore it would help determine whether a conformational change occurs or not.

2. Clone AtPCS into an expression vector with ECFP and mVENUS to generate the full FRET system by Gibson assembly, a method that has advantages including its speed, versatility, and scarless vector assembly (Thomas et al., 2015).

Linking AtPCS with ECFP and mVENUS will allow the generation of a fluorescent signal once the enzyme interacts with cadmium. This step is meant to assemble the components of the FRET system so it can be produced by E. coli.

Overexpress the FRET system, purify and characterize it.

Once the protein construct is overexpressed it will need to be tested. To do so, the construct will be purified and then tested with different amounts of cadmium. We expect to be able to detect a change in FRET signal with an increasing intensity as the concentration of cadmium becomes higher.

Experiments undertaken and future endeavors are summarized and displayed in Figure 20.

Chan, W. T., Verma, C. S., Lane, D. P., & Gan, S. K. (2013). A comparison and optimization of methods and factors affecting the transformation of Escherichia coli. Bioscience reports, 33(6), e00086. https://doi.org/10.1042/BSR20130098
García-García, J. D., Sánchez-Thomas, R., Saavedra, E., Fernández-Velasco, D. A., Romero-Romero, S., Casanova-Figueroa, K. I., … Moreno-Sánchez, R. (2020). Mapping the metal-catalytic site of a zinc-activated phytochelatin synthase. Algal Research, 101890. doi:10.1016/j.algal.2020.101890
García-García, J. D., Girard, L., Hernández, G., Saavedra, E., Pardo, J. P., Rodríguez-Zavala, J. S., Encalada, R., Reyes-Prieto, A., Mendoza-Cózatl, D. G., & Moreno-Sánchez, R. (2014). Zn-bis-glutathionate is the best co-substrate of the monomeric phytochelatin synthase from the photosynthetic heavy metal-hyperaccumulator Euglena gracilis. Metallomics : integrated biometal science, 6(3), 604-616. https://doi.org/10.1039/c3mt00313b
Hess, B., Kutzner, C., van der Spoel, D., & Lindahl, E. (2008). GROMACS 4: Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. Journal of chemical theory and computation, 4(3), 435-447. https://doi.org/10.1021/ct700301q
Jeong, H., Kim, H. J., & Lee, S. J. (2015). Complete Genome Sequence of Escherichia coli Strain BL21. Genome announcements, 3(2), e00134-15. https://doi.org/10.1128/genomeA.00134-15
Jyoti, D., Sinha, R., & Faggio, C. (2022). Advances in biological methods for the sequestration of heavy metals from water bodies: A review. Environmental toxicology and pharmacology, 94, 103927. https://doi.org/10.1016/j.etap.2022.103927
Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R., & Klein, M. L. (1983). Comparison of simple potential functions for simulating liquid water. Journal of Chemical Physics, 79(2), 926-935. https://doi.org/10.1063/1.445869
Lemkul, J.A. (2019). From Proteins to Perturbed Hamiltonians: A Suite of Tutorials for the GROMACS-2018 Molecular Simulation Package [Article v1.0]. Living Journal of Computational Molecular Science. https://doi.org/10.33011/livecoms.1.1.5068
Lin, X., Gu, Y., Zhou, Q., Mao, G., Zou, B., & Zhao, J. (2016). Combined toxicity of heavy metal mixtures in liver cells. Journal of Applied Toxicology, 36(9), 1163-1172. doi:10.1002/jat.3283
Lindorff-Larsen, K., Piana, S., Palmo, K., Maragakis, P., Klepeis, J. L., Dror, R. O., & Shaw, D. E. (2010). Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins, 78(8), 1950-1958. https://doi.org/10.1002/prot.22711
Mirdita, M., Schütze, K., Moriwaki, Y., Heo, L., Ovchinnikov, S., & Steinegger, M. (2022). ColabFold: making protein folding accessible to all. Nature methods, 19(6), 679-682. https://doi.org/10.1038/s41592-022-01488-1
Montemiglio, L. C., Macone, A., Ardiccioni, C., Avella, G., Vallone, B., & Savino, C. (2013). Redirecting P450 ERYK specificity by Rational Site-Directed Mutagenesis. Biochemistry, 52(21), 3678-3687. https://doi.org/10.1021/bi400223j
Park, S. W., Kang, S. K., & Yoon, T. S. (2016). Crystal structure of the cyan fluorescent protein Cerulean-S175G. Acta Crystallographica Section F: Structural Biology Communications, 72(7), 516-522. https://doi.org/10.1107/s2053230x16008311
Pereira, L. C., de Souza, A. O., Bernardes, M. F. F., Pazin, M., Tasso, M. J., Pereira, P. H., & Dorta, D. J. (2015). A perspective on the potential risks of emerging contaminants to human and environmental health. Environmental Science and Pollution Research, 22, 13800-13823.
Pletnev, S., Subach, F. V., Dauter, Z., Wlodawer, A., & Verkhusha, V. V. (2012). A Structural Basis for Reversible Photoswitching of Absorbance Spectra in Red Fluorescent Protein rsTagRFP. Journal of Molecular Biology, 417(3), 144-151. https://doi.org/10.1016/j.jmb.2012.01.044
Stuart, M., Lapworth, D., Crane, E., & Hart, A. (2012). Review of risk from potential emerging contaminants in UK groundwater. Science of the Total Environment, 416, 1-21.
Studier F. W. (2014). Stable expression clones and auto-induction for protein production in E. coli. Methods in molecular biology (Clifton, N.J.), 1091, 17-32. https://doi.org/10.1007/978-1-62703-691-7_2
Tchounwou, P. B., Yedjou, C. G., Patlolla, A. K., & Sutton, D. J. (2012). Heavy metal toxicity and the environment. Experientia supplementum (2012), 101, 133-164. https://doi.org/10.1007/978-3-7643-8340-4_6
Thomas, S., Maynard, N. D., & Gill, J. (2015). DNA library construction using Gibson Assembly®. Nature Methods, 12(11), i-ii. doi:10.1038/nmeth.f.384
Zhou, H. X., & Pang, X. (2018). Electrostatic Interactions in Protein Structure, Folding, Binding, and Condensation. Chemical reviews, 118(4), 1691-1741. https://doi.org/10.1021/acs.chemrev.7b00305