Design

Abstract

This section outlines the design of an actuator system aimed at silencing specific target genes using the CRISPR interference (CRISPRi) mechanism. The project started by searching the literature for plasmids compatible with the chosen bacterial strains, as well as Escherichia coli TOP10, to establish a modular and versatile platform following the Biobrick RFC[10] standard. Plasmids were modified and assembled to achieve compatibility. Additionally, a reporter gene (rfp) was inserted to validate genetic parts and optimize transformation protocols for non-model bacterial strains. An expression cassette for dCas9, made standard during the project, was also tested. These standardized plasmids will be employed for toxicity testing, silencing efficiency measurement, and transduction of bacteria when transformed into phagemids, working in collaboration with the developed Delivery system. Guide RNAs were designed on appropriate target genes; the selection process was guided by the knowledge that antibiotic-resistant strains utilize various pathogenesis factors, including antibiotic resistance genes and genes for biofilm formation and host fitness. This section provides justification for the chosen target genes based on this information.

CRISPRi method description

The CRISPR (clustered regularly interspaced short palindromic repeats/CRISPR associated) systems are naturally present in bacteria and archaea as their adaptive immune system: it is a highly adaptable and heritable mechanism that provides resistance against bacteriophages.

There are three classes of CRISPR: the most used in genetic engineering is the CRISPR/Cas9 system that is part of the second group. In order to cleave the target sequence, the endogenous CRISPR/Cas9 system needs the Cas9 protein, the CRISPR RNA (crRNA) and the trans-activating crRNA (tracrRNA) [1]. The crRNA must be previously in complex with a trans-activating RNA (trascRNA) in order to bind the Cas9. The CRISPR/Cas adaptive immune system requires 3 phases: the acquisition phase involves acquiring the molecular record from the infesting virus or plasmid, with the following insertion of the sequences of the molecular records into the CRISPR array. The transcription and maturation phase include the transcription of the CRISPR array and the maturation of the guide RNA. During the last phase, the crRNA guides the endonuclease to the target sequence where the protein cleaves the DNA assisted by its two nuclease domains HNH and RuvC [2].

The selection of the molecular records, called protospacer and then spacer once inserted in the CRISPR array, is determined by the presence of a specific protospacer adjacent motif (PAM) that is dependent on the species of the Cas9 protein (our system involves the Streptococcus pyogenes Cas9, so the PAM is 5’-NGG-3’). The PAM sequence is not integrated within the CRISPR array in order to prevent the cleavage of the endogenous genome [1]. From it’s first use to cleave target DNA in vitro in 2012 the Cas9 system has been improved, now we can perform a CRISPR/Cas9 cutting using a single chimeric guide (sgRNA) consisting of a fusion of the crRNA and the tracrRNA; this facilitates the rapid implementation of this system for the engineering of genomes [1].

In our system, we have chosen to use the dead Cas9 (dCas9), in which the two nuclease domains, HNH and RuvC, are catalytically inactive. Therefore, the protein is unable to cleave the DNA but can only bind to the target DNA sequence that is complementary to the guide RNA [2]. The dCas9:sgRNA complex performs the transcription repression (Figure 1.B) or the modulation of the expression of the target.

FIGURE 1| B) Binding and cleavage of the target sequence by the wild-type Cas9:sgRNA complex (left). Repression of RNA polymerase (RNAP) progression along the DNA by the dCas9:sgRNA complex (right). The two white points represent the D10A and H841A mutations. (Qi et al., 2013, [3])

The decision to use the dCas9 was based on the increased controllability of the system, and since the dCas9 cannot cleave the DNA, we can prevent the formation of uncontrollable mutations during the DNA repair process.

Another reason why we chose to use an interference system to target non-essential genes, instead of a cleavage system, is to avoid the selective pressure that DNA cleavage can create. Cutting the double strand of the DNA can result in mutations during the repair process or even cell death. We want to avoid the possibility that our system generates high selective pressure due to its significant negative effect.

Without killing the bacteria, there is the possibility of re-sensitizing them to antibiotics by silencing resistance or pathogenesis genes, such as those related to biofilm formation. The main cause of death of the bacterium would remain the antibiotic. Obviously, evolution of resistant strains is still possible, but in the case of mutations that occur on the complementary sequence of the guide RNA, the latter can be easily and quickly replaced in the host vector to silence new genes, after the characterization of the novel strain. This allows our project to have a higher level of security and also of versatility, since it will be possible to test other guides in order to improve the silencing efficiency.

dCas9 CDS and expression cassette

It was decided to utilize the plasmid J116-dCas9_3k3 (composed of the backbone pSB3K3 and the insert BBa_J107201, Bellato et al., 2022 [11]), which was available as a kind gift from the authors and reasonably exploitable for the purposes of the project; it already had an origin of replication for Escherichia coli (p15A), the kanamycin resistance gene and an expression cassette for d(Sp)Cas9 flanked by the standard assembly Prefix and Suffix (RFC[10], Figure 2). The sequence of the dCas9 was obtained from the plasmid pdCas9 (Addgene #44249). The cassette cannot be considered as a Part for iGEM due to an EcoRI site in position 1339 5'-end of the dCas9 coding sequence and thus not complying with any of the RFCs.

FIGURE 2| dCas9 expression cassette between Prefix and Suffix of plasmid J116-dCas9_3k3. In order, there are: the Promoter J23116 (BBa_J23116), from the synthetic promoter library developed by Anderson (iGEM Parts Catalog, Anderson Library), a scar (from a previous cloning), the RBS BBa_B0034, the coding sequence of d(Sp)Cas9 (4,107 kb long), the Double Terminator BBa_B0015 (composed by the rrnBT1 terminator BBa_B0010, in this case only a part of it was cloned, and the TE from coliphageT7 BBa_B0012).

To obtain a cassette compatible with the iGEM standards, a PCR mutagenesis was carried out to eliminate the EcoRI site while maintaining the codon usage in E. coli. The obtained standardized expression cassette will be cloned into the plasmids able to replicate into the bacteria strain of interest exploiting the restriction enzymes in Prefix and Suffix.

Guide design

The sgRNA is composed of a 3' constant region, the scaffold, and a 5' variable region, corresponding to 17-24 nt complementary to the target DNA (Figure 3). Focusing on the scaffold, we have found that the same 83 nt long sequence (developed by Qi et al., 2013,[3]), including the handle for the dCas9 and a terminator from S. pyogenes, was used in all the strains of interest during different studies exploiting CRISPRi in different bacterial species. For example, Wang et al., 2019 [4] for A. baumannii, Wang et al., 2018 [5] for K. pneumoniae and Chen et al., 2018 [6] for P. aeruginosa.

FIGURE 3| A) Example of the composition of a sgRNA: the base pairing region of about 20 nt, the scaffold composed by 40 nt dCas9 handle, the hairpin that binds the dCas9, and 42 nt of the terminator from S. pyogenes. B) sgRNA:DNA complex downstream the PAM sequence [3].

Given an annotated bacterial genome, if the goal is gene repression, the sgRNA has to bind either the non-template (NT; or coding) DNA (so the sgRNA coding sequence should be equal to the template DNA upstream the PAM, Figure 3.B) or the promoter. But in the case of the described project there was insufficient knowledge on the regions upstream of the genes of interest. Furthermore, the silencing efficacy generally depends on the binding distance of the guide from the transcription start site (TSS), in an inversely proportional manner. The greater the distance from the TSS, the greater the likelihood of truncated protein formation (Figure 4.B). Another important feature is that the required GC content is between 30% and 70%.

FIGURE 4| A) Binding of dCas9 to NT DNA of a coding sequence. B) In general, it is optimal that the guide binds by 20% of the coding region [3].

One of the most critical difficulties in using CRISPR/Cas, and thus CRISPRi, is the design of a specific sgRNA to prevent a high number of off-target sites. Cui et al (2018) [7] reviewed a list of tools for on and off-target predictions in the proper design of sgRNA, such as CHOPCHOP and CasOFFinder respectively. It is important that there are no single base mutations and therefore no mismatches with the target DNA in the first 12-7 nt at 3' end (seed region) of the guide; otherwise, the effectiveness and specificity of the system decrease. A further option is to target the same gene with two guides simultaneously, hence with two dCas9:sgRNA complexes. According to Anderson et al., (2021) [8], two guides must be at least 40 nt apart to not interfere with each other.

Target Genes

We decided to target genes involved in the antibiotic resistance, bacteria needs and biofilm production in order to restore bacterial susceptibility to antibiotics commonly used and enable their better absorption. Here, we report information regarding these genes that also can explain the main reasons under our choices.

Acinetobacter baumannii

OmpA

OmpA is a porin protein. Its N-terminal domain is an antiparallel β-barrel structure made up of eight transmembrane strands in the outer membrane, and these strands are connected by four loops. The globular C-terminal domain consists of three short turns in the periplasmic domain. OmpA is the most studied virulence factor in Acinetobacter baumannii among the outer membrane proteins (OMPs), and it plays a key role in regulating adhesion, aggressiveness, biofilm production, and the host's immune response [9]. Our interest focuses on its role in biofilm formation, which was confirmed in [10] through mutagenic experiments, also conducted on our strain.

Csu cluster

The CSU (chaperon usher secretion system) is required for bacterial adhesion on abiotic surfaces as the primary stage for the biofilm formation. They consist of a tip fibrillum and adhesion protein, A. baumannii produces the type I Csu pilus that is encoded by the operon Csuab-A-B-C-D-E. The part that is responsible for the adhesion to hydrophobic plastic (widely used in medical equipment) is CsuE. This system is regulated by the two-component system BfmRS [15]. However, we had decided to prioritize ompA as the target against biofilm because it is also involved in the adhesion process [9].

orf3/CraA

CraA (which stands for chloramphenicol resistance Acinetobacter, originally named Orf3) is a multidrug efflux transport protein homologous to the MdfA efflux pump of Escherichia coli. The system has been found in all 82 A. baumannii strains tested in [14] and is believed to contribute to intrinsic resistance to chloramphenicol [14]. As reported in the image below from [13], the silencing of this gene significantly decreases the MIC (Minimum Inhibitory Concentration) for the antibiotic chloramphenicol, to which our strain is resistant [13]. For these reasons, we chose it as a target.

antibiotics
ampC

blaADC, also called AmpC, is a class C β-lactamase, i.e. a cephalosporinase encoded by the chromosome in A. baumannii. It involves resistance to Ampicillin, especially when it is overexpressed due to the insertion of short transposon sequences, which are stronger promoters than the intrinsic one.

algC

The algC gene was studied in Acinetobacter baumannii AIIMS 7 [16], a multidrug resistant strain, as a candidate gene in biofilm production. Further analysis identified the transcription product as the bifunctional enzyme phosphomannomutase/phosphoglucomutase, necessary to the production of the lipopolysaccharide core and exopolysaccharides such as alginate. In particular, this enzyme is thought to have a similar structure to that of P. aeruginosa. Transcript levels appear to be in close association with biofilm formation, particularly in the maturation phase (36–48 hours) and also in the initial attachment phase (3–9 hours).

Klebsiella pneumoniae

blaSHV-1

The penicillin resistance gene blaSHV-1 was first discovered in K. pneumoniae in the 1960s. It codes for a class A beta-lactamase; the enzyme also has β-lactamase activity against cephalosporins and it determines resistance against ampicillin. Over the years, many genes for broad-spectrum β-lactamases have developed, and have been identified, in various clinical strains of K. pneumoniae, increasing the emergence of infections caused by this bacterium.

ompA

OmpA is a predominant outer membrane protein in Enterobacteriaceae. Its expression can be influenced by various environmental factors. In Klebsiella pneumoniae, the gene has a similar key role as in A. baumannii. In particular, KpOmpA mediates adhesion to a wide range of immune effector cells, facilitating respiratory and urinary tract infections. It stabilizes Gram-negative bacteria by anchoring their outer membrane to the peptidoglycan layer. It is structurally similar to E. coli OmpA, with the difference that it has larger extracellular loops. These loops are thought to mediate adhesion during K. pneumoniae infection, while the C-terminal peptidoglycan-binding domain (PGBD) remains anchored to the peptidoglycan layer in the periplasmic space [18].

Pseudomonas aeruginosa

ampC

P. aeruginosa is considered one of the most common causes of opportunistic burn and nosocomial infections that can be difficult to treat. The chromosomal gene ampC codes for a broad-spectrum antibiotic hydrolase, determining the resistance to Aminopenicillins, Cephalosporins, Cephamycins, Carbapenems, and Monobactams.

pvdA

Pyoverdines are fluorescent proteins which have previously been defined as bacterial fluoresceins. However, being siderophores as well (i.e. iron acquisition and transport systems), they are important for the growth of P. aeruginosa within a mammalian host [12]. Infact, Ankenbauer et al. (1986) demonstrated that a deficit in pyoverdine production leads to slower growth, mostly determined by lower iron uptake rates, than a Pvd+ strain in the presence of human serum or transferrin. Pvd- mutants didn’t grow in an EDDA-containing medium. Studies on key roles of the pvdA gene in infections highlighted that pyoverdines are factors of virulence: Minandri et al. (2016) found out that they are central in the pathogenesis of lung infections in mice [19].

Vector design

The actuator system relies on a vector that carries the silencing mechanism of CRISPRi mechanism that, once transformed in the target cell, will target the selected genes. These vectors would serve in a first moment as a proof of concept of the silencing system previously described, whereas later would serve as the template for a new phagemid suitable for different host species.

To design a proper vector for the non-model organisms we are working on, the first step was to find plasmids that can be expressed in these species and then adapt them to the BioBrick Standard Assembly. In a second moment it was necessary to insert the CRISPRi cassette, including the dCas9 protein and the sgRNA, into the same plasmid to validate the interference system.

Acinetobacter baumannii and Klebsiella pneumoniae - pSGAb

For these two bacteria we have selected the same starting plasmid: pSGAb-km (6073 bp), visible in Figure 5 (Addgene #121999) [9] that has 2 ORI:

  • pMB1 (from pUC57 vector) for replication in E. coli and K. pneumoniae
  • WH1266 (from pWH1266 vector) for replication in A. baumannii

Since our strain Acinetobacter baumannii ATCC19606 is resistant to ampicillin and chloramphenicol and Klebsiella pneumoniae ATCC13883 is resistant to ampicillin as well, the kanamycin resistance of this plasmid was perfect for our experiments. This plasmid was employed as a vector for the sgRNA in a CRISPR/Cas9 system [4]. It includes a modified version of the Biobrick BBa_J23119 promoter from the synthetic promoter library developed by Anderson (iGEM Parts Catalog, Anderson Library) that contains a restriction site for SpeI. Furthermore there are restriction sites that are not compatible with the Standard Assembly.

pSGAb-km

FIGURE 5| pSGAb-km.

Therefore, to establish a standardized backbone suitable for various experiments in both of these bacteria, we opted to proceed in stages. Since all the necessary restriction sites that needed to be eliminated were within a Multiple cloning site (Figure 6), we have decided to excise that portion and insert a newly synthesized fragment containing both the Prefix and the Suffix accompanied by the standard promoter BBa_J23119. In this way we can also eliminate the modified J23119 promoter and the original sgRNA.

Excised portion of pSGAb-km

FIGURE 6| Excised portion of pSGAb-km.

The next step is to insert a reporter gene expression cassette (e. g BBa_I13507 or BBa_I13521), this step would be necessary to analyze the effective functionality of the plasmid within our strains and of all the different genetic parts, in particular the RBS and the double terminator that are also present in J116-dCas9_3k3. A reporter system is important to verify the transformation of ATCC 19606 and ATCC 13883 bacterial strains through electroporation protocols optimized after previous results. The expression cassette of the reporter should generate a phenotype easily measurable in the transformed cells.Then, it is necessary to insert in the standardized plasmid the CRISPRi system: the dCas9 cassette and the sgRNA cassette.

The approach we chose to follow involved initially inserting the dCas9 cassette. This would result in a plasmid where we could change only the sgRNA cassette, allowing us to attempt the inactivation of different targets.

Pseudomonas aeruginosa - pPAC

The proof of concept phase of our project requires testing the expression of a reporter gene inside Pseudomonas aeruginosa aeruginosa. In order to do so, we searched in literature which plasmids would be suitable for our purpose. In the preliminary phase we found two candidate plasmids: pMF230 and pUCP20 PilB. However, both of them showed some incompatibilities with our strain and required some modification in order to be used to transform the bacteria.

pMF230 is a broad host range low copy plasmid for constitutive expression of the eGFP. The selection marker is the gene for ampicillin resistance. We couldn’t use this plasmid as it was for a few reasons. First of all, the bacterial strain we use is Pseudomonas aeruginosa aeruginosa PAO1 (ATCC 10145), which is naturally resistant to Ampicillin; this makes pMF230 incompatible, because after transformation we wouldn’t be able to tell which colonies have actually incorporated the plasmid. Moreover, due to the production of pyoverdines (Schalk IJ et al., 2012 [17]), P. aeruginosa colonies appear green: thus, using an eGFP as a reporter gene would make it difficult to distinguish transformed colonies and also to measure appropriately the fluorescence produced by the eGFP.

pUCP20 PilB is a high copy plasmid specific for P. aeruginosa that expresses the PilB protein with a C-terminal his-tag. pilB is not an appropriate reporter gene, so we should substitute it with the gene for a fluorescent protein. However, this task could be difficult due to the fact that the terminators are not annotated on the plasmid map, thus by changing the coding sequence there could be problems with the transcription. In addition, it has a selection marker which is a gene for ampicillin resistance, so it occurs the same problem of pMF230.

In order to have a plasmid suitable for our strain, it was clear to us that assembling regions coming from different plasmids would be the best way to reach this goal. The assembly was performed using the Gibson assembly method. The parts that compose the final plasmid are described in the table below.

Templates Parts
pPUCP20 PilB Ori pMB1, pLac promoter, AmpR cassette
pMF230 Terminators rrnbT1 and rrnbT2
pSB4C5 cat gene (chloramphenicol resistance)

The final plasmid obtained is a standard backbone suitable for the expression in P. aeruginosa called pPAC (Figure 7).

pPAC

FIGURE 7| Final pPAC.

The next step is to clone between Prefix and Suffix the RFP cassette from BBa_I13507, which contains the RBS BBa_B0034, the rfp gene BBa_E1010, and the terminators BBa_B0010 and BBa_B0012. This step is necessary to assess the functionality of the plasmid itself and of the genetic parts used within our strain of P. aeruginosa, in particular the RBS and the Double Terminator, because the BBa_I13507 cassette has been designed for the expression of RFP in E. coli. After testing the functionality of the parts, the CRISPRi system will be inserted in the standard backbone pPAC, composed by the dCas9 cassette and the sgRNA cassette.


References

  1. Hryhorowicz, Magdalena, Daniel Lipiński, Joanna Zeyland, e Ryszard Słomski. «CRISPR/Cas9 Immune System as a Tool for Genome Engineering». Archivum Immunologiae et Therapiae Experimentalis 65, fasc. 3 (giugno 2017): 233–40. https://doi.org/10.1007/s00005-016-0427-5.
  2. Mahas, Ahmed, C. Neal Stewart, e Magdy M. Mahfouz. «Harnessing CRISPR/Cas Systems for Programmable Transcriptional and Post-Transcriptional Regulation». Biotechnology Advances 36, fasc. 1 (gennaio 2018): 295–310. https://doi.org/10.1016/j.biotechadv.2017.11.008.
  3. Qi, L. S., Larson, M. H., Gilbert, L. A., Doudna, J. A., Weissman, J. S., Arkin, A. P., & Lim, W. A. (2013). Repurposing CRISPR as an RNA-Guided Platform for SequenceSpecific Control of Gene Expression. Cell, 152(5), 1173–1183. https://doi.org/10.1016/j.cell.2013.02.022.
  4. Wang, Yu, Zhipeng Wang, Yan Chen, Xiaoting Hua, Yunsong Yu, e Quanjiang Ji. «A Highly Efficient CRISPR-Cas9-Based Genome Engineering Platform in Acinetobacter baumannii to Understand the H2O2-Sensing Mechanism of OxyR». Cell Chemical Biology 26, fasc. 12 (dicembre 2019): 1732-1742.e5. https://doi.org/10.1016/j.chembiol.2019.09.003.
  5. Wang, Y., Wang, S., Chen, W., Song, L., Zhang, Y., Shen, Z., Yu, F., Li, M., & Ji, Q. (2018). CRISPR-Cas9 and CRISPR-Assisted Cytidine Deaminase Enable Precise and Efficient Genome Editing in Klebsiella pneumoniae. Applied and Environmental Microbiology, 84(23), e01834-18. https://doi.org/10.1128/AEM.01834-18.
  6. Chen, W., Zhang, Y., Zhang, Y., Pi, Y., Gu, T., Song, L., Wang, Y., & Ji, Q. (2018). CRISPR/Cas9-based Genome Editing in Pseudomonas aeruginosa aeruginosa and Cytidine Deaminase-Mediated Base Editing in Pseudomonas aeruginosa Species. IScience, 6, 222–231. https://doi.org/10.1016/j.isci.2018.07.024.
  7. Cui, Y., Xu, J., Cheng, M., Liao, X., & Peng, S. (2018). Review of CRISPR/Cas9 sgRNA Design Tools. Interdisciplinary Sciences: Computational Life Sciences, 10(2), 455–465. https://doi.org/10.1007/s12539-018-0298-z.
  8. Anderson, D. A., & Voigt, C. A. (2021). Competitive dCas9 binding as a mechanism for transcriptional control. Molecular Systems Biology, 17(11), e10512. https://doi.org/10.15252/msb.202110512.
  9. Gaddy, Jennifer A., Andrew P. Tomaras, e Luis A. Actis. «The Acinetobacter baumannii 19606 OmpA Protein Plays a Role in Biofilm Formation on Abiotic Surfaces and in the Interaction of This Pathogen with Eukaryotic Cells». Infection and Immunity 77, fasc. 8 (agosto 2009): 3150–60. https://doi.org/10.1128/IAI.00096-09.
  10. Nie, Dan, Yue Hu, Zhou Chen, Mingkai Li, Zheng Hou, Xiaoxing Luo, Xinggang Mao, e Xiaoyan Xue. «Outer Membrane Protein A (OmpA) as a Potential Therapeutic Target for Acinetobacter baumannii Infection». Journal of Biomedical Science 27, fasc. 1 (dicembre 2020): 26. https://doi.org/10.1186/s12929-020-0617-7.
  11. Bellato, M., Frusteri Chiacchiera, A., Salibi, E., Casanova, M., De Marchi, D., Castagliuolo, I., Cusella De Angelis, M. G., Magni, P., & Pasotti, L. (2022). CRISPR Interference Modules as Low-Burden Logic Inverters in Synthetic Circuits. Frontiers in Bioengineering and Biotechnology, 9, 743950. https://doi.org/10.3389/fbioe.2021.743950.
  12. Ankenbauer, R., Hanne, L. F., & Cox, C. D. (1986). Mapping of mutations in Pseudomonas aeruginosa aeruginosa defective in pyoverdin production. Journal of Bacteriology, 167(1), 7–11. https://doi.org/10.1128/jb.167.1.7-11.1986.
  13. Roca, I., S. Marti, P. Espinal, P. Martínez, I. Gibert, e J. Vila. «CraA, a Major Facilitator Superfamily Efflux Pump Associated with Chloramphenicol Resistance in Acinetobacter baumannii». Antimicrobial Agents and Chemotherapy 53, fasc. 9 (settembre 2009): 4013–14. https://doi.org/10.1128/AAC.00584-09.
  14. Coyne, Sébastien, Patrice Courvalin, e Bruno Périchon. «Efflux-Mediated Antibiotic Resistance in Acinetobacter Spp». Antimicrobial Agents and Chemotherapy 55, fasc. 3 (marzo 2011): 947–53. https://doi.org/10.1128/AAC.01388-10.
  15. Pakharukova, Natalia, Minna Tuittila, Sari Paavilainen, Henri Malmi, Olena Parilova, Susann Teneberg, Stefan D. Knight, e Anton V. Zavialov. «Structural Basis for Acinetobacter baumannii Biofilm Formation». Proceedings of the National Academy of Sciences 115, fasc. 21 (22 maggio 2018): 5558–63. https://doi.org/10.1073/pnas.1800961115.
  16. Sahu, P. K., Iyer, P. S., Barage, S. H., Sonawane, K. D., & Chopade, B. A. (2014). Characterization of the algC Gene Expression Pattern in the Multidrug Resistant Acinetobacter baumannii AIIMS 7 and Correlation with Biofilm Development on Abiotic Surface. The Scientific World Journal, 2014, 1–14. https://doi.org/10.1155/2014/593546.
  17. Schalk IJ, Guillon L. Pyoverdine biosynthesis and secretion in Pseudomonas aeruginosa aeruginosa: implications for metal homeostasis. Environ Microbiol. 2013 Jun;15(6):1661-73. https://doi.org/10.1111/1462-2920.12013.
  18. Bosshart, P. D., Iordanov, I., Garzon-Coral, C., Demange, P., Engel, A., Milon, A., & Müller, D. J. (2012). The Transmembrane Protein KpOmpA Anchoring the Outer Membrane of Klebsiella pneumoniae Unfolds and Refolds in Response to Tensile Load. Structure, 20(1), 121–127. https://doi.org/10.1016/j.str.2011.11.002.
  19. Minandri, F., Imperi, F., Frangipani, E., Bonchi, C., Visaggio, D., Facchini, M., Pasquali, P., Bragonzi, A., & Visca, P. (2016). Role of Iron Uptake Systems in Pseudomonas aeruginosa aeruginosa Virulence and Airway Infection. Infection and Immunity, 84(8), 2324–2335. https://doi.org/10.1128/IAI.00098-16.

Abstract

In this section we will introduce the delivery mechanism of our interference system. Our priorities were to design something that could be safe, efficient and adaptable to the specific situation. We wanted to create a platform that could be used to address antibiotic resistant infections quickly and efficiently. At the same time, as our primary objective was to apply it in the human body, safety was of uttermost importance. To address these design considerations we thought of developing a phage delivering platform based on two engineered constructs that together allow the production of M13 phage particles bearing a genome of desire. The first construct is a plasmid, called phagemid, which incorporates the f1 intergenic region facilitating encapsidation within the M13 bacteriophage. This plasmid is further equipped with an origin of replication (ORI) and a selection marker, thereby enabling replication within host cells. The second plasmid, known as helper plasmid, serves as a repository for phage capsid proteins, in conjunction with an ORI and a selection marker. The absence of the encapsidation sequence within the helper plasmid allows for the production of phage particles carrying the phagemid. Consequently, upon infection of the target bacteria with the genetically modified phages, transduction of their genome is facilitated, while phage particle production within the host is hindered due to the absence of essential phage proteins. Thus, the phage system operates exclusively as a way for delivering the engineered DNA construct.

Introduction

In compassionate care, the employed bacteriophages are derived from environmental samples (i.e. from sewage systems) that have demonstrated in vitro lytic activity against the clinical isolate [1]. Therefore, up to now, phage particles are purified and administered without a precise characterization, and their potential clinical efficacy relies just on the assumption that their specific antibacterial activity in vitro could be reproducible also in vivo [1]; as a consequence, these therapies are usually personalized on the clinical case. However, in order for such an approach to reach everyday clinical practice, significantly higher levels of knowledge about what is being dispensed, as well as the standardization of the clinical protocol, are required. To this end, phage engineering has been proposed as a solution, allowing to thoroughly control phage particles production. In particular, a molecular engineering approach would allow the production of safer phage particles, purified from cellular debris and potential bacterial antigens, that currently represent one of the major issues [1, 2, 3].

In addition, the ability to produce engineered phage particles would make us capable of carrying a fully characterized genome of desire, containing gene editing tools or silencing systems that can interfere with the protein expression of pathogenic bacteria, namely the phagemid described lately. When co-transformed with the phagemid, the helper plasmid enables the production of viable phage particles carrying the desired genome [4,5](Figure 1).

packaging cycle

Figure 1. The packaging cycle of a phagemid. This image shows how the co-transformation of a helper plasmid with a phagemid allows the production and the subsequent purification of phage particles with the desired genome [5]

Such an approach will result in the production of engineered bacteriophages that are no longer unknown and therefore suitable as clinical therapy. Nevertheless, many challenges still need to be faced, with a primary aim of promoting the widespread adoption of bacteriophage research, by developing innovative solutions that simplify exploration and implementation of novel and pragmatic approaches. Building on these foundations, the present work aims to lay the groundwork for an approach to bacteriophage engineering, based on the concepts of parts standardization with the scope of making research in this field more approachable.

Phagemid

In our pursuit of a phage delivery system capable of accommodating a customizable genome to facilitate the expression of a targeted silencing system in recipient cells, a construct with specific attributes was indispensable. This construct had to fulfill several criteria: it needed to be capable of packaging within bacteriophages, easily subject to modifications, and compliant with the BioBrick standard. To address these prerequisites, we started from a commercially available construct, pTZ19R (ThermoFisher Scientific). This particular construct possesses the ability to be encapsulated within M13 phage particles. This can be achieved either by co-transformation with a helper plasmid or by subjecting cells to infection by a helper phage. The basis of this ability lies in the presence of the f’ intergenic region within the plasmid, a sequence that encodes the encapsidation signal.

Hence, this plasmid can be denoted as a phagemid, a construct capable of being encapsulated when exposed to the viral proteins. However, when present solely within a host cell, it lacks the capability to independently generate viable phage particles. This fundamental characteristic makes it a secure conduit for delivering the interference system. Importantly, this delivery approach safeguards the recipient cell from harm and precludes exerting evolutionary pressure on bacteria. Such pressure could potentially lead to bacterial resistance against the phage system itself, something we clearly want to avoid.

While this construct (Figure2) exhibited several favorable attributes aligned with our objectives, certain notable issues necessitated resolution. Primarily, there existed an obstacle due to the presence of a multiple cloning site embedded within the LacZ gene. This site encompassed three restricted cut sites, a circumstance that could be readily rectified through targeted deletion, achieved using a primer pair that excludes this region. Another significant concern was the absence of the BioBrick RFC[10] prefix and suffix. This challenge was similarly addressed straightforwardly, utilizing PCR mutagenesis to seamlessly incorporate the two required sequences.

Plasmid map of pTZ19R

Figure2. Plasmid map of pTZ19R

Through these essential modifications, we successfully constructed a novel standard backbone, deposited as BBa_K4727003 in the registry. This backbone has demonstrated its ability to maintain a high copy number within E. coli cells (replicon pMB1) and to effectively express the encoded protein. Critically, when coexisting with a helper plasmid or helper phage, this construct is capable of being encapsulated within M13 phage particles. This achievement leads to the creation of engineered bacteriophages carrying a customized genomic payload.

As a consequence, this construct emerges as a viable solution to the essential need for comprehensive characterization in propelling phage therapy towards routine clinical implementation. Importantly, its safety profile is well-established, devoid of significant threats to patient well-being. Furthermore, its inability to propagate within host cells due to the absence of phage protein encoding genes, ensures the prevention of unintended dissemination of engineered constructs. Lastly, its adaptability is a remarkable feature - it can be easily tailored to the specifics of individual cases, thus presenting a versatile and readily adjustable platform tailored to combat diverse pathogens.

Helper plasmid

Currently, the production of engineered M13 phage particles through a helper plasmid involves constructs composed of a replication origin (ORI) that facilitates replication within E. coli, an antibiotic selection marker that allows for the selection of transformed cells, and the genes encoding the M13 phage capsid proteins. These constructs are typically obtained by removing the encapsidation sequence from the M13 phage genome and replacing it with ORI and the selection marker. Of particular interest to us was the M13cp plasmid, built by Chasteen et al.[6]. However, this plasmid is not compatible with the BioBrick standard as it carries two PstI restriction sites flanking the ORI, furthermore, it lacks the BioBrick prefix and suffix. Thus, the region bearing the ORI and the chloramphenicol resistance was replaced with the standard pSB3K3 backbone, which carries the same ORI as the original helper plasmid, particularly p15a, and utilizes kanamycin resistance as the selection marker, compatible with the marker carried by the M13 phagemid BBa_K4727003. In this way we propose the new plasmid backbone BBa_K4727002, that allows for the production of engineered phage particles.

Tropism variation

Furthermore we wanted to explore the possibility of varying the tropism of the bacteriophage. Viral tropism is defined as the ability of a virus to target a specific molecular pattern which is characteristic of a cell or a tissue. Bacteriophages are capable of infecting bacteria only, and they do so by recognizing peculiar molecules exposed on the cell surface[1] (i.e. protein structures as pili, flagella, porins, or efflux pumps [1]. Otherwise, sugars in lipopolysaccharides have been reported [1]).

The interactions of the phage are highly specific and rely on proteins exposed on its surface. This renders the phage's ability to discriminate against the bacteria it can infect, and in some cases even among different strains, exceptionally selective. If, on one hand, it is a great advantage to have an organism able to target bacteria in such a specific manner, on the other it poses many limits in terms of spectrum of application. These considerations prompted an exploration of potential strategies for engineering host phage range. Despite the presence of examples in the existing literature and by past iGEM teams, none of them proved suitable for the intended purposes due to variations in the bacteriophages utilized [7,8]. Moreover, while tropism variation was explored in some head-tail phages [7, 8], as the Zurich iGEM team, there are few examples for filamentous phages, such as the Aix-Marseille 2017 iGEM team.

This purpose could have been achieved in two opposites, yet complementary, ways. The first one uses a random approach, inspired by antibody specific engineering, that allows mutagenesis of regions involved in target recognition, followed by a screening against the desired host(s) [8], this was the approach developed and exploited by the 2019 Zurich iGEM team. They developed a system to introduce random mutations in the tropism determinant proteins of T7 bacteriophages and to select them against the putative target. On one hand this methodology is far more time consuming, on the other it can result in a broader extension of the tropism as many more possibilities are explored. The second method can be otherwise described as a rational approach, using synthetic biology to modulate phage host ranges by swapping tropism determinants between different phage species[7]. This allows a higher confidence over the possible results nevertheless it requires a deeper knowledge of the tropism determinants. These two different approaches, even if applied to profoundly different families of bacteriophages (i.e. T4, T3, and T7) [7,8], represented the guidelines of our research. Keeping in mind these considerations the rational design of parts was chosen, as it allows easy swap between alternative genes to vary the host range of the system.

M13 tropism

As stated above, there are few examples of tropism variation in filamentous bacteriophages, for this reason a possible approach in M13 bacteriophage is proposed in this work. M13 uses a minor protein (Gp3) of its capsule to interact with the pilus (F’ factor) of Escherichia coli [9,10]. Subsequent publications also demonstrated how this interaction is mediated and takes place, involving not only the protein of the pili but also another protein of E. coli outer membrane, TolA9. On the virus side, the tropism of filamentous phages is determined by a protein encoded in the gene 3, referred as Gene Product 3 (Gp3)[10]. Gp3, a 406 amino acids-long protein[9,10] that sits in three to five copies on one end of the capsid[10], being responsible not only for the interaction with the outer structures of the bacteria but also for the termination of the virion assembly[9,10]. Once correctly folded it forms three functionally distinguished domains (N1, N2 and the C-terminal domain CTD), each of which is involved in the infective process[9,10]. The CTD anchors Gp3 to the phage by interacting with other phage coat proteins, whereas the N1 and N2 domains are responsible for the interaction with the bacteria[10]. The three-dimensional structures of the N1 and N2 domains have been determined by circular dichroism (CD) spectrum analysis and X-ray diffraction[11,12], whereas the CTD has not yet been characterized. Given N1 and N2 domains structure, it has been possible to determine the functions of the respective interacting parts. Even though N2 is in the middle of the amino acid sequence, the 3D structure reveals how it is located on one end of the protein and is responsible for the interaction with the F’ factor[10]. The positioning of the domain hides N1 which includes the N-terminus of the protein, but it becomes available after the first contact with the pili resulting in N1 domain interaction with the TolA protein[9]. TolA is a 421 amino acids-long protein, located in the periplasmic space of E. coli[13]; it anchors itself to the inner membrane with the N-terminus, spanning through the periplasm with a long α-helix ending with the C terminal domain (TolAIII)[13]. It has been demonstrated that TolAIII is functional in the infection process of M13 [9,13] thanks to its high affinity interaction (1-1,9μM) with N1 [9].

For this reason, two approaches to tropism variation will be discussed, the first revolving around the interaction of Gp3 with TolA and its homologues, whereas the second focuses on other “gene 3” homologues of the one encoded in M13 bacteriophage.

An approach to tropism engineering

Many benefits come with standardization of the helper plasmid, in particular the ability to easily interchange the parts present between the prefix and suffix of the vector. For this reason, we wanted to take advantage of this strategy to address the problem posed by tropism variation in bacteriophages. As described above, tropism in the M13 bacteriophage is determined by Gp3 protein, encoded by gene 3 [10]. Based on this knowledge, we decided to adopt a rational design strategy for new genes that can modify tropism. The rationale behind this choice primarily lies in the reduced time required by this approach and the increased predictability of potential outcomes. Such advantages would not be achievable through a random gene mutation approach, although it might enable a broader exploration of tropism variants. It is remarkable to note that the work made by the Zurich iGEM team in 2019 could represent a viable method to further extend the ideas we developed. Coupling the rational design with a random mutagenesis approach and an efficient selection method can take to the next step phage research in tropism variation.

Exploit TolA homologues

The rational design of alternative genes took two paths, both founded on literature research, past works from iGEM teams and the use of computational tools. The first path, leading to the generation of "Gp3_cut" (part BBa_K4727004), was based on the fundamental work of DA Marvin [14], which suggested that mutants lacking the N2 domain can infect bacteria without pilus with a 100-fold higher efficiency [14], as N2 masks the interaction of N1 with the co-receptor TolA. Building on this, the idea was to design a gene with a deleted N2 domain, drawing from the findings of Nilsson et al.[10], who precisely delineated the amino acid regions corresponding to individual domains. Therefore, an alternative gene was designed for tropism that could ideally enable E. coli infection in the absence of a pilus, thus extending the tropism. This gene, named "Gp3_cut," was obtained by in silico deletion of the region between residues 87 and 256, corresponding to the N2 domain [10]. However, considering this protein's ability to interact with the TolA protein, specifically the TolAIII domain [9,10], it was hypothesized that this same modified gene could also interact with homologous TolA proteins. Through bioinformatic analysis, using the BLASTp tool [15], a surface protein of Klebsiella pneumoniae, called "cell envelope integrity protein TolA" (protein ID: MBS4159696.1), was identified, sharing 100% similarity with the TolAIII domain of E. coli (residues 305-421 [13]). This suggests that such modification could potentially allow the engineered bacteriophage's entry into K. pneumoniae as well.

Gp3 homologues

The second path that was taken, which led to the generation of “Gp3_Pf3” (part BBa_K4727005), stem from the research of photogenically closed bacteriophages and the work of Aix-Marseille 2017 iGEM team. This choice follows the idea that proteins of evolutionary related phages maintain a certain grade of similarity thus being still able to interact between each other and correctly assemble the capsid. M13 bacteriophage is part of the Inoviridae family, that includes other filamentous phages, among them the Pf3 bacteriophage was identified as it is capable of infecting Pseudomonas aeruginosa. Given the relevance of this pathogen in the AMR phenomenon [1] it was considered as an interesting possibility in attempting tropism variation. As in all filamentous phages the tropism determinant protein is annotated as “Gene 3 product”[10], the coding sequence was identified by analyzing the deposited genome in the NCBI databank (ID: NC_074763), and isolated.

In silico design of expression cassettes

Once the putative genes for tropism variation were identified, it was necessary to design appropriate expression cassettes that could be inserted into the helper plasmid. To accomplish this, preliminary considerations were made. Firstly, the metabolic burden imposed by this plasmid is significant, as it expresses a total of 12 proteins (11 from the phage capsid, plus the selection marker), which significantly prolong colony growth (Figure 3). Secondly, our goal is to preferentially express alternative tropism genes and consequently incorporate them into the phage, as opposed to the wild-type gene 3 product, which is normally expressed. Only in this way can phage particles with altered tropism be obtained.

Metabolic burden of pSB3KM13

Figure 3. Metabolic burden of pSB3KM13. In the figure, E. coli TOP10 colonies bearing pSB3KM13 after different incubation times are reported. (a) After overnight incubation (16 hours), few and small colonies are visible on the plate. (b) After 24 hours the colonies were bigger and in a higher number. (c) After 36 hours, the colonies were clearly visible and could be easily isolated.

The aforementioned compromise stems from an essential consideration. The phage genome inserted into the standard helper plasmid is inherently complex, comprising multiple overlapping sequences, with promoters and terminators placed on-frame within coding sequences. This complexity, along with the lack of precise genome annotation, prevents the silencing, given current knowledge, of the naturally expressed gene 3, as doing so would risk disrupting other fundamental regulatory sequences required for the expression of the remaining genes. Consequently, the expected result is a mixed population of phages with two different tropisms: the natural one resulting from Gp3 expression and an altered one conferred by the gene placed between the prefix and suffix.

Given these premises, in constructing the expression cassettes, a strong but inducible promoter was chosen. This choice allows the metabolic burden to be limited, if necessary, while, when fully induced, enabling high expression levels of the alternative gene so that it can be preferentially incorporated into the phage particles. The selected promoter was pL-lac0-1 (part BBa_J428041) that combines the strength of the pL promoter, with the possibility to regulate it, derived from the lacO binding site. Indeed, if the cell expresses the lacI gene at high levels, it becomes feasible to induce the expression of the gene regulated by this promoter through the addition of isopropyl β-D-1-thiogalactopyranoside (IPTG) to the culture medium. Moreover, by employing varying concentrations of the inducer, it is possible to finely adjust the expression level, thus creating a controllable system that is in line with the requirement of promoting cell vitality. Similarly, the choice of the ribosome binding site (RBS) fell on a strong element to ensure that, upon transcript production, it is efficiently translated by ribosomes. This decision aligns with the aim of favoring the production and incorporation of this gene product over the gene 3 of phage M13. Hence, the RBS chosen was part BBa_B0034.

Finally, the terminator was selected based on its strength and efficiency, with a preference for an intrinsic terminator, independent from the expression of cellular proteins. Therefore, the T1 terminator of E. coli was opted (part BBa_B0010).

Synthetic biology is based not only on the standardization of parts but, as the name suggests, on the synthesis of DNA, made possible by the development of new technologies. For this reason, the sequences designed as described above were synthesized in vitro by IDT Technologies. However, before sending the sequences, it was deemed necessary to perform the codon optimization for the two gene sequences. Codon optimization refers to the modification of base triplets that encode the same amino acid, in a way that optimizes the genetic code according to the characteristic frequency of codon usage in the target organism. It is used in genetic engineering when expressing heterologous proteins in an organism to achieve ideal expression efficiency. In the case under consideration, codon optimization was performed at two levels: the first aimed to harmonize the genetic code to favor the expression of the encoded protein in E. coli, and the second to optimize the sequence for synthesis, thus avoiding the presence of repeated sequences and hairpins.

The first step was carried out using a bioinformatic tool known as CODONATOR 3000, developed by the Sydney Australia iGEM team in 2019 (Ofner et al.). This software can provide four different results, each optimizing codons based on different rules. In this case, the optimization was chosen according to the relative proportion in which the starting codon is translated into the codon of the target species (specifically E. coli) based on the most similar frequency of occurrence, considering the bias in amino acid appearance.

The second step was performed using IDT Technologies' proprietary tool, which aimed to eliminate repeated sequences and low complexity regions that could complicate the synthesis process. Using the same algorithm, a total of four forbidden restriction sites, present in the two sequences, were also deleted, to ensure compatibility with the Biobrick standard.

In the end to assemble the final cassettes all the aforementioned elements were put together in the right order, with some minor adjustments. For both the constructs the standard prefix and suffix were added at the 5’ and 3’ ends respectively. The suffix is then followed by the pL-lacO-1 sequence accompanied by the RBS. To the cassette expressing the Gp3_cut gene was added a start codon (ATG), coding for methionine and a double stop codon (TAA). Whereas to the cassette expressing the Gp3_Pf3 was added just a double stop codon (TAA). Both the coding sequences are then followed by the T1 terminator and the suffix. Thus, the two cassettes were assembled and registered as part BBa_K4727006 (G3 deleted gene) and BBa_K4727007 (Pf3 gene).

Once the cassettes were cloned into the helper plasmid pSB3KM13, the construct was transformed into the E. coli MG1655 Z1 cell line, which has the characteristic of constitutively expressing the regulatory genes LacI, AraC, and TetR at high levels. Consequently, this allows the utilization of the regulated cassettes under suitable promoters in presence of the correct inducer (in this case IPTG). Finally, to produce phage particles with altered tropism, it is sufficient to co-transform the helper plasmid here assembled with an M13 phagemid, such as pTZ19R. The phagemid, due to the presence of the encapsidation sequence, can be inserted into the assembled phage particles. Consequently, it becomes possible to generate phages carrying a desired genome with modified tropism, thereby targeting specific cellular receptors. Thus, the described modifications propose a versatile platform for facile and consistent production of phage particles carrying a desired genome, with the potential to easily alter tropism through standardized genetic components. This method allows the tropism-determining genes of the phages to become standardized parts, readily exchangeable among them.


References

  1. Kortright, K. E., Chan, B. K., Koff, J. L. & Turner, P. E. Phage Therapy: A Renewed Approach to Combat Antibiotic-Resistant Bacteria. Cell Host Microbe 25, 219–232 (2019).
  2. Liu, D.; Van Belleghem, J.D.; de Vries, C.R.; Burgener, E.; Chen, Q.; Manasherob, R.; Aronson, J.R.; Amanatullah, D.F.; Tamma, P.D.; Suh, G.A. The Safety and Toxicity of Phage Therapy: A Review of Animal and Clinical Studies. Viruses 2021, 13, 1268. https://doi.org/10.3390/v13071268
  3. Caflisch KM, Suh GA, Patel R. Biological challenges of phage therapy and proposed solutions: a literature review. Expert Rev Anti Infect Ther. 2019 Dec;17(12):1011-1041. doi: 10.1080/14787210.2019.1694905. Epub 2019 Dec 2. PMID: 31735090; PMCID: PMC6919273.
  4. Citorik, R. J., Mimee, M. & Lu, T. K. Sequence-specific antimicrobials using efficiently delivered RNA-guided nucleases. Nat. Biotechnol. 32, 1141–1145 (2014).
  5. Krom, R. J., Bhargava, P., Lobritz, M. A. & Collins, J. J. Engineered Phagemids for Nonlytic, Targeted Antibacterial Therapies. Nano Lett. 15, 4808–4813 (2015).
  6. Chasteen, L., Ayriss, J., Pavlik, P. & Bradbury, A. R. M. Eliminating helper phage from phage display. Nucleic Acids Res. 34, e145–e145 (2006).
  7. Ando, H., Lemire, S., Pires, D. P. & Lu, T. K. Engineering Modular Viral Scaffolds for Targeted Bacterial Population Editing. Cell Syst. 1, 187–196 (2015).
  8. Yehl, K. et al. Engineering Phage Host-Range and Suppressing Bacterial Resistance through Phage Tail Fiber Mutagenesis. Cell 179, 459-469.e9 (2019).
  9. Karlsson, F., Borrebaeck, C. A. K., Nilsson, N. & Malmborg-Hager, A.-C. The Mechanism of Bacterial Infection by Filamentous Phages Involves Molecular Interactions between TolA and Phage Protein 3 Domains. J. Bacteriol. 185, 2628–2634 (2003). [10] Nilsson, N., Malmborg, A.-C. & Borrebaeck, C. A. K. The Phage Infection Process: a Functional Role for the Distal Linker Region of Bacteriophage Protein 3. J. Virol. 74, 4229–4235 (2000).
  10. Holliger, P., Riechmann, L. & Williams, R. L. Crystal Structure of the Two N-terminal Domains of g3p from Filamentous Phage fd at 1.9 AÊ : Evidence for Conformational Lability. J. Mol. Biol. 288, 649–657 (1999).
  11. Lubkowski, J., Hennecke, F., Plückthun, A. & Wlodawer, A. The structural basis of phage display elucidated by the crystal structure of the N-terminal domains of g3p. Nat. Struct. Biol. 5, 140–147 (1998).
  12. Levengood, S. K., Beyer, W. F. & Webster, R. E. TolA: a membrane protein involved in colicin uptake contains an extended helical region. Proc. Natl. Acad. Sci. 88, 5939–5943 (1991).
  13. Marvin, D. Filamentous phage structure, infection and assembly. Curr. Opin. Struct. Biol. 8, 150–158 (1998).
  14. Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J. (1997) “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.” Nucleic Acids Res. 25:3389-3402.