Overview

The main objective of the proof of concept is to demonstrate whether or not our project is feasible. We synthetically designed short sequences (mimicking the viral mRNA) that we attempt to detect with our system based on Cas13. To achieve this purpose, numerous steps were required, which are presented below.

NCBI Search : Sequence Retrieval and Evaluation

The focus of our project is the West Nile Virus (WNV). The sequences were retrieved from the viral genome section of the National Center for Biotechnology Information (NCBI).

On those sequences, a BLAST (Basic Local Alignment Search Tool) was performed, with a maximum target sequence limit at 5 000 (the maximum possible). By fine tuning the parameters “Percent Identity” and “Query Coverage" with values respectively between 85-100 and 95-100, we were able to exclude most of the sequences non-related to WNV.

Partial genome and coding sequences were also excluded from the analysis, leading to a dataset of 223 sequences. Those were then downloaded as fasta files.

Multiple Sequence Alignment

The fasta sequences above were subjected to multiple sequence alignment, using the MAFFT tool. The resulting alignment was saved in the Clustal format. In order to visualize the alignment and color the sequences with 100% identity, the Jalview website was used.

The sequences of interest were selected based on a length criterion. The minimum of 25 nucleotides is required, although 28 nucleotides are recommended (28+ nucleotides were used for our project).

The following target sequences were obtained for the WNV, derived from the 223 sequences:

**Fig 1.1 :** First target sequence of the WNV (34 nt): CTCTCGGATTCCTCAATGAAGACCACTGGCTGGG

**Fig 1.2 :** Second target sequence of the WNV (29 nt): AAGACACCAGAAGGAGAAATTGGGGCAGT

First conserved sequence : Code for a catalytic core domain of Flavivirus RdRp (RNA dependent RNA polymerase).
It make sense that it is relatively conserved as RdRp is responsible of RNA replication of viruses.

Second conserved sequence : Code for a peptidase S7 : NS3 serine protease. Flavivirus like WNV virus encodes
only one polyprotein precursor. This precursor will be able to form mature protein under the action of the complex
NS2B-NS3. Therefore it make sense that this particular sequence is conserved among variants of WNV as it is resposible of proteins production.= https://www.ebi.ac.uk/interpro/entry/InterPro/IPR001850/

DNA Sequences Amplification

Once we identify the sequences to target, the aim was to synthetise them as well as guide RNA, we used Integrated DNA Technologies (IDT) for this. You may ask yourself why use DNA if we are studying RNA viruses. Well, the main reasons are :

DNA synthesis is cheaper

DNA is more stable than RNA

Easier Amplification (PCR vs RT-PCR)

Afterward, we employed PCR (Polymerase Chain Reaction) to increase yield of our target, and guide sequences. Different forward and reverse sets of primers were used for this purpose (see tables below) :

Target :

The forward primer starts with the enhancer, followed by the T7 promoter, an upstream sequence (-20 nucleotides
before the target), 28 nucleotides of the actual target sequence, and finally a downstream region of the target site
of 20 nucleotides. Those 20 nucleotides upsteam and downstream are used to keep the general spacial context of

The reverse primer is composed of the reverse complement of the +20 nucleotides sequence and the reverse
complement of the 28 nucleotides to target.

You will find below the target DNA sequence after amplification

Guide :

The synthesis of primers for the guides use the same principle as the target sequences. The difference is that the guide need to be able to hybridize with the 28 nucleotide sequence site of the target, also it contain a scaffold that is specific to the type of Cas used: CasRX or LwCas13a (see Fig 3).

**Fig 3:** Scheme for the synthesis of LwCas13a and CasRX guides primers

**Tab 1.3:** Table of the LwCas13a guide sequences

Below is repertoried, the tables containing the sequences of our LwCas13a and CasRX guides.

**Tab 1.4:** Table of the CasRX guide sequences

Transcription

The sequences were transcribed through T7 transcription. It is a method used for rapidly making a lot of RNA copies using only a single piece of DNA. Accurate and efficient, it possesses the downside of being able to transcribe one sequence at a time, while other techniques are able to do multiple one simultaneously.

The quality of the RNA was assessed using a nanodrop spectrophotometer. Another efficient way to measure RNA concentration, that was not available to us, is to use the Qubit technology. Considering the nanodrop as a tool to get a quick estimation of the concentration, in this way the Qubit is a tool giving an exact measurement. It’s a choice between the speed and the accuracy that must be done when measuring RNA.

Cas Activity Assessment

Verification of the LwCas13a enzyme functionality was made through a fluorescence detection assay, performed on a Cytation plate reader. Gel electrophoresis was deemed unsuitable, according to the low cutting efficiency, as the band cleavage would not have been observable on a gel. The results of the fluorescence detection are presented in the Results section.

Further informations are available in the:

Engineering section for the designing of the sequences and experiments

Experiments section for the protocols and details about the experiments

Notebook section for the access to the lab notebook

Results section for the results and discussion

Perspective section for information about what can be done in future experiments and how the project can be applied outside of the lab

Validation

The results of these experiments serve as evidence of proof of concept of our project. This foundational work can lead to further investigations, including applications in mosquito and diseases control.

During this project, many results were achieved, including making chemically competent cells, transforming a plasmid in the so-said cells, making de novo sequences through primer amplification, with sequences that are able to proceed a transcription step and even work with enzymes to make fluorescent detections. More details can be found below, including the steps, the results and the discussion of everything that was completed.