Engineering

Lignans are a class of naturally occurring active components with antiviral activity and glycosylation modifications are important for the structure and bioactivity of lignans. Lignan glycosyltransferase is a component involved in the biosynthesis of lignan glycosides, and our group is committed to developing a more efficient and stable synthetic biology method to produce this glycosyltransferase.


Design

Over the past three years, influenza viruses such as the new coronavirus have seriously affected people's lives, people have taken Chinese patent drugs such as Banlangen Granules, Qingkailing Granules, Lotus Clearance Capsules, etc., to fight against viral infections. And all of these Chinese patent drugs have a common ingredient in them, Banlangen. The root is the dried root of I. indigotica, which has the effect of clearing heat and removing toxins. It is commonly used clinically for the treatment of influenza, herpes simplex virus, and respiratory syncytial virus. Its active ingredients are mainly divided into three main categories: lignans, alkaloids, and polysaccharides. The group confirmed in vitro antiviral experiments in the early stage that lariciresinol and lariciresinol monoglycoside have significant activity. In addition, academician Zhong Nanshan's team proved through clinical trials that lignans in I. indigotica, represented by lariciresinol monoglycoside and bisglycoside (gibberellic acid B), have anti-influenza A and B virus activity, and pointed out that gibberellic acid B has a stronger pharmacological activity. It suggests that the glycosidic group is a necessary structure for antiviral activity and that different forms of glycosidization lead to variability in antiviral activity.

The aim of this study was to discover the enzyme that catalyzes the glycosylation of lignans in I. indigotica, to provide key bioparts for the heterologous and efficient synthesis of the active ingredient, and to achieve the targeted synthesis and accumulation of the target active ingredient, lignan glycosides. Enzymes are a class of catalytically active proteins, which are polypeptide chains with spatial structure, and the key to exerting catalytic function is the difference in spatial structure. The key to the catalytic function is the difference in spatial structure. Current research has found that the more similar the amino acid sequences are, especially the key amino acid sequences, the more similar the spatial structure of the proteins, and the more the catalytic function is the same. Based on this principle, we can obtain the UGT gene sequence from the whole genome data of I. indigotica, and compare the amino acid sequence similarity with the known enzyme gene/amino acid sequences, and then screen to get the candidate genes. The specific work is to construct the lignan glycoside biosynthesis pathway, and the whole experiment includes gene screening and cloning, then constructing the prokaryotic expression system, and protein induction to obtain the recombinant proteins. Then using the recombinant protease to verify the in vitro activity, completing the characterization of the enzyme by determining the enzyme activity mechanistic parameter, and finally designing it into the completed biopart.

Figure 1. Schematic diagram of synthetic biology

Based on the reported transcriptomic data of I. indigotica, the sequences of genes with glycosyltransferase functions were screened and a phylogenetic tree analysis was conducted with the functionally characterized lignan UGTs, and UGT72B2 was selected as the candidate. We then constructed expression vectors using homologous recombination and ultimately verified protein function.

A phylogenetic tree analysis of the UGT sequences of I. indigotica was performed with well-characterized UGTs. UGT72B2 showed the highest homology to UGT71B5-1 (Figure 2), which has been shown to catalyze the glycosylation of lignan, and is therefore also hypothesized to catalyze the glycosylation of lignan.

Figure 2. A neighbor-joining phylogenetic tree of UGTs from I. indigotica and selected plants.

Xho I and Nco I were chosen as the cleavage sites to design primers for the construction of UGT72B2-pET-32a+ prokaryotic expression vector (Figure 3).

Figure 3. UGT72B2-pET-32a+ prokaryotic expression vector.

Results

1. Identification and clone of UGTs

PCR amplification was performed using the cDNA library of I. indigotica as the template, and the product was detected by 1% agarose gel electrophoresis, which showed a specific fragment at about 1500 bp, which was consistent with the expected result of the full length of the UGT72B2 gene (Figure 4). Sequencing of the cloning result confirmed that it was identical to the full length of the sequence obtained from the database.

Information analysis of the sequence using the open reading frame (ORF) Finder (https://www.ncbi.nlm.nih.gov/orffinder) online software revealed that the full length of the ORF sequence of the UGT72B2 gene was 1455 bp, coding for 484 amino acids, with a relative molecular mass of 53,269.86, a relative molecular mass of 53269.86, a theoretical isoelectric point of 5.90, and a total average protein hydrophobicity of -0.074, making it an unstable protein.

Table 1. Primers in construction of UGT72B2-pET-32a+ expression vector.

Figure 4. PCR amplification of UGT72B2 gene from I. indigotica. M: marker, 1-4: UGT72B2 gene PCR amplification band.

2. Expression and purification of UGT72B2

The recombinant expression vector was transformed into E. coli Rosetta (DE3) competent recipient cells after sequencing correctly and used as the subsequent protein expression. pET-32a+ vector was transformed into Rosetta (DE3) recipient cells by the same treatment as the negative control.

The recombinant protein expression was detected by SDS-PAGE after induction by IPTG. As shown in the Figure 4, the strain containing the recombinant plasmid UGT72B2-pET-32a+ (the relative molecular mass of the target protein of UGT72B2 is about 53 kDa) has a protein band near 70 kDa; whereas, the control strain with pET-32a+ empty vector has no protein expression at the same position and only expresses His-Tag tagged protein near 18 kDa, and the results indicate that UGT72B2 was successfully expressed in E. coli Rosetta (DE3).

Western blotting was further utilized to detect whether the resulting protein was the target protein. The results are shown in Figure 4. Compared with the empty vector control, UGT72B2-pET-32a+ expression signal was detected with a single band and good purity.

The His-tagged fusion proteins were purified by Ni-NTA affinity chromatography. The concentration of purified UGT72B2-pET-32a+ fusion protein was determined by BCA quasi-curve method, and finally the UGT72B2 recombinant protein at a mass concentration of 3.60 mg/mL was obtained to be used as the subsequent enzyme activity experiments.

Figure 5. Expression and purification of UGT72B2. (A)The correct clones were selected for culture. (B) Ultrasonic crushing of bacteria. (C) SDS-PAGE electrophoresis. (D) Proteins transferred from the gel onto a PVDF membrane for antibody staining and detection. (E) SDS-PAGE and western blot analysis of purified proteins.

3. Substrate promiscuity of UGT72B2

(+)-Pinoresinol, (+)-lariciresinol, secoisolariciresinol, and matairesinol were selected as potential substrates for the catalytic analysis of UGT72B2. The reaction mixture was in a total volume of 50 μL containing phosphate buffer saline (pH 7.4), 2 mM UDP-glycoside, 200 μM substrates, and 10 μg of purified proteins. The mixture was preincubated at 30 °C for 10 min without proteins and then incubated for 12 h at 30 °C with a supplement of proteins. The enzyme activity products were characterized using UPLC-Q-TOF/MS.

Figure 6. UPLC-Q-TOF/MS for productions of UGT72B2 with lignans as substrates. 1: (+)-lariciresinol, 2: (+)-pinoresinol, 3: secoisolariciresinol, 4: matairesinol, 5 and 6: lariciresinol-4-O-β-D-glucopyranoside and lariciresinol-4'-O-β-D-glucopyranoside, 7: pinoresinol-4-O-β-glucopyranoside, 8: secoisolariciresinol-O-β-D-glucopyranoside and 9: matairesinol-O-β-D-glucopyranoside.

Table 2. Parameters in MS spectrum of compounds in Figure 5.

According to previous studies, UGTs have a broad range of promiscuity towards the substrates of sugar acceptors. In our research, we found that with UDP-glycoside as the sugar donor, UGT72B2 could catalyze all tested substrates including (+)-pinoresinol, (+)-lariciresinol, secoisolariciresinol, and matairesinol. So UGT72B2 is a catalytic enzyme with relatively heterogeneous substrates. However, the catalyzed products were only monoglycosides, and the formation of diglycosides was not detected. Taken together, UGT2B2 catalyzes the monoglycosylation of lignans with substrate promiscuity.

4. Enzyme Kinetic Parameters of UGT72B2

Considering that lariciresinol has much more pharmaceutical properties than others, we focused on the kinetics of UGT72B2 with lariciresinol as the substrate. The reaction mixture was the same as mentioned above, but the reaction time was shortened to 20 minutes. The enzyme activity products were characterized using LC-MS/MS.

The kinetic parameters were analyzed through the Lineweaver-Burk plot. As a result, the catalytic efficiency of UGT72B2 for pinoresinol was as follow: Km 189 µM, Vmax 0.1079 µM/min, [E] 3.75 µM, Kcat 0.028min-1.

Figure 7. The double reciprocal equations of substrate concentration [S] and the initial rate of enzyme catalysis [v].

5. Future work

We have completed the set goal of the laboratory in the summer, but whether this UGT we produced can be applied to the production of the factory still needs further experiments. Our next goal is to compare the conversion rates of UGT72B2 to other UGTs (UGT71B2, UGT71B5, UGT4) for glycosylation of different lignans, select the UGT with the highest conversion rate as the target UGT, and construct the heterologous synthesis from coniferyl alcohol to lariciresinol. Further characterization of new UGTs and finding more specific UGTs with better catalytic efficiency is also very necessary for this project.

Figure 8. Equipment for further experiments: bioreactor (pilot test)

Conclusion

In this study, we cloned and obtained the UGT72B2 gene, and for the first time, we comprehensively examined and validated the catalytic features of this gene. The constructed part provides a basis for realizing the heterologous and efficient synthesis of lignan or lignan-derivate glycoside components.