In the face of tackling the blood shortage crisis, we were able to contribute in the following ways. Via events and social media, we educated people on the problems of blood shortage the world is currently facing. We presented not just solutions to help solve this problem on a big scale, such as converting blood types to the universal O-antigen blood, but also showed how every single person can contribute to solving this shortage. We facilitated a bioinformatics-based workflow to screen for and assist in designing novel enzymes that in theory can specifically modify ligands, such as saccharides, on antigens or other proteins. We demonstrated this by designing candidate enzymes that specifically convert a galactose/ fucose residue to a keto-galactose residue on the type B blood antigen using publicly available software and scripts. This workflow has the potential to support researchers in their design of substrate modifying enzymes for pharmaceutical purposes. However, we feel obliged to note that this workflow currently lacks the proper wetlab-based proof of concept. To prove or dispute this workflow, we challenge ourselves as well as other scientists to make use of this toolchain and computational foundation we have provided and report on its accuracy and usefulness.
The GOX mechanism suggests that the substrate and active site copper share a covalent bond as intermediate, suggesting a minimal distance to the copper ion as suitable parameter for predicting if an enzyme will be active on a given docked ligand. This mechanism also suggests that it is possible to catalyze any sugar hydroxy group given, that it is capable of forming a copper covalent bond.
In order to analyze if molecular docking is a meaningful model for our analyses of finding enzymes that could potentially act on specific ligands, we tested docking the known substrate galactose to an experimentally solved structure of GOX, as well as glucose, which was known not to be a substrate of GOX. This docking was performed using Glide (as part of maestro).
Docking of galactose to GOX showed that Glide docked the ligand inside the active site, therefore suggesting that docking potentially is a biologically meaningful model.
Docking of glucose to GOX showed that Glide docked the ligand inside the active site, but with a worse fit compared to galactose, approximately 0.4 Ångström further away from the copper ion, suggesting fit into the active site to be worse for glucose.
We also investigated if modelling ligand binding strength was a suitable model for prediction of ligand activity for GOX using molecular mechanics with generalised Born and surface area solvation (MMGBSA) using Maestro.
Using MMGBSA simulations of glucose and galactose we were unable to show a difference in binding energy of glucose and galactose to GOX, although glucose has a very high variance of binding energy between docked poses (10 Simulations per ligand, ligands were first docked using Glide and docked poses were then scored on binding energy using MMGBSA).
As GOX is effectively inactive with glucose as the substrate, but turns over Galactose [1], this is suggesting that molecular mechanics predicting ligand binding strength is either not a biologically meaningful model, or binding strength is less relevant to predicting ligand activity than active site fit, and that instead active site fit and distance to the copper ion is a more meaningful metric to judge likelihood of a ligand being able to be a substrate of an AA5 family enzyme.
In order to analyze the structural, as well as the presumed functional diversity of our enzyme database, which would allow us to search the structural space easier by excluding similar enzymes, we integrated data from structural alignments of generated alphafold structures with a phylogenetic tree (phyml), as well as data on conserved domains found in the enzyme sequences (CDD/SPARCLE).
The phylogenetic tree of AA5 shows clades. These clades are comprised of enzymes with different conserved domains and different active site geometries of the AlphaFold structures of it s members, of which derived groups match up with observed clades in the phylogenetic tree. This suggests difference in function, which is supported by the differing conserved domains, which are most likely due to differing functional interactions, roles or substrates of the enzymes, as well as differing enzyme active site geometry, all of which in this case suggests a difference in ligands that can potentially be catalyzed by the enzyme. Enzymes A0A0A1TMB5 (TMB5) and A0A178E5X4 (E5X4), belonging to the clade1 (dark green), have docking results of the tetrasaccharide model of Antigen B with < 3 Å distance (OH 3 for TMB5) to the copper ion, which, in theory means that a covalent bond might be possible.
TMB5 has a low distance of the second (2.91 Å) and third (2.89 Å) hydroxy group of the terminal galactose to the active site and we wanted to see if there were any activity, how selective the activity was for each of these groups.
E5X4 has a small distance (2.69 Å) of the fith hydroxy group of the fucose to the copper ion, which is also an interesting modification that we wanted to explore along TMB5, for potential other modifications of fucose containing blood antigens, for example glycan tagging.
In our experimentes we aimed to assemble our final plasmid via gibbson assembly, transform the plasmid into E. coli, induce and purify the protein for use in a peroxidase assay, in which we wanted to test the amount of peroxide production on different substrates. Peroxide is a sideproduct of the reaction these enzymes catalyze and can therefore be used to test for activity of the enzyme.
The gene fragments were synthesized by a commercial supplier (twist bioscience and IDT). Gene fragments were first amplified by PCR using taq polymerase. The Plasmid, which we used for our cloning was pXZ11, which was a gift by Dr. Xiaobo Zhong. This plasmid was also amplified by PCR using Q5 polymerase to intoduce Gibbson overhangs and linearized the plasmid for gibbson assembly.
The Plasmids were isolated via miniprep and sequenced using sanger sequencing.
The target protein has a molecular weight of 52,8 kDa and should be visible in the area marked yellow in the picture. From the picture it is visible that a protein with the right molecular mass was present until the eluation, but disappears here. Furthermore there are too many proteins visible in the eluation, leading to the conclusion that the purification failed. There is no protein present in the other fractions. It is unknown why the purification failed and due to time constraints not possible to repeat the experiment, leading to an inconclusive result. the next step would be to troubleshoot the protein purification and attempt it again.
[1] ChemBioChem2002 WILEY-VCH-Verlag GmbH 1439-4227/02/03/08 Modification of Galactose Oxidase to Introduce Glucose 6-Oxidase Activity Lianhong Sun,Thomas Bulter,Miguel Alcalde,Ioanna P. Petrounia and Frances H. Arnold