Model

Goal

The goal of our modelling was to optimize the yield of a target product in our Komagataella pastoris (formerly Pichia Pastoris) strains through knockouts or gene overexpression.

Methods

Model

We used the model iMT1026v3, which is an updated version of the iMT1026 model. The model's accuracy was improved by updating biomass equations and validated with flux balance analysis (FBA). The updated model has improved predictive capabilities over a wider range of substrates, providing valuable insights for bioprocess engineering in Komagataella pastoris. (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5743807/)

Target

The primary target we selected was the increase in production of Acetyl CoA. There were multiple reasons for this selection.

  1. Acetyl CoA is rate limiting in sterol production. (https://www.nature.com/articles/cr200861)
  2. Acetyl CoA is key engineering target for a wide range of strain design problems meaning that our results can be more easily compared to and contextualized within an existing body of literature.
  3. Acetyl CoA is needed for the Production of all three of our Products in the three strains we designed meaning that our Yield optimization strategy may be effective for all designed strains without the need for individual optimization.
  4. Our results will be generalizable to other strain design problems in Komagataella pastoris that respond to an increased availability of Acetyl CoA.

Knockouts-Strategies

OptKnock

OptKnock is designed to identify gene knockouts within genome scale models that will lead to overproduction of a desired product, while still allowing for fast growth. Optknock is also designed to facilitate growth coupling. Growth coupling is when the production of the desired compound is tightly coupled to the growth of the microorganism. (https://onlinelibrary.wiley.com/doi/abs/10.1002/bit.10803)

OptGene

OptGene operates similarly to OptKnock but builds on the concept by using Evolutionary Algorithms and Simulated Annealing to optimize sets of gene deletion for the desired objective function. Simulated annealing is a probabilistic optimization strategy that iteratively explores the solution space by accepting worse solutions with a decreasing probability, controlled by a temperature parameter. Evolutionary algorithms are optimization techniques inspired by natural selection. They maintain a population of candidate solutions, iteratively applying genetic operators like selection, crossover, and mutation to evolve and improve solutions over generations. Both techniques eventually converge towards an optimal or near-optimal solution. The strains are then simulated using FBA or Minimization of Metabolic Adjustment (MOMA). (https://core.ac.uk/download/pdf/55609541.pdf) In our case MOMA was used.

Overexpression-Strategies

flux scanning based on enforced objective flux (FSEOF)

In FSEOF you first enforce a flux directed toward your target product and then search for variations in the metabolic fluxes caused by this change. Fluxes that increase because of this change are assumed to be possible overexpression targets. The flux through your target is increased iteratively so that you can determine the relationships between the fluxes. (https://www.researchgate.net/publication/230713107_Flux_variability_scanning_based_on_enforced_objective_flux_for_identifying_gene_amplification_targets)

The Cameo package was used to perform all three of these metabolic modelling techniques. (https://github.com/biosustain/cameo)

Results

OptKnock

Target: Acetyl CoA c

We ran the OptKnock algorithm on the iMT1026v3 model with Acetyl CoA cytosolic and Acetyl CoA mitochondrial.

  • 1 KO: no Results
  • 2 KO: Dihydroneopterin triphosphate pyrophosphatase + succinate dehydrogenase Dihydroneopterin monophosphate dephosphorylase + succinate dehydrogenase Alternative oxidase + glutamate dehydrogenase (NADP)
  • 3 KO: no Results
  • 4 KO: Acetyl-Coa carboxylase, mitochondrial + Alternative oxidase + methenyltetrahydrifikate cyclohydrolase + Sedoheptulose 1,7-bisphosphate D-glyceraldehyde-3-phosphate-lyase Acetyl-Coa carboxylase, mitochondrial + Alternative oxidase + methenyltetrahydrifikate cyclohydrolase + phosphofructokinase malate dehydrogenase + glutathione oxidoreductase + Alternative oxidase + catalase
  • 5 KO: no Results
  • 10 KO: no Results

Target: AcetylCoA m

  • 1 KO: ergosterol ester hydrolase, (Not Useful for our goals)
  • 2 KO: pyruvate decarboxylase + fructose-bisphosphate aldolase pyruvate decarboxylase + ALG2 (glycan reaction) pyruvate decarboxylase + glycan reaction (several combinations) pyruvate decarboxylase + Dolichyl-diphosphate phosphohydrolase
  • 3 KO: Glycerol dehydrogenase (NADP-dependent) + Chorismate pyruvate lyase pyruvate carboxylase + Malonyl-CoA-ACP transacylase, mitochondrial pyruvate decarboxylase + yUMP synthetase transaldolase + glutamate dehydrogenase
  • 4 KO: NADH dehydrogenase, cytosolic/mitochondrial + Sedoheptulose 1,7-bisphosphate D-glyceraldehyde-3-phosphate-lyase + glutamate dehydrogenase (NADP) + D-lactate dehydrogenase NADH dehydrogenase, cytosolic/mitochondrial + + glutamate dehydrogenase (NADP) + D-lactate dehydrogenase + phosphofructokinase
  • 5 KO: no Results
  • 10 KO: no Results

OptGene

We ran the OptGene algorithm on the iMT1026v3 model with Acetyl CoA cytosolic and Acetyl CoA mitochondrial.

No soloutions found for both mitochondrial and cytosolic Acetyl-CoA

Flux scanning based on enforced objective flux (FSEOF)

Here we used the iMT1026v3 model to perform FSEOF with mitochondrial Acetyl CoA as a target.

Target: AcetylCoA m Flux begining Flux end Change
H2Ot H2O transport via diffusion PAS_chr3_0763 -278.174 -356.926 0.787521
GLUDxi glutamate dehydrogenase (NAD) PAS_chr2-1_0311 0.507641 0.055312 0.452329
PDHcm pyruvate dehydrogenase (dihydrolipoamide dehydrogenase) PAS_chr2-1_0089 or PAS_chr2-2_0048 or PAS_chr1-3_0094 0.605326 0.329448 0.275878
FRDO frdo PAS_chr3_0058 -0.03704 -0.30093 0.263891
ICDOXSUCm Isocitrate:NADP+ oxidoreductase PAS_chr2-1_0120 and PAS_chr4_0580 0.283293 0.063642 0.219651
OSUCCCLm oxalosuccinate carboxy-lyase (2-oxoglutarate-forming) PAS_chr2-1_0120 and PAS_chr4_0580 0.283293 0.063642 0.219651
SUCD1m succinate dehydrogenase PAS_chr3_0225 0.230071 0.057793 0.172278
SUCD3_u6m succinate dehydrogenase (ubiquinone-6), mitochondrial PAS_chr3_1111 and PAS_chr2-2_0283 and PAS_chr4_0733 and PAS_chr1-4_0487 and PAS_chr3_0424 0.230071 0.057793 0.172278
AKGDH1 2-Oxogluterate dehydrogenase PAS_chr2-1_0089 0.22385 0.05711 0.16674
AKGDH2 2-Oxogluterate dehydrogenase PAS_chr2-1_0089 0.22385 0.05711 0.22385 0.05711 0.16674
10FTHFtm 10-Formyltetrahydrofolate mitochondrial transport via diffusion -0.03699 -0.20197 0.164986
GLUDyi glutamate dehydrogenase (NADP) PAS_chr1-1_0107 0.836535 0.685176 0.151359
Ht H+ diffusion -0.41045 -0.53987 0.129425
PGK phosphoglycerate kinase PAS_chr1-4_0292 -0.70111 -0.80223 0.101124
GAPD glyceraldehyde-3-phosphate dehydrogenase PAS_chr2-1_0437 0.701107 0.802231 -0.101124
GLNS glutamine synthetase PAS_chr4_0785 0.088692 0.207654 -0.118962
FUM fumarase PAS_chr3_0647 0.038788 0.20217 0.038788 0.20217 -0.163382
MTHFCm methenyltetrahydrifikate cyclohydrolase, mitochondrial PAS_chr1-4_0632 0.036985 0.201971 -0.164986
MTHFDm methylenetetrahydrofolate dehydrogenase (NADP), mitochondrial PAS_chr1-4_0632 0.036985 0.201971 -0.164986
THFtm 5,6,7,8-Tetrahydrofolate transport, diffusion, mitochondrial  0.036985 0.201971 0.036985 0.201971 -0.164986
PPA inorganic diphosphatase PAS_chr1-3_0028 or PAS_chr1-3_0070 or PAS_chr2-1_0101 0.204803 0.41832 -0.213517
PGCD phosphoglycerate dehydrogenase PAS_chr2-1_0657 0.06652 0.304171 -0.237651
PSERT phosphoserine transaminase PAS_chr3_0566 0.06652 0.304171 -0.237651
PSP_L phosphoserine phosphatase (L-serine) PAS_chr4_0285 0.06652 0.304171 -0.237651
PIt2r phosphate reversible transport via symport PAS_chr3_0495 or PAS_chr4_0337 or PAS_chr2-1_0235 0.043446 0.301635 -0.258189
ADK1 adenylate kinase PAS_chr3_0257 0.105745 0.407435 -0.30169
NH4t ammonia reversible transport PAS_chr1-4_0394 or PAS_chr2-2_0391 0.401186 0.736762 -0.335576

Conclusion

We have successfully identified potential target genes for both overexpression and knockout. It could potentially be beneficial to do modelling on a similar organism like Saccharomyces cerevisiae that has newer and more refined models and compare these results to the results we generated.

More experimental work is necessary to validate our results and further optimize the yield.