Modeling

Machine Learning to study HRT Access

This page uses neural networks trained on the SWAN dataset to provide predictions on Hormone Replacement Therapy (HRT) accessibility for menopausal women based on various socioeconomic and health factors. By inputting details such as race, menopause status, age, income, and frequency of hot flashes, one can receive personalized predictions on their likelihood of accessing different HRT treatments.

SVG Image

Enter details to see HRT accessibility prediction:


Race:

Menopause Status:

Age:

Language:

Insurance:

Affordibility:

Income:

Hot Flashes per 2 weeks:

Therapy Access Likelihood Predictions:


Outputs:

Estrogen 110

Estrogen Injection 110

Estrogen Progesterone Combination 110

Other Hormone Treatment


Key:

Estr110 represents Estrogen pills (such as Premarin, Estrace, Ogen, etc).

EstrInjec110 is Estrogen treatment by injection or patch (such as Estraderm).

EstrProgComb110 is Combination estrogen/progestin treatment (such as Premphase or Prempro).

OtherHormone1 represents miscelleneous other menopause treatments.


Flux Balance Analysis


In addition to the prior model, our dry lab team also implemented a computational project utilizing flux balance analysis to explore the metabolic model of our recombinant E. Coli to optimize for the production of Daidzein and Equol by running knockout analyses to channel flux towards our target pathway.


We encourage you to explore the github repository, download the code and play around with our FBA.py script on your own models! The repository can be found here:

FBA Github Repo

This was a research project focused on optimizing the biosynthetic pathway from p-coumaric acid to daidzein. The endeavor utilized Flux Balance Analysis (FBA) models and CobraPy solver, illustrating the journey from initial obstacles through adaptive problem-solving, to significant discoveries that informed subsequent work.


Initial Hurdles and Strategic Shift

The project's commencement was marked by a series of complexities while employing the traditional CobraPy solver. The occurrence of inconsistent values in knockout analysis and confounding results from identical analyses led to a decision to transition to the Gurobi solver. Through Linux terminal commands, the Gurobi solver was integrated into the existing setup, significantly improving the stability and consistency of the analysis results based on control tests.


Flux Variability Analysis and Model Verification

The extensive time and computing power requirements of a full genome knockout analysis led to the application of flux variability analysis (FVA). This shift allowed the identification and concentration on relevant genes impacting the biosynthetic pathway. To ensure the credibility of this model and chosen method of analysis, sanity checks were performed. Key genes directly involved in the p-coumaric acid to daidzein pathway were knocked out, and the anticipated changes in flux for the targeted enzymes confirmed the model was working.


Identifying Shortcomings and Changing Focus

Despite the improvements made, limitations surfaced while attempting gene knockout analyses on the adjusted model. The flux of target enzymes remained unaffected even when relevant genes were knocked out. This observation indicated potential inadequacies in the baseline model or perhaps a compensating multi-tiered system that prevented major changes form the loss of a single gene to the target pathway. Thus, since the only way forward that I could think of was dynamic FBA, wet lab experimentation of enzyme fluxes for critical data was deemed an unwise use of their limited time and the FBA project was put aside. The FBA and gene knockout analysis did however produce promising results when the target was set to other pathways within E Coli, indicating potential utilization as an IGEM software development. The identified challenges motivated a pivot from the FBA tools towards enhancing the Wiki and developing predictive machine learning models to support IHP components of the project.


References


We utilized the Study of Womens Health Across the Nation dataset to train this model. We wanted to utilize this model for the purposes of showcasing barriers to access and to contextualize the work of Yale iGEM in the landscape of difficult-to-access treatments.

Swan study data access - women’s health across the nation. SWAN. (2023, April 12). https://www.swanstudy.org/swan-research/data-access/