M.S. defense next Monday

Graduate Defenses
Graduate Defenses

M.S. Defense: Achilles Gasper Rasquinha
Monday, November 14, 2022
10:30 AM
Beadle N177 and Zoom
Join Zoom Meeting: https://unl.zoom.us/j/91745517509

"A Deep Learning Pipeline to Generate Genome-Scale Metabolic Emulators"

Genome-Scale Metabolic Models (GEMMs) are powerful reconstructions of biological systems that help metabolic engineers to understand and predict growth conditions subjected to various environmental factors around the cellular metabolism of an organism in observation, purely in silico. Applications of metabolic engineering range from perturbation analysis and drug-target discovery to predicting growth rates of biotechnologically important metabolites and reaction objectives within different single-cell and multi-cellular organism types. GEMMs use mathematical frameworks for quantitative estimations of various flux distributions within metabolic network pathways. The behavioral reasons behind why an organism activates, stuns, or fluctuates between alternative pathways for growth and survival, however, remain relatively unknown. GEMMs rely heavily on manual intervention during their curation and annotation process which can potentially induce substantial experimental bias. Also, solution spaces that cater to the flux distributions are extremely sensitive to the addition, updation, and deletions of metabolites and reactions as well as gene-enzyme-reaction rules within the model. Therefore, the quest for optimality can be often lost due to the number of hyper dimensions represented by these networks. Recently, Deep Learning (DL) has played a significant role in building function approximators for highly complex input datasets correlating in extremely large hyper dimensions. In this thesis, we use an interpretable learning-driven approach towards building surrogate GEMM models that act as alternatives to existent Flux Balance Analysis (FBA)-based approaches for predicting intracellular fluxes of reactions. We exploit the network characteristics of a well-curated input organism and build a synthetic subset of the flux cone containing thermodynamically feasible reaction growth rates. We then feed this dataset into a deep generative model capable of reconstructing intracellular flux values of the input organism. We evaluate its efficiency based on time-to-construct, accuracy, and ease of use. In order to provide a fair comparative analysis, we explore our learning approach with other traditional regression-based models and test our pipeline on 3 different input organisms subjected to network reduction techniques and different hyperparameters.

M.S. Committee:
Dr. Massimiliano Pierobon, Advisor
Dr. Thomas Helikar, Co-Advisor
Dr, Juan Cui
Dr. Ashok Samal