How to infer metabolite abundace
What we eat consists us. The chocolate I ate breaks down into carbohydrates when passing through digestive organs, and cell absorb carbohydrates to make energy or to make building blocks for the body. In this sense, metabolism is the way of organisms interacting with the environment.
Thus, modeling metabolism would improve our understanding on metabolism. Here I will give you brief & easy introduction of one of the modeling method: Constraint-based model.
Why should we model metabolism with constraint-based model?
[Objective] To understand interactions with biological and environmental components
- Previous approaches & drawbacks
- Mathematical modeling : Lack of dynamics
- Dynamic modeling data : a type of mathematical modeling : Lack of experimental data (eg. kinetic parameters, total enzyme concentrations) required for modeling
- Constraint-based modeling : Compensate the defects of previous models
- Only stoichiometric matrix is required (easily retrieved from metabolic network)
What is constraint-based model?

For decades, metabolic network has been constructed from biological databases, literatures, and experiments. This shows how metabolites interact with each other and the ratio between the amounts of metabolites involved in reactions. From the ratio, stoichiometric matrix can be built. It’s just mathematic representation of the network.
To model metabolic network means to figure out the distribution of reactions involved in the network. For example, if we find out that R1 is twice faster than R2 and R3 is consistent, the relationship between the reactions is clarified. The distribution (rate) of reactions is called flux distribution (flux vector). Our goal is to find out the range of flux distribution. From stoichiometric matrix, there are infinite number of combinations of reaction rates that can be represented as N-dimensional space (if there are N reactions in the network).
Fortunately, there are constraints to reduce the solution space. Those constraints are from experiments, from network topology, or from assumption(eg. steady state assumption, mass balance assumption) in chemistry/physics. Following figure shows how solution space is restricted to give more precise solution space.

However, the number of solutions is still infinite after the restriction of solution space. To retrieve one or few solutions, one more assumptions is adopted: “A cell is striving to achieve a metabolic objective.2” Objective function is defined as the linear combination of flux variables. Each reaction has its own meaning (eg.Uptake rates, gene knockout). Assume that reaction R1 refers to “metabolite uptake”. If the objective of the cell is to maximize the uptake rates, the cell is going to adjust its metabolism to maximize the rate of R1. Here, objective function is the rate of R1, which can be restated as linear combination of flux variables of all the reactions where all other coefficients of reactions except for R1 are set to 0. Using this method, it is possible to get optimal solutions that maximizes a objective function among the infinite number of solutions in solution space.
Reference
- OD kim. et al. A Review of Dynamic Modeling Approaches and Their Application in Computational Strain Optimization for Metabolic Engineering. Front Microbiol. 9. 1690. (2018)
- A Bordbar. et al. Constraint-based models predict metabolic and associated cellular functions. Nat Rev Genet. 15, 107-120. (2014)