Difference between revisions of "Team:KU Leuven/Modeling/Internal"
Line 633: | Line 633: | ||
</div> | </div> | ||
</div> | </div> | ||
− | + | </div> | |
<br> | <br> | ||
<br> | <br> |
Revision as of 22:34, 15 September 2015
Internal Model
1. Introduction
We can think of many relevant questions in implementing a new circuit: how sensitive is the system, how much will it produce and will it affect the growth? As such, it is important to model the effect of the new circuits on the bacteria. This will be done in the Internal Model. We will use two approaches. First we will use a bottom-up approach. This involves building a detailed kinetic model with rate laws. We will use Simbiology and ODE's to study the sensitivity and dynamic processes inside the cell. This is the bottom-up approach. Afterwards, a top-down model, Flux Balance Analysis (FBA), will be used to study the steady-state values for production flux and growth rate. This part is executed by the iGEM Team of Toulouse as part of a collaboration and can be found here
2. Simbiology and ODE
In the next section we will describe our Simbiology model. Simbiology allows us to calculate systems of ODE's and to visualize the system in a diagram. It also has options to make scans for different parameters, which allows us to study the effect of the specified parameter. We will focus on the production of leucine, Ag43 and AHL in cell A and the changing behavior of cell B due to changing AHL concentration. In this perspective, we will make two models in Simbiology: one for cell A and cell B. First we will describe how we made the model and searched for the parameters. Afterwards we check the robustness of the model with a parameter analysis and we do scans to check for the effects of molecular noise.
3. Quest for parameters
We can divide the different processes that are being executed in the cells in 7 classes: transcription, translation, DNA binding, complexation and dimerization, protein production kinetics, degradation and diffusion. We went on to search the necessary parameters and descriptions for each of these categories. To start making our model we have to chose a unit. We choose to use molecules as unit because many constants are expressed in this unit and it allows us to drop the dillution terms connected to cell growth. We will also work with a deterministic model instead of a stochastic model. A stochastic model will show us the molecular noise, but we will check this with parameter scans.
The next step is to make some assumptions:
- The effects of cell division can be neglected
- The substrate pool can not be depleted and the concentration (or amount of molecules) of substrate in the cell is constant
- The exterior of the cell contains no leucine at t=0 and is perfectly mixed
- Diffusion happens independent of cell movement and has a constant rate
3.1 Transcription
Transcription is the first step of gene expression. It involves the binding of RNA polymerase to the promoter region and the formation of mRNA. The transcription rate is dependent on a number of conditions like the promoter strength and the used strain. Our system has both constitutive promoters, which are always active, and inducable promoters, which can be activated or repressed.
Maximum Transcription rate
First we will try to find the maximum transcription rate. The prediction of the transcription rate has been an important hold-back in the past, even though Polymerases per Second was introduced as a unit. This is the amount of Polymerases that passes through a given position in the DNA per time unit and is essentially the transcription rate at a particular location on the DNA (open wetware http://openwetware.org/wiki/PoPS). We can use this value as the transciption rate of the whole gene because it is the value of the slowest and rate-defining step, the binding of polymerase on the promoter. The other step being the movement of polymerase over the gene. It remained difficult to measure in vivo but Kelly et al have introduced a way to measure the activity of promoters using an in vivo standard, promoter J23101. This gave rise to a new unit: Relative activity of promoter (RPU)$=\frac{PoPS_{phi}}{PoPS_{J23101}}$. By using relative units, the variability due to equipment and conditions was drastically decreased. The PoPS of J23101 was found to be 0.03.
As our constitutive promoter, we used J23114. On the iGEM website we found that the J constitutive promoter family had been characterized using RFP and we found that J23101 had a strength of 1791 au and J23114 had a strength of 256 au with J23112 as standard in this measurement. We conclude that J23114 has a PoPS of 0.00429. The iGEM team of UNIPV-Pavia 2010 also measured the activity of J23114 in E. coli in LB with a high copy number. The RPU is equal to around 0.05. This means that J23114 has a PoPS of 0.0015. The two values are of the same magnitude but we take the smaller one, since we can not find the references of the first measurement and to play on safe.
The strength of the pLuxPR promoter has been measured relatively to pLacIQ by iGEM Tokyo_Tech 2010. They found a value for pLuxPR of 550 and for pLacIQ of 390. pLacIQ has also been measured by iGEM Upssala university 2012 and was found to have a RPU of 0.04 giving it a PoPS of 0.0012. From these values we calculate the PoPS of pLuxPR to be 550/390 * 0.0012 =0.00169 PoPS.
Next we sought values for pCI. iGEM NYMU-Taipei 2009 measured the pLux promoter relative to pCI, which allows us to calculate the PoPS for pCI: $\frac{1}{0.43} {\cdot} 0.00169=0.0394$ PoPS. Their resulst of pPen were not consistent and the team mentioned they lacked experience in the beginning. Results of the earlier used pCI and pLux were consistent though. This is why we only used their last results and took the mean of pPen compared to pTet: $(0.801 {\cdot} 0.537 + 0,191) {\cdot} {\frac{1}{2}}=0.3106$. Using this mean value we were able to calculate the PoPS: $0.3106 {\cdot} 0.0394=0.01224$.
These values all depend on measuring activities and since the strains, media and antibiotic markers won’t be completely the same, the real values will diverge from the values found with these simple calculations. These values can give us an idea about the relative strength of the different promoters. We could gather the real values of mRNA concentration in our cells by executing a qPCR but for now we will use the found values for our model. The last important parameter for transcription is the copy number in which the genes are present in the cell. The plasmid that incorporates the used genes, has an ORI with a copy number between 100 and 300. In our model we will use a copy number of 200.
Inducable promoters
For inducable promoters the maximum transcription rate has to be multiplied with a factor to integrate the effect of the transcription factors. Our system includes two types of DNA binding proteins: repressors and activators. These proteins bind certain regions in the promoter DNA. This binding can either repress or activate the promoter activity, affecting transcription. The repressors used in our system are the cI protein of the lambda phage and the PenI protein. The activator protein is LuxR, which should be activated by AHL. All proteins first need to form homodimers before being able to bind their target DNA.
DNA binding will be simulated using the Hill function : ${\theta} = \frac{{[L]}^n}{{K_d}^n + [L]^n} = \frac{1}{1+(\frac{K_{d}}{[L]})^{n}}$. $\theta$ is the amount of DNA bound by the protein, $L$ is the amount of protein, $K_d$ is the dissociation constant and $n$ is the Hill coefficient. This Hill function has values between 0 and 1 and acts like an ON/OFF switch.
For repressors we are interested in how much of the DNA is still unbound and active:
$\frac{1}{1 + ({\frac{[Repressor]}{K_{d}}})^{n}}$
For activators we are interested in how much of the DNA is bound and active:
$\frac{1}{1+(\frac{K_{d}}{[Activator]})^{n}}$
The Hill coefficient gives an idea about the strength about the interaction between the DNA and the protein and is sometimes used to estimate the amount of proteins that bind the DNA but caution is needed in using it. Indeed, the Hill function has knwown shortcomings, but it is still very useful because of its simplicity (citate paper).
The constants needed for our model are $K_d$ and $n$. :
Protein | Kd (molecules) | n | Source |
---|---|---|---|
cI | 20 | 2 | 2009 iGEM Aberdeen and Wang et al. (2009) |
LuxR/AHL | 3.7 | 1 | 2007 iGEM ETH Zurich and Basu et al. (2005) |
PenI | 17 | 2 | Wittman et al. (1993) |
${K_d}$ is the value which corresponds with the amount of repressor or activator necessary to repress or activate half of the promoters. Since our copynumber is 200, we multiply the Hill constant with 200 because this higher value seems more logical to us.
Leakiness
A last important value in transcription is the leakiness of the promoter. A shutdown promoter still has a very small transcription which is called the leakiness of the promoter. This value should be as low as possible, because there should be a big difference between ON and OFF promoters. The effect of leakiness of promoters is something that should be checked.
iGEM 2012 Upssala measured the leakiness of the pLuxPR promoter. Without AHL and LuxR, the pLuxPR was active and had a RPU of 10. This means that the leaking promoter has a PoPS of $\frac{10}{390} {\cdot} 0.040 {\cdot} 0.03 = 3.077 {\cdot} 10^{-5} PoPS$.
We can use this value to estimate the leakiness of the other promoters.
3.2 Translation
The next step in gene expression is the translation of the synthesized mRNA to a protein. This involves the binding of a ribosome complex to the RBS site of the mRNA, elongation and termination. The initiation step is the rate-limiting step of the three. The translation rate is thus tied to the thermodynamics of RBS-ribosome interaction but other sequences (5’-UTR, Shine-Delgarno,repeats, internal startcodons) and possible secondary structures (hairpins) also play a role (Chiam Yu Ng, Salis). (Fong) Salis et al have developed an algorithm to predict the translation initiation rate based upon thermodynamic calculations. This algorithm is used in the Ribosome Binding Site Calculator (https://salislab.net/software/ , Borujeni & Salis). We used this Calculator to predict the translation rates of our mRNAs.
The results are in table 1. The program gives us results in au (arbitrary units). Since we know the translation rate of LuxI and LuxR, we can use these as a base to calculate the other translation rates since the used scale is proportional. (table 1, column 2) LuxI and LuxR have almost the same output from the calculator which corresponds with the paper where they also have the same translation rate. (Ag43 has a very low translation rate, which is not completely illogical, since Ag43 is by far the biggest protein.) Our values are normal values since 1000 is a moderate value and values between 1 and 100 000 are possible. (efficient search, mapping and optimization, salis). We did get warnings about the prediction (NEQ: not at equilibrium) which happens when mRNA may not fold quickly to its equilibrium state.
Protein | Translation rate (a.u.) | Translation rate (1/s) |
---|---|---|
LuxI | 261.13 | 0.016 |
LuxR | 279.61 | 0.016 |
cI857 | 5695.79 | 0.35 |
Transaminase B | 2195.1 | 0.13 |
Ag43-YFP | 33.22 | 0.0020 |
PenI | 6696.73 | 0.41 |
RFP | 2604.51 | 0.16 |
CheZ-GFP | 396.95 | 0.024 |
3.3 Complexation and dimerization
Before the proteins can bind the promoter region, they first have to make complexes. cI and penI form homodimers, while LuxR first forms a heterodimer with AHL and afterwards forms a homodimer with another LuxR/AHL dimer. This next part will describe the kinetics of such complexation. We only need the parameters of LuxR and cI, because the Hill function found for penI already was adapted for the protein in monomer form.
Biomolecule | Complexation rate | Source |
---|---|---|
LuxR/AHL association | $1{\cdot}10^{-5}$ 1/(molecule*s) | Goryachev et al. (2005) |
LuxR/AHL dissociation | 0.00333 1/s | Goryachev et al. (2005) |
LuxRdimer association | $1{\cdot}10^{-5}$ 1/(molecule*s) | Goryachev et al. (2005) |
LuxRdimer dissociation | 0.01 1/s | Goryachev et al. (2005) |
cI dimer association | 0.00147 1/(molecule*s) | iGEM Aberdeen 2009 |
cI dimer dissociation | 0.01 1/s | Bernstein et al. (2002) |
3.4 Protein production kinetics
Our models comprise two product forming enzymes: LuxI and Transaminase B.
LuxI is the enzyme responsible for AHL production and Transaminase B is responsible for Leucine formation.
For LuxI kinetics Schaefer et al. (1996) found a value of 1 molecule/minute for the Vmax. Since we assume that LuxI is fully saturated with substrate, we take this value as the synthesis rate of AHL.
Transaminase B forms leucine from aKIC and glutamate. In this reaction glutamate is converted to aKG. The reaction kinetics is of the ping-pong bi bi form. This model describes a mechanism in which the binding of substrates and release of products is ordered. The enzyme shuttles between a free and a substrate-modified intermediate state.
Ping-Pong Bi-Bi equations
\begin{align} \frac{{\large d}{TB}}{d t}= & \beta_{TB} {\cdot} {m_{ilvE}} - {kf}_{1}{\cdot}{TB}{\cdot}{Glu} + {kf}_{-1}{\cdot}{[TB-GLU]} - {kr}_1{\cdot}{Leucine}_{in}{\cdot}{[TB-GLU]} \\\\ & + {kr}_{-1}{\cdot}{[TB-Leu]} + {kcat2}{\cdot}{[{TBNH}_2-aKIC]} + {kcat4}{\cdot}{[{TBNH}_2-aKG]} - d_{TB}{\cdot}{TB} \end{align} $$\frac{{\large d}{[TB-GLU]}}{d t}= -{kcat1}{\cdot}{[TB-GLU]} + {kf}_{1}{\cdot}{TB}{\cdot}{Glu} - {kf}_{-1}{\cdot}{[TB-GLU]} $$ \begin{align} \frac{{\large d}{[{TBNH}_2]}}{d t}= & {kcat1}{\cdot}{[TB-GLU]} + {kcat3}{\cdot}{[TB-Leu]}- {kf}_{2}{\cdot}{{TBNH}_2}{\cdot}{aKIC} + {kf}_{-2}{\cdot}{[{TBNH}_2-aKIC]} \\\\ & - {kr}_2{\cdot}{{TBNH}_2}{\cdot}{aKG} +{kr}_{2}{\cdot}{[{TBNH}_2-aKG]} \end{align} $$\frac{{\large d}{[{TBNH}_2-aKIC]}}{d t}= -{kcat2}{\cdot}{[{TBNH}_2-aKIC]} + {kf}_{2}{\cdot}{{TBNH}_2}{\cdot}{aKIC} - {kf}_{-2}{\cdot}{[{TBNH}_2-aKIC]} $$ $$\frac{{\large d}{[TB-Leu]}}{d t}= {kr}_1{\cdot}{Leucine}_{in}{\cdot}{[TB-GLU]} - {kr}_{-1}{\cdot}{[TB-Leu]} - {kcat3}{\cdot}{[TB-Leu]} $$ $$\frac{{\large d}{[{TBNH}_2-aKG]}}{d t}= {kr}_2{\cdot}{{TBNH}_2}{\cdot}{aKG} - {kr}_{2}{\cdot}{[{TBNH}_2-aKG]} - {kcat4}{\cdot}{[{TBNH}_2-aKG]}$$
Yang et al. (2005) studied these reactions and found a method to estimate the constants. Their constants of this reversible Ping-Pong Bi-Bi reactions are put in the next table :
Constant | Value |
---|---|
${kf}_{1}$ | 0.0098 |
${kf}_{-1}$ | 3300 |
${kf}_{2}$ | 0.0882 |
${kf}_{-2}$ | 5940 |
${kr}_{1}$ | 0.0031184 |
${kr}_{-1}$ | 4620 |
${kr}_{2}$ | 0.0041161 |
${kr}_{-2}$ | 3465 |
$kcat1$ | 33.33 |
$kcat2$ | 60 |
$kcat3$ | 46.67 |
$kcat4$ | 35 |
3.5 Degradation
Proteins, mRNA and other metabolites have a turnover rate. They are degraded over time. The degradation rate will be described as proportional to the amount of biomolecules. The coefficient of proportionality d is the degradation constant. Not every molecule has the same degradation rate, since some molecules are more stable than others. We can influence the stability of the molecules. For example, in cell B it is important that there is a fast switch between conditions and a fast turnover of CheZ and RFP is necessary. This is why we add a LVA-tag to these proteins. This tag destabilizes the protein and makes them degradade faster. The degradation rates used in the model are put in the next table:
Biomolecule | Degradation rate (1/s) | Source |
---|---|---|
LuxI | 0.01 | Goryachev et al. (2005) |
luxI-mRNA | 0.006 | Goryachev et al. (2005) |
LuxR | 0.001 | Goryachev et al. (2005) |
luxR-mRNA | 0.006 | Goryachev et al. (2005) |
cI857 | 0.00288 | iGEM Aberdeen 2009 |
cI-mRNA | 0.002265 | Bernstein et al. (2002) |
Transaminase B | 4.5 | Estimated |
ilvE-mRNA | 0.00304 | Bernstein et al. (2002) |
Ag43-YFP | $2.84{\cdot}10^{-5}$ | Bernstein et al. |
Adhesine mRNA | 0.00186 | Bernstein et al. (2002) |
PenI | 0.00333 | Estimated |
PenI mRNA | 0.002265 | Bernstein et al. (2002) |
RFP-LVA | 0.0002814 | Andersen et al. (1998) and iGEM KULeuven 2008 |
RFP mRNA | 0.066 | Bernstein et al. (2002) |
CheZ-GFP-LVA | 0.0002814 | Andersen et al. (1998) |
CheZ-GFP mRNA | 0.00333 | Estimated |
AHL | 0.01 | Basu et al. (2005) |
AHL external | 0.00048135 | Horswill et al. (2007) |
Leucine | 0.000167 | Yang et al. (2005) |
Leucine external | 0.000333 | Estimated |
3.6 Diffusion
Our model has 2 types of diffusion. Diffusion from the inside of the cell to the outside over the cell membrane and diffusion in the external medium. The diffusion over the cell membrane is more complicated because some proteins play a role in it and the membrane is not equally permeable for every molecule. For AHL, we found a value given in molecules/second. This unit seems strange because diffusion is usually used with units in concentration. It also leads to strange results since the amount of molecules is being leveled out so the inside amount equals the outside amount even though the concentrations are different. This is why we added a correction for the volume of the cell and the external volume to it.
We assume that the volume of the cell has a shape of a cilinder and is constant. For this simplified volume we found a value of $5.65 {\cdot} 10^{-16} $ l. We can take this volume as a constant, since cell growth is very small compared to the diffusion. The outside compartment will be modeled as half a sphere with the cell as center and a radius equal to $\sqrt{2*D*t}$ + r0. We only take half a sphere because there is no upward diffusion. For the initial value of the outside compartment we take a volume (and thus r0 slightly bigger than the cell volume (radius).
With this approximation we get more logical results. We take the diffusion rate for AHL equal to 0.23 1/s. For Leucine we make a differce between inward and outward diffusion. Inward diffusion is facilitated by transporters, so we choose this value to be the largest. The inward diffusion rate is
Biomolecule | Diffusion rate (1/s) | Source |
---|---|---|
AHL IN and OUT | 0.23 1/s | Goryachev et al. (2005) |
Leucine IN | 0.05 1/s | Estimated |
Leucine OUT | 0.0005 1/s | Estimated |
AHL extracellular | $7.1{\cdot}10^{-9}$ dm²/s | Goryachev et al. (2005) |
Leucine extracellular | $7.3{\cdot}10^{-8}{\cdot}0.09$ dm²/s | iGEM Aberdeen 2009 |