Difference between revisions of "Team:KU Leuven/Modeling/Internal"
Line 122: | Line 122: | ||
Transcription is the first step of gene expression. It involves the binding of RNA polymerase to the promoter region and the formation of mRNA. The transcription rate is dependent on a number of conditions like the promoter strength and the used strain. Our system has both constitutive promoters, which are always active, and inducable promoters, which can be activated or repressed. </p> | Transcription is the first step of gene expression. It involves the binding of RNA polymerase to the promoter region and the formation of mRNA. The transcription rate is dependent on a number of conditions like the promoter strength and the used strain. Our system has both constitutive promoters, which are always active, and inducable promoters, which can be activated or repressed. </p> | ||
− | < | + | <p><b> Maximum Transcription rate </b></p> |
<p> | <p> | ||
First we will try to find the maximum transcription rate. The prediction of the transcription rate has been an important hold-back in the past, even though Polymerases per Second was introduced as a unit. This is the amount of Polymerases that passes through a given position in the DNA per time unit and is essentially the transcription rate at a particular location on the DNA (open wetware http://openwetware.org/wiki/PoPS). We can use this value as the transciption rate of the whole gene because it is the value of the slowest and rate-defining step, the binding of polymerase on the promoter. The other step being the movement of polymerase over the gene. It remained difficult to measure in vivo but Kelly et al have introduced a way to measure the activity of promoters using an in vivo standard, promoter J23101. This gave rise to a new unit: Relative activity of promoter (RPU)$=\frac{PoPS_{phi}}{PoPS_{J23101}}$. By using relative units, the variability due to equipment and conditions was drastically decreased. The PoPS of J23101 was found to be 0.03.<br> | First we will try to find the maximum transcription rate. The prediction of the transcription rate has been an important hold-back in the past, even though Polymerases per Second was introduced as a unit. This is the amount of Polymerases that passes through a given position in the DNA per time unit and is essentially the transcription rate at a particular location on the DNA (open wetware http://openwetware.org/wiki/PoPS). We can use this value as the transciption rate of the whole gene because it is the value of the slowest and rate-defining step, the binding of polymerase on the promoter. The other step being the movement of polymerase over the gene. It remained difficult to measure in vivo but Kelly et al have introduced a way to measure the activity of promoters using an in vivo standard, promoter J23101. This gave rise to a new unit: Relative activity of promoter (RPU)$=\frac{PoPS_{phi}}{PoPS_{J23101}}$. By using relative units, the variability due to equipment and conditions was drastically decreased. The PoPS of J23101 was found to be 0.03.<br> | ||
Line 134: | Line 134: | ||
These values all depend on measuring activities and since the strains, media and antibiotic markers won’t be completely the same, the real values will diverge from the values found with these simple calculations. Nevertheless, these values can give us an idea about the relative strength of the different promoters. We could gather the real values of mRNA concentration in our cells by executing a qPCR, but for now we will use the found values for our model. The last important parameter for transcription is the copy number in which the genes are present in the cell. The plasmid that incorporates the used genes, has an ORI with a copy number between 100 and 300. In our model we will use a mean copy number of 200. | These values all depend on measuring activities and since the strains, media and antibiotic markers won’t be completely the same, the real values will diverge from the values found with these simple calculations. Nevertheless, these values can give us an idea about the relative strength of the different promoters. We could gather the real values of mRNA concentration in our cells by executing a qPCR, but for now we will use the found values for our model. The last important parameter for transcription is the copy number in which the genes are present in the cell. The plasmid that incorporates the used genes, has an ORI with a copy number between 100 and 300. In our model we will use a mean copy number of 200. | ||
</p> | </p> | ||
− | < | + | <p> |
+ | <b> Inducable promoters </b> </p> | ||
<p> | <p> | ||
For inducable promoters the maximum transcription rate has to be multiplied with a factor to integrate the effect of the transcription factors. Our system includes two types of DNA binding proteins: repressors and activators. These proteins bind certain regions in the promoter region of the DNA. This binding can either repress or activate the promoter activity, affecting transcription rate. The repressors used in our system are the cI protein of the lambda phage and the PenI protein. The activator protein is LuxR, which should be activated by AHL. All proteins first need to form homodimers before being able to bind their target DNA.</p> | For inducable promoters the maximum transcription rate has to be multiplied with a factor to integrate the effect of the transcription factors. Our system includes two types of DNA binding proteins: repressors and activators. These proteins bind certain regions in the promoter region of the DNA. This binding can either repress or activate the promoter activity, affecting transcription rate. The repressors used in our system are the cI protein of the lambda phage and the PenI protein. The activator protein is LuxR, which should be activated by AHL. All proteins first need to form homodimers before being able to bind their target DNA.</p> | ||
Line 180: | Line 181: | ||
${K_d}$ is the value which corresponds with the amount of repressor or activator necessary to repress or activate half of the promoters. Since our copynumber is 200, we multiply the Hill constant with 200 because this higher value seems more logical to us. | ${K_d}$ is the value which corresponds with the amount of repressor or activator necessary to repress or activate half of the promoters. Since our copynumber is 200, we multiply the Hill constant with 200 because this higher value seems more logical to us. | ||
</p> | </p> | ||
− | + | <p> | |
− | < | + | <b> Leakiness </b></p> |
<p> | <p> | ||
A last important value in transcription is the leakiness of the promoter. A shutdown promoter still has a very small transcription which is called the leakiness of the promoter. This value should be as low as possible, because there should be a big difference between ON and OFF promoters. The effect of leakiness of promoters is something that should be checked. <br> | A last important value in transcription is the leakiness of the promoter. A shutdown promoter still has a very small transcription which is called the leakiness of the promoter. This value should be as low as possible, because there should be a big difference between ON and OFF promoters. The effect of leakiness of promoters is something that should be checked. <br> | ||
Line 190: | Line 191: | ||
<br> | <br> | ||
</div> | </div> | ||
+ | <div class="whiterow"></div> | ||
</div> | </div> | ||
Line 255: | Line 257: | ||
<p>For TransaminaseB we also need to add a production term because it is an protein which naturally occurs in <i>E. coli</i>. We choose a value of 0.0033 for this constitutive production rate. </p> | <p>For TransaminaseB we also need to add a production term because it is an protein which naturally occurs in <i>E. coli</i>. We choose a value of 0.0033 for this constitutive production rate. </p> | ||
</div> | </div> | ||
+ | <div class="whiterow"></div> | ||
</div> | </div> | ||
Line 306: | Line 309: | ||
</table> | </table> | ||
</div> | </div> | ||
− | + | <div class="whiterow"></div> | |
</div> | </div> | ||
</div> | </div> | ||
Line 519: | Line 522: | ||
</table> | </table> | ||
</div> | </div> | ||
− | + | <div class="whiterow"></div> | |
</div> | </div> | ||
</div> | </div> | ||
Line 577: | Line 580: | ||
<div class="summarytext"> | <div class="summarytext"> | ||
<div class="part"> | <div class="part"> | ||
− | < | + | <h2> System </h2> |
<p> After this extensive literature search, we can finally set up our complete system of ODEs for every cell. </p> | <p> After this extensive literature search, we can finally set up our complete system of ODEs for every cell. </p> | ||
</div> | </div> | ||
Line 585: | Line 588: | ||
<div class="toggleseven"> | <div class="toggleseven"> | ||
− | < | + | <h2>Cell A </h2> |
</div> | </div> | ||
<div id="toggleseven"> | <div id="toggleseven"> | ||
Line 639: | Line 642: | ||
<div class="togglebar"> | <div class="togglebar"> | ||
<div class="toggleeight"> | <div class="toggleeight"> | ||
− | < | + | <h2>Cell B </h2> |
</div> | </div> | ||
<div id="toggleeight"> | <div id="toggleeight"> | ||
Line 712: | Line 715: | ||
<br> | <br> | ||
<br> | <br> | ||
− | < | + | <p><b> Sensitivities </b></p> |
<p> Now we are going to check which parameters have the biggest effect on the output and are the most important. We can quantify this effect using derivatives: $\frac{\delta {output}}{\delta {parameter}}$. The parameters with the largest sensitivity value, are the parameters that should be best characterized. Furthermore, if they are controllable, the could be varied to our wishes. The sensitivity analysis will be executed in Simbiology. This analysis uses "complex-step approximation" to calculate derivatives of reaction rates. This technique yields accurate results for the vast majority of typical reaction kinetics. We will use full dedimensionalization. This way we can compare the results. <br> <br> For cell A the output is the Medium Leucine, Adhesine and Medium AHL. We took a time integral of the sensitivity and plotted it in figure 6.<br> <br> | <p> Now we are going to check which parameters have the biggest effect on the output and are the most important. We can quantify this effect using derivatives: $\frac{\delta {output}}{\delta {parameter}}$. The parameters with the largest sensitivity value, are the parameters that should be best characterized. Furthermore, if they are controllable, the could be varied to our wishes. The sensitivity analysis will be executed in Simbiology. This analysis uses "complex-step approximation" to calculate derivatives of reaction rates. This technique yields accurate results for the vast majority of typical reaction kinetics. We will use full dedimensionalization. This way we can compare the results. <br> <br> For cell A the output is the Medium Leucine, Adhesine and Medium AHL. We took a time integral of the sensitivity and plotted it in figure 6.<br> <br> | ||
Revision as of 17:14, 18 September 2015
Internal Model
Introduction
We can think of many relevant questions when implementing a new circuit: how sensitive is the system, how much will it produce and will it affect the growth? As such, it is important to model the effect of the new circuits on the bacteria. This will be done in the Internal Model. We will use two approaches. First we will use a bottom-up approach. This involves building a detailed kinetic model with rate laws. We will use Simbiology and ODEs to study the sensitivity and dynamic processes inside the cell. Afterwards, a top-down model, Flux Balance Analysis (FBA), will be used to study the steady-state values for production flux and growth rate. This part is executed by the iGEM Team of Toulouse as part of a collaboration and can be found here
Simbiology and ODEs
In the next section we will describe our Simbiology model. Simbiology is a toolbox from Matlab designed for the simulation of (bio)chemical reactions. It allows us to calculate systems of ODEs and to visualize the system in a diagram. It also has options to make scans for different parameters, which allows us to study the effect of the specified parameter. We will focus on the production of leucine, Ag43 and AHL in cell A and the changing behavior of cell B due to changing AHL concentration. In this perspective, we will make two models in Simbiology: one for cell A and one for cell B. First we will describe how we made the model and searched for the parameters. Afterwards we check the robustness of the model with a parameter analysis and we do scans to check for the effects of molecular noise.
Quest for parameters
We can divide the different processes that are being executed in the cells in 7 classes: transcription, translation, DNA binding, complexation and dimerization, protein production kinetics, degradation and diffusion. We went on to search the necessary parameters and descriptions for each of these categories. To start making our model we have to pick a unit. We choose to use molecules as unit, because many constants are expressed in this unit and it allows us to drop the dillution terms connected to cell growth. We will also work with a deterministic model instead of a stochastic model. A stochastic model would show us the molecular noise, but we will check this with parameter scans.
The next step is to make some assumptions:
- The effects of cell division can be neglected
- The substrate pool can not be depleted and the concentration (or amount of molecules) of substrate in the cell is constant
- The exterior of the cell contains no leucine at t=0 and is perfectly mixed
- Diffusion happens independent of cell movement and has a constant rate
System
After this extensive literature search, we can finally set up our complete system of ODEs for every cell.
Results
For cell A we made a simulation with cell A in the ON and OFF mode as visuable in figure 3. When cell A is in the OFF state, the whole designed circuit is in OFF mode. This means that the cI repressor is succesful in repressing the design. If the degradation rate of cI is raised to simulate the temperature rising, we see that all the components of the system show a big increase. For LuxR this increase is only temporary, but this is also explainable. Since LuxI keeps on producing AHL which binds LuxR to form a complex. Indeed all the AHL reacts to form the complex. Some values seem really high (for example the AHL,out en Leucine, extracellular values but they are also in the biggest volume so the concentration is not that high). We also assumed that there is a substrate pool without limiting, which is of course not the real situation. The most important conclusion is that with values backed up by literature, our system qualitatively still shows the desired behavior.
Figure 3: Simulation of all processes in Cell A in ON and OFF state
In the OFF state simulations, there is not a big difference between the red and green lines. We do see a very small rise in LuxR but we can ignore this because it is so small. We see a fast equilibration between the external AHL and internal AHL and no drop since there is no LuxR to react with the AHL. The only proteins that are available in high amounts are PenI and RFP. The high amount of PenI are not predicted in our design, but it does not affect the amount of RFP.
In the ON simulations we see a big difference between the red and green lines. When there is AHL available, the production of CheZ and PenI is much bigger and the production of RFP much lower. This is the behavior we wished for. Thus, our system is still qualitatively showing the desired behavior.
Figure 4: Simulation of all processes in Cell B in OFF state with and without AHL induction
Figure 5: Simulation of all processes in Cell B in ON state with and without AHL induction
Sensitivities
Now we are going to check which parameters have the biggest effect on the output and are the most important. We can quantify this effect using derivatives: $\frac{\delta {output}}{\delta {parameter}}$. The parameters with the largest sensitivity value, are the parameters that should be best characterized. Furthermore, if they are controllable, the could be varied to our wishes. The sensitivity analysis will be executed in Simbiology. This analysis uses "complex-step approximation" to calculate derivatives of reaction rates. This technique yields accurate results for the vast majority of typical reaction kinetics. We will use full dedimensionalization. This way we can compare the results.
For cell A the output is the Medium Leucine, Adhesine and Medium AHL. We took a time integral of the sensitivity and plotted it in figure 6.
The leucine medium is dependent on Diffusion of Leucine out, leucine consumption, degradation, transcription and translation of ilvE mRNA. We also notice that not all Ping-Pong Bi Bi constants are equally important. kf1, kf-1 and kcat are the most important while kr1, kr-1 and kcat3 are the least important.
AHL medium: is highly sensitive for variations in luxI translation, degradation of LuxI and LuxI mRNA and for the catalytic activity of LuxI.
Adhesine-YFP is sensitive for translation, degradation and transcription.
In Cell A are all outputs dependent on variables closely related to themselves. Indeed, since there is production of AHL and LuxR in the cell, the transcriptional network is always active so the steps concerning the transcriptional network lose their importance. It is thus only a matter of understanding the metabolism, production and degradation terms to correctly model Cell A.
Figure 6: Sensitivity analysis of parameters in cell A
We do the same for cell B. In cell B the output is the production of CheZ and RFP and the results are plotted in figure 7. We notice that CheZ-GFP is highly sensitive for the association rates of the LuxR/AHL complex and the LuxR/AHL dimer. RFP production is also sensitive for the association of the LuxR/AHL dimer.
We see that in cell B the dimerisation steps are really important. This is logical, since cell B is dependent on external AHL concentrations to start the transcriptional network. Thus, the steps concerning the binding of AHL to LuxR and making the activated LuxR/AHL dimer, which can start the transcription, are the most important steps. They determine the sensitivity of the network to AHL and as so, form the major component of the network.
Figure 7: Sensitivity analysis of parameters in cell B
References
[1] | Chin-Rang Yang, Bruce E Shapiro, She-Pin Hung, Eric D Mjolsness, and G Wesley Hatfield. A mathematical model for the branched chain amino acid biosynthetic pathways of Escherichia coli K12. The Journal of biological chemistry, 280(12):11224-11232, 2005. [ DOI ] |
[2] | Chin Rang Yang, Bruce E Shapiro, Eric D Mjolsness, and G Wesley Hatfield. An enzyme mechanism language for the mathematical modeling of metabolic pathways. Bioinformatics, 21(6):774-780, 2005. [ DOI ] |
[3] | V Wittman, H C Lin, and H C Wong. Functional domains of the penicillinase repressor of Bacillus licheniformis. Journal of Bacteriology, 175(22):7383-7390, 1993. |
[4] | J N Weiss. The Hill equation revisited: uses and misuses. The FASEB journal : official publication of the Federation of American Societies for Experimental Biology, 11(11):835-841, 1997. [ DOI ] |
[5] | Yufang Wang, Ling Guo, Ido Golding, Edward C Cox, and N P Ong. Quantitative transcription factor binding kinetics at the single-molecule level. Biophysical Journal, 96(2):609-620, 2009. [ DOI | arXiv | http ] |
[6] | A L Schaefer, D L Val, B L Hanzelka, J E Cronan, and E P Greenberg. Generation of cell-to-cell signals in quorum sensing: acyl homoserine lactone synthase activity of a purified Vibrio fischeri LuxI protein. Proceedings of the National Academy of Sciences of the United States of America, 93(18):9505-9509, 1996. [ DOI ] |
[7] | K T Samiee, M Foquet, L Guo, E C Cox, and H G Craighead. lambda-Repressor oligomerization kinetics at high concentrations using fluorescence correlation spectroscopy in zero-mode waveguides. Biophysical journal, 88(3):2145-2153, 2005. [ DOI | http ] |
[8] | J H Roe, R R Burgess, and M T Record. Kinetics and mechanism of the interaction of Escherichia coli RNA polymerase with the lambda PR promoter. Journal of molecular biology, 176(4):495-522, 1984. [ DOI ] |
[9] | Benjamin Reeve, Thomas Hargest, Charlie Gilbert, and Tom Ellis. Predicting Translation Initiation Rates for Designing Synthetic Biology. Frontiers in Bioengineering and Biotechnology, 2(January):1-6, 2014. [ DOI | http ] |
[10] | Tarek S Najdi, Tarek S Najdi, Chin-rang Yang, Chin-rang Yang, Bruce E Shapiro, Bruce E Shapiro, G Wesley Hatfield, G Wesley Hatfield, Eric D Mjolsness, and Eric D Mjolsness. A Mathematical Model of Allosteric Regulation for Threonine Biosynthesis in. Jet Propulsion. |
[11] | Jason R Kelly, Adam J Rubin, Joseph H Davis, Caroline M Ajo-Franklin, John Cumbers, Michael J Czar, Kim de Mora, Aaron L Glieberman, Dileep D Monie, and Drew Endy. Measuring the activity of BioBrick promoters using an in vivo reference standard. Journal of biological engineering, 3:4, 2009. [ DOI ] |
[12] | Heidi B Kaplan and E P Greenberg. fischeri Luminescence System. Microbiology, 163(3):1210-1214, 1985. |
[13] | Yasushi Ishihama, Thorsten Schmidt, Juri Rappsilber, Matthias Mann, F Ulrich Hartl, Michael J Kerner, and Dmitrij Frishman. Protein abundance profiling of the Escherichia coli cytosol. BMC genomics, 9(1):102, 2008. [ DOI | http ] |
[14] | A. B. Goryachev, D. J. Toh, and T. Lee. Systems analysis of a quorum sensing network: Design constraints imposed by the functional requirements, network topology and kinetic constants. In BioSystems, volume 83, pages 178-187, 2006. [ DOI ] |
[15] | Stephen S Fong. Computational approaches to metabolic engineering utilizing systems biology and synthetic biology. Computational and Structural Biotechnology Journal, 11(18):28-34, 2014. [ DOI | http ] |
[16] | Jonathan a Bernstein, Arkady B Khodursky, Pei-Hsun Lin, Sue Lin-Chao, and Stanley N Cohen. Global analysis of mRNA decay and abundance in Escherichia coli at single-gene resolution using two-color fluorescent DNA microarrays. Proceedings of the National Academy of Sciences of the United States of America, 99(15):9697-9702, 2002. [ DOI ] |
[17] | Jens Bo Andersen, Claus Sternberg, Lars Kongsbak Poulsen, Sara Petersen Bjø rn, Michael Givskov, and Sø ren Molin. New unstable variants of green fluorescent protein for studies of transient gene expression in bacteria. Applied and Environmental Microbiology, 64(6):2240-2246, 1998. [ arXiv ] |
Contact
Address: Celestijnenlaan 200G room 00.08 - 3001 Heverlee
Telephone: +32(0)16 32 73 19
Email: igem@chem.kuleuven.be