Difference between revisions of "Team:SYSU-Software/Modeling"

 
Line 1: Line 1:
 
{{SYSU-Software}}
 
{{SYSU-Software}}
 
<html>
 
<html>
  <link rel="stylesheet" href="https://2015.igem.org/Team:SYSU-Software/Scientific/CSS?action=raw&ctype=text/css">
+
<link rel="stylesheet" type="text/css" href="https://2015.igem.org/Team:SYSU-Software/Overview/CSS?action=raw&amp;ctype=text/css"/>
  <link rel="stylesheet" type="text/css" href="https://2015.igem.org/Template:SYSU-Software/CSS/Home?action=raw&amp;ctype=text/css">
+
<div id="contentContainer">
  <link rel="stylesheet" type="text/css" href="https://2015.igem.org/Template:SYSU-Software/CSS/BOOT?action=raw&amp;ctype=text/css">
+
 
  <!--script(src="./css/home.js")-->
+
          <h1>Modeling Overview</h1>
  <script src="https://2015.igem.org/Template:SYSU-Software/JS/Home?action=raw&amp;ctype=text/javascript"></script>
+
          <p class = "wenzi">Our whole model is mainly considering about the integrated system providing a set of rules and principles to help modify the ODEs of those devices we quote in the software, which we will make further explanations in the latter paragraphs. To make more reasonable and flexible rules, we have studied several papers and nearly 30 ODE models from the iGEM team participants in the past few years, and notice that they have many structures and constructing rules in common. We choose some meaningful devices of those models and adjust their formulae forms in order to systematically apply them to the numerical simulation and circuits building, hence users can invoke them from our database, combine them and accomplish more complex functions. Any codes of the model we used can be found in <a href = "https://github.com/igemsoftware/SYSU-Software-2015">this Github Page</a>, and details of adjusted formulae of present devices (approximately 50) in database can be seen in <a href="https://static.igem.org/mediawiki/2015/4/43/SYSU-Software_Modeling.zip">attachments(PDF)</a>.<br>
  <script src="https://2015.igem.org/Template:SYSU-Software/JS/BOOT?action=raw&amp;ctype=text/javascript"></script>
+
          Furthermore, users of CORE can build their own formulae in provided form, enter in their own data as parameters-thus a new device of his or her own can be easily created. We also provide users with the automatic function of circuit building and numerical simulation of his new device, as we mentioned in the former sections, if it really works in sense of synthetic biology. In this way, users can easily get reach of the comparison of simulation and his experiment data when parameters change or foresee the circuit production before experiments, which will help avoid the unnecessary steps and testing.</p>
 
+
 
<br>
+
 
<br>
+
          <h2>Model Formation</h2>
<br>
+
 
+
          <h2 class = "fubiao">General Rules Applied</h2>
  <div id="contentContainer">
+
          <h3 class = "xiaobiao">Theorem</h3>
    <h1 class="center bold">Modeling Example: Sunlight Responsor System</h1>
+
          <p class = "wenzi">We’ve studied several papers discussing ODE applications in synthetic biology and dozens of former models of iGEM teams, found that Hill equation, the three forms of which is shown below(cited from Wikipedia[1]), plays the central part in those models:</p>
    <p>Here we raise an example of a specific device models, which includes the source of the device, its formulae with numerical solutions, and comparison with our experiment data as a test of model. We provide this example to better explain the modeling function of our software, and we make further error analysis to examine the accuracy of our models, as an effective proof.</p>
+
          <img src = "https://static.igem.org/mediawiki/2015/c/cc/SYSU-Software_048_mod.jpg">
    <h2 class="bold">System Overview</h2>
+
          <p class = "wenzi">Parameters and signals are shown below:</p>
    <div class="Row">
+
          <img src = "https://static.igem.org/mediawiki/2015/e/ee/SYSU-Software_form_model.jpg">
      <div class="float">
+
          <p class = "wenzi">This equation describes “the fraction of the macromolecule saturated by ligand as a function of the ligand concentration”, and is used in determining the degree of cooperativeness of the ligand binding to the enzyme or receptor. As our model is mainly aiming at the mathematical simulation of the production of circuits, Hill equation can be a cornerstone, which accords with the former ODE models of circuits of other teams.</p>
        <p>This system consists of a sunlight (UVB) sensor and a toggle switch, which is our combination of the two mentioned structures. </p>
+
 
        <p>The E.coli will produce GFP protein without sunlight while they will produce YFP protein under sunlight. As the sunlight sensor, it has a coding sequence of UVR8-TetR protein. This protein can join into a dimer which can act like the TetR dimer to induce the promoter TetR. When the E.coli is exposed to sunlight, the UVR8-TetR protein dimer will become a monomer so that the promoter TetR will not be induced.</p>
+
 
      </div>
+
          <h3>Limitations and Assumptions</h3>
      <div class="float right img"><img src="https://static.igem.org/mediawiki/2015/c/c8/SYSU-Software_New_Demo.png" alt="" width="100%">
+
          <p class = "wenzi">In our ODE model of certain circuits we only consider about the transcription of mRNA, the translation of protein and the inner promotion or inhibition of other chemicals. We also make the assumption that when a coding sequence promotes or inhibits the production of other substances, it takes its effect through the translation of corresponding proteins.</p>
        <p class="imgDescription italic">Circuit Graph</p>
+
          <p class = "wenzi">
      </div>
+
          We choose not to count the influences as below into our models:</p>
    </div>
+
          <b>·Leakage</b>
    <h2 class="bold">Formulae</h2>
+
          <div class = "wenzi dd">For some operons, leakage does exist. For lac operons, repressor protein LacI can bind to the operator of lac operons and repress the transcription of lac operons in the absence of lactose. However, chances are that the concentration of LacI deviates from its normal one and no LacI binds to the operator. Under such circumstance the repressor protein LacI cannot repress the transcription from lac promoter. <br>
    <p>Assume that <img src="https://static.igem.org/mediawiki/2015/1/17/SYSU-Software_forPic.png" alt="">, which is a t-based function that represents the trend of UVB, to serve as the human-defined condition of the whole device. The whole equation group is shown as below, obeying the rule system we introduced in the former modeling page:
+
          However, we can neglect the rate of leakage for the following two reasons. First, the rate of leakage is very low compared with the strength of LacI binding to lac operator.</div>
      <div class="center"><img src="https://static.igem.org/mediawiki/2015/a/a8/SYSU-Software_forPic2.png" alt=""></div><br>
+
          <b>· The distribution of RNA polymerases, ribosomes and chemicals</b>
      <div class="center"><img src="https://static.igem.org/mediawiki/2015/d/d5/SYSU-Software_forPic3.png" alt=""></div><br>
+
          <div class = "wenzi dd"> The molecular weight of RNA polymerases, ribosomes and chemicals is small compared to the size of a cell, and the number of RNA polymerases, ribosomes and chemicals is limited. So the distribution of these materials in a cell is uneven. The unevenness might affect the binding efficiency of protein and materials, the dissociation efficiency, and the local concentration of one particular materials.<br>
      <div class="center"><img src="https://static.igem.org/mediawiki/2015/d/df/SYSU-Software_forPic4.png" alt=""></div><br>
+
          However, metabolism of the cell can reduce or eliminate the unevenness of distribution in cells. Based on the theory of <a href = "https://en.wikipedia.org/wiki/Flux_balance_analysis">flux balanced analysis</a>, for a particular metabolism pathway the rate of consumption of inputs and the production of outputs is equal. So the concentration of materials in the cells remains (nearly) unchanged. Further, the inner mass of a living cells is under constant movement; this kind of moving can reduce or eliminate the unevenness of distribution in cells.<br>
      <div class="center"><img src="https://static.igem.org/mediawiki/2015/e/e0/SYSU-Software_forPic5.png" alt=""></div><br>
+
          So we assume that the distribution of RNA polymerases, ribosomes and chemicals are even.</div>
      <div class="center"><img src="https://static.igem.org/mediawiki/2015/b/be/SYSU-Software_forPic6.png" alt=""></div><br>
+
          <b>·The growth of cells</b>
      <p>In which we introduce five Eigen functions, signaled as χ , to help make judge of certain terms in equations. Thus when the symbol substance appears, the χ function value will equals to 1, making sense of the term including it; otherwise χ function value equals to 0. By Eigen functions we create the switch of important conditions or changes of some chemicals or necessary structures in equations, to reach the flexibility for users to lead in or move out something in the circuits.</p>
+
          <div class = "wenzi dd"> Cells will grow in size, and the concentration of materials will change with the growing cells. However, in iGEM teams usually use bacteria as chassis, and E. coli divides every 20 minutes. So the time with which E. coli grows is short, and the growth of cells can be neglected.</div>
      <h2 class="bold">Table of Parameters</h2>
+
          <b>·The division of cells</b>
      <div class="Row center"><img src="https://static.igem.org/mediawiki/2015/2/2c/SYSU-Software_model_form1.jpg" alt=""></div>
+
          <div class = "wenzi dd"> Cell might divide into two, and the concentration of materials will reduce to half of its original. However, the distribution of materials into two new daughter cells is not determined. For example, when the yeast cells divides, the new cells will be much smaller than their mother cells, and obviously the number of different kind of materials in mother cells and daughter cells is different. (Although the concentration can be thought of as remain the same.)
      <div class="Row center"><img src="https://static.igem.org/mediawiki/2015/9/90/SYSU-Software_model_form2.jpg" alt=""></div>
+
          Because the division of cells into two cells, no matter whether the mother cells and the daughter cells are of the same size or different sizes, the concentration of these materials remain roughly the same, we can neglect the impact of division of cells on the concentration of two daughter cells.</div>
    </p>
+
          </p>
    <h2 class="bold">Comparison between Numerical Simulation and Experiment Results</h2>
+
 
    <p>Based on the differential equations, we use the modeling function of CORE to make graphs of the dynamic performance of certain parts chosen by users. In the system we mainly observe the performance of YFP Protein and GFP Protein, as they are signaled by the light density they produced. The flexibility of toggle switch as regulators in circuits can be shown in its behavior pattern that the YFP or GFP protein dominates while the other one is close to 0, and they have the completely opposite varying trends. The boundary conditions and parameter values can be reset if different performance needed, with different slopes and initial values. In the example, we set the initial values of all variables as zero, which accords with the initial state of the experiment. (Users can also rewrite those values.)</p>
+
          <p>
    <p>In the graphing field of CORE we can easily get the graph of simulating dynamic performance of YFP, GFP and CI, as shown below:</p>
+
          We also need to point out the limitations of modeling:<br>
    <div class="center"><img src="https://static.igem.org/mediawiki/2015/c/c0/SYSU-Software_model_jietu.jpg" alt="" width="100%"></div>
+
          <div class = "wenzi dd"> · We does not consider the impact of cell types and cell strains on the constant of formulae. As we all know,  different kinds of cells contains different concentration of the same materials. So if we consider the impact of cell types and cell strains on the constant of formulae, the formulae will be more accurate. However, due to the limited time we spent on the formulae, it is impossible for the formulae to take this into considerations.</div>
    <p>So based on the model we can see in the graph that YFP is produced, while the production of GFP is inhibited. The result echoes the simulation result run from MATLAB, shown in the graph below:</p>
+
          <div class = "wenzi dd"> · In our formulae we have considered the impact of binding efficiency and dissociation efficiency of two materials (like, a protein and its ligands). However, as a software team, it is difficult for us to obtain the exact values of the binding constants. </div>
    <div class="Row center">
+
          <div class = "wenzi dd"> · In the formulae we adopt the Hill functions as the blueprints of our formulae. Although many other kinds of formulae can be adopted, we adopt the Hill functions because they are mostly used by teams of previous years. So, in order to construct a mathematical model that is of universal use, we choose to adopt the Hill equation as the basis of our equations.</div>
      <div class="float"><img src="https://static.igem.org/mediawiki/2015/d/da/SYSU-Software_model_yfp.jpg" alt=""></div>
+
          </p>
      <div class="float"><img src="https://static.igem.org/mediawiki/2015/f/f0/SYSU-Software_model_yfp2.jpg" alt=""></div>
+
 
    </div>
+
          <h2 class = "fubiao">Basic Form of ODEs</h2>
    <p>We run our model (left curve) and experiment data (right curve) at the same time on MATLAB platform (the time interval is set as 10 hours according to the experiment time) and find the curves are close. Further, we make the fitted curve of the experiment data and find that it has the approaching trend with the simulation curve of the model, as shown below:</p>
+
          <h3 class = "xiaobiao">Formulae Form concerning genes and proteins</h3>
    <div class="Row center"><img src="https://static.igem.org/mediawiki/2015/2/24/SYSU-Software_model_fitted.jpg" alt="" width="100%"></div>
+
          <p class = "wenzi">We set the basic Concentration-Time ODEs form of a certain protein α as below:</p>
    <h2 class="bold">Error Explanation</h2>
+
          <img src = "https://static.igem.org/mediawiki/2015/a/af/SYSU-Software_227_mod.jpg">
    <p>But there is still some inevitable errors. We can see fluctuation in the data of the 2nd, 3rd and 4th hour. This may be caused by random error of the plate reader. The data curve went more steadily from 5th to 8th hour than the simulate one. That's because we only used one tube of culture for measurement. Every time we took out 200μ culture for measurement, some of the bacteria were lost, and the growth were also paused for a moment due to the low temperature. This influence is especially obvious in the first few hours, but the two curves would finally be closer to each other. From 8th to 10th hour, the data curve rose much more rapidly because in this period, the loss of bacteria caused by measurement can be neglected. As there were less bacteria in the earlier period, the resources and space made it possible for bacteria to grow more rapidly.</p>
+
          <p class = "wenzi">in which Х<SUB>Promoter</SUB> is an Eigen function of the promoter of the circuit lining with α. In other words, when the needed promoter exists, the function value equals to 1; otherwise equals to 0. Depending on the judging mechanism, we can write codes to decide the function mode in different situations. Here we only raise one example of Eigen functions, but actually it is widely used in our detailed models for circuit devices, serving for other judging rules. k<SUB>α</SUB> represents the production or translation rate of α<SUB>protein</SUB>, and d represents the degradation rate of α<SUB>protein</SUB>. [α<SUB>gene</SUB>], [α] mean the concentration of α<SUB>gene</SUB> and α<SUB>protein</SUB>. In the basic form we only consider the production and degradation of protein.</p>
    <h2 class="bold">Device Sources</h2>
+
 
    <p>We pick two devices from our database, sourcing from the former iGEM teams: Bologna, <a href="https://2008.igem.org/Team:Bologna">(https://2008.igem.org/Team:Bologna) </a>and ETH-Zurich, <a href="https://2012.igem.org/Team:ETH_Zurich">(https://2012.igem.org/Team:ETH_Zurich) </a>as they made study on toggle switch and sunlight sensor, which are representative and useful structures in circuit design. We made some modifications to the devices and put them together to build a system.</p>
+
 
  </div>
+
          <h3>Formulae Form of Promotion and Inhibition Relationships</h3>
  <!-- end content-->
+
          <p class = "wenzi">Promotion and inhibition happen in nearly all the circuits, and if we want to design more complex functions of certain circuits, we usually add specific proteins or chemicals, or build up some inter-inhibited structures to control the protein production as expected. When these human-added control are demonstrated in the formulae, they will be acting as the changes of parameters or the concentration of some variables. This kind of promotion and inhibition are shown in fixed form of terms in formulae, mainly based on the Hill equation form. Examples are given as below:<br>
</div>
+
          (The complex <b><i>LuxU</i></b> promotes the production of <b><i>LuxUp</i></b>, chosen from the model of <b><i>Device: Receptor</i></b>)</p>
 +
          <img src = "https://static.igem.org/mediawiki/2015/8/84/SYSU-Software_429_mod.jpg">
 +
          <p class = "wenzi">In the relation we set the parameters <b><i>β</i></b> as the production rate of <b><i>LuxUp</i></b> Protein, as the degradation rate of <b><i>LuxUp</i></b> Protein. The inhibition effect from LuxU to <b><i>LuxUp</i></b> is defined in the term <img src = "https://static.igem.org/mediawiki/2015/e/e6/SYSU-Software_538_mod.jpg" id = "md">, in which <b><i>k</i></b><SUB>cu</SUB> represents the repression coefficient of <b><i>LuxU</i></b>. Obviously, when the concentration of <b><i>LuxU</i></b> increases, the amount of <b><i>LuxUp</i></b> will correspondingly increase.
 +
          (The exterior <b><i>IPTG</b></i> inhibits the function of <b><i>LacI</b></i> protein, chosen from the model of <b><i>Device: Biomaker</b></i>)</p>
 +
          <img src = "https://static.igem.org/mediawiki/2015/f/ff/SYSU-Software_242_mod.zip">
 +
          <p class = "wenzi">In the relation we named <b><i>LacI</b></i><SUP>F</SUP> to represent the effective portion of <b><i>LacI</b></i> protein, by which we in fact lower the concentration of LacI protein. The concentration of <b><i>IPTG</b></i>, which is signaled as <b><i>[IPTG]</b></i>, is set in the denominator to express its effect of inhibition. The parameter k<SUB>IPTG</SUB> is used as standardization, to better measure the <b><i>IPTG</b></i> effect on <b><i>LacI</b></i> Protein. Obviously, the Hill coefficient here is set as 1.<br>
 +
          Hereby we clearly state the easiest form of promotion and inhibition relationships between two parts. But sometimes they will be shown more complex but in structured way, when a chemical is influenced by more than one substance, for example:<br>
 +
          (The production of <b><i>CI λ</i></b> is inhibited simultaneously by <b><i>FAdP</i></b> and <b><i>Rhamnose</i></b>, chosen from the model of <b><i>Device: Kill Switch</b></i>)</p>
 +
          <img src = "https://static.igem.org/mediawiki/2015/d/d2/SYSU-Software_001_mod.zip">
 +
          <p class = "wenzi">Likewise, we use the form of Hill equation to indicate the relationship between <b><i>FAdP</b></i> and <b><i>CI λ</b></i>,<b><i> Rhamnose</b></i> and <b><i>CI λ</b></i>. When <b><i>FAdP</b></i> does not exist, the term of <b><i>FAdP</b></i> will equals to 1, which does not make influence on the whole function, so does <b><i> Rhamnose</b></i>. The multi-one relationship is demonstrated as the product of terms of each factor.
 +
          If there exists chain reactions in devices (actually it is very common), our measures is <b>to make simultaneous equations to reach the combined solution</b>, which is an important method when dealing with the multi-relationships in devices. Further can be described in details <a href = "https://2015.igem.org/Team:SYSU-Software">in the example subpage</a>.</p>
 +
          <p class = "wenzi">We have to mention the situation when a promoter is promoted or inhibited by chemicals, as it actually occupies a significant part of our formulae of devices. We acquiesce that the influence will be forced on the translation and transcription of coding sequences that link with the promoter, by lowering the effective gene amount in functions. Also, as we mentioned before, there will be an Eigen function of the promoter, in order to reach the automatic recognition of it.</p>
 +
 
 +
 
 +
          <h3>Explanation for our work</h3>
 +
          <p class = "wenzi">Our major work in modeling is to rewrite the models of present devices under our mechanism, which includes nearly 50 device models and its related formulae. All the models can be seen in the <a href = "https://2015.igem.org/Team:SYSU-Software">attachments (PDF)</a>. The formulae can be divided into two parts:ones concerning only two parts, while others expressing the variation of the concentration of one central variable and all other linked factors. Each formula may get its distortions considering the multi-solutions whether those influencing factors exist, all of which are calculated and stored in our database. With the former ones our software can automatically draw the circuit graphs in design field, and the latter ones are widely used when users are making numerical simulation of the circuits.  
 +
          We also take into consideration that the devices are put together and users may need to add some extra relations between devices. As we stated before, we already have independent formulae systems for each device, so the software will recognize and put all the related equations together, then finding the combined solutions using SciPy. (More details about the recognition process, see the algorithm in <a href = "https://2015.igem.org/Team:SYSU-Software">Back-end page</a>.) If users want to add their new relations to the circuits, they need to enter the accurate equations and parameters with their values and units in the form we required, so that the front end can add new lines to the circuit graph, and the back end will store the new equations if succeed, which is also a part of crowdsourcing in the software.
 +
          Due to the limited time, we have not finish the function to show the specific properties of analytic solutions of those simultaneous ODE equations; we only give their numerical solutions but that is enough to forecast the possible trend of concentration of certain parts.
 +
 
 +
          <a href = "https://2015.igem.org/Team:SYSU-Software">Turn to the example page</a>
 +
          </p>
 +
 
 +
          <h2 class = "juzhong">Simulation Function</h2>
 +
          <p class = "wenzi">Here we briefly introduce the numerical simulation function in modeling field of our software. Below is a demo of the
 +
          simulation of a given model, in which concentration-time curves of chosen parts are shown in the same graph. In this
 +
          graph, CI, UVR8-TetR, GFP, TetR, YFP and UVB curves are drawn in the selected time period.</p>
 +
 
 +
          <img src = "https://static.igem.org/mediawiki/2015/c/c9/SYSU-Software_324_mod.png" id = "teshua">
 +
 
 +
          <div class ="tuwen">
 +
              <div class = "tuwenwen wenzi">
 +
                  In the same operating area, users can define the simulation time
 +
                  and initial quantities of the necessary variables in equations. If
 +
                  there is no inputs, all the conditions will be set as fixed values in
 +
                  the software.
 +
              </div>
 +
              <img src = "https://static.igem.org/mediawiki/2015/f/f8/SYSU-Software_256_mod.png">
 +
          </div>
 +
 
 +
          <h2 class = "juzhong">Searching Function of Equations</h2>
 +
          <p class = "wenzi">Users can find all the possible equations now existing in the database through the searching function in design area, if it does have. The choosing buttons and answer demo are shown below:</p>
 +
 
 +
          <div id = "tutu">
 +
              <img src = "https://static.igem.org/mediawiki/2015/a/ae/SYSU-Software_842_mod.png" id = "tu_zuo">
 +
              <img src = "https://static.igem.org/mediawiki/2015/1/1d/SYSU-Software_024_mod.png" id = "tu_you">
 +
          </div>
 +
 
 +
          <h2 class = "juzhong">Self-defined Function of Equations</h2>
 +
 
 +
          <div class ="tuwen">
 +
              <div class = "tuwenwen wenzi">
 +
                  In the design area users can add their self-defined equations, if
 +
                  there is no existing ones, which will be stored in the shared
 +
                  database so that all the users can invoke them. NOTE that the
 +
                  inputs should meet the requirements (rules) in the guidance of
 +
                  Help. Correspondingly, the circuit graph in design function will be
 +
                  updated along with the new relationship lines.
 +
              </div>
 +
              <img src = "https://static.igem.org/mediawiki/2015/3/39/SYSU-Software_554_mod.png">
 +
          </div>
 +
        </div>
 
</html>
 
</html>
 
{{SYSU-Software/FOOTER}}
 
{{SYSU-Software/FOOTER}}

Latest revision as of 17:26, 1 October 2015

SYSU-SOFTWARE IGEM 2015

Modeling Overview

Our whole model is mainly considering about the integrated system providing a set of rules and principles to help modify the ODEs of those devices we quote in the software, which we will make further explanations in the latter paragraphs. To make more reasonable and flexible rules, we have studied several papers and nearly 30 ODE models from the iGEM team participants in the past few years, and notice that they have many structures and constructing rules in common. We choose some meaningful devices of those models and adjust their formulae forms in order to systematically apply them to the numerical simulation and circuits building, hence users can invoke them from our database, combine them and accomplish more complex functions. Any codes of the model we used can be found in this Github Page, and details of adjusted formulae of present devices (approximately 50) in database can be seen in attachments(PDF).
Furthermore, users of CORE can build their own formulae in provided form, enter in their own data as parameters-thus a new device of his or her own can be easily created. We also provide users with the automatic function of circuit building and numerical simulation of his new device, as we mentioned in the former sections, if it really works in sense of synthetic biology. In this way, users can easily get reach of the comparison of simulation and his experiment data when parameters change or foresee the circuit production before experiments, which will help avoid the unnecessary steps and testing.

Model Formation

General Rules Applied

Theorem

We’ve studied several papers discussing ODE applications in synthetic biology and dozens of former models of iGEM teams, found that Hill equation, the three forms of which is shown below(cited from Wikipedia[1]), plays the central part in those models:

Parameters and signals are shown below:

This equation describes “the fraction of the macromolecule saturated by ligand as a function of the ligand concentration”, and is used in determining the degree of cooperativeness of the ligand binding to the enzyme or receptor. As our model is mainly aiming at the mathematical simulation of the production of circuits, Hill equation can be a cornerstone, which accords with the former ODE models of circuits of other teams.

Limitations and Assumptions

In our ODE model of certain circuits we only consider about the transcription of mRNA, the translation of protein and the inner promotion or inhibition of other chemicals. We also make the assumption that when a coding sequence promotes or inhibits the production of other substances, it takes its effect through the translation of corresponding proteins.

We choose not to count the influences as below into our models:

·Leakage
For some operons, leakage does exist. For lac operons, repressor protein LacI can bind to the operator of lac operons and repress the transcription of lac operons in the absence of lactose. However, chances are that the concentration of LacI deviates from its normal one and no LacI binds to the operator. Under such circumstance the repressor protein LacI cannot repress the transcription from lac promoter.
However, we can neglect the rate of leakage for the following two reasons. First, the rate of leakage is very low compared with the strength of LacI binding to lac operator.
· The distribution of RNA polymerases, ribosomes and chemicals
The molecular weight of RNA polymerases, ribosomes and chemicals is small compared to the size of a cell, and the number of RNA polymerases, ribosomes and chemicals is limited. So the distribution of these materials in a cell is uneven. The unevenness might affect the binding efficiency of protein and materials, the dissociation efficiency, and the local concentration of one particular materials.
However, metabolism of the cell can reduce or eliminate the unevenness of distribution in cells. Based on the theory of flux balanced analysis, for a particular metabolism pathway the rate of consumption of inputs and the production of outputs is equal. So the concentration of materials in the cells remains (nearly) unchanged. Further, the inner mass of a living cells is under constant movement; this kind of moving can reduce or eliminate the unevenness of distribution in cells.
So we assume that the distribution of RNA polymerases, ribosomes and chemicals are even.
·The growth of cells
Cells will grow in size, and the concentration of materials will change with the growing cells. However, in iGEM teams usually use bacteria as chassis, and E. coli divides every 20 minutes. So the time with which E. coli grows is short, and the growth of cells can be neglected.
·The division of cells
Cell might divide into two, and the concentration of materials will reduce to half of its original. However, the distribution of materials into two new daughter cells is not determined. For example, when the yeast cells divides, the new cells will be much smaller than their mother cells, and obviously the number of different kind of materials in mother cells and daughter cells is different. (Although the concentration can be thought of as remain the same.) Because the division of cells into two cells, no matter whether the mother cells and the daughter cells are of the same size or different sizes, the concentration of these materials remain roughly the same, we can neglect the impact of division of cells on the concentration of two daughter cells.

We also need to point out the limitations of modeling:

· We does not consider the impact of cell types and cell strains on the constant of formulae. As we all know, different kinds of cells contains different concentration of the same materials. So if we consider the impact of cell types and cell strains on the constant of formulae, the formulae will be more accurate. However, due to the limited time we spent on the formulae, it is impossible for the formulae to take this into considerations.
· In our formulae we have considered the impact of binding efficiency and dissociation efficiency of two materials (like, a protein and its ligands). However, as a software team, it is difficult for us to obtain the exact values of the binding constants.
· In the formulae we adopt the Hill functions as the blueprints of our formulae. Although many other kinds of formulae can be adopted, we adopt the Hill functions because they are mostly used by teams of previous years. So, in order to construct a mathematical model that is of universal use, we choose to adopt the Hill equation as the basis of our equations.

Basic Form of ODEs

Formulae Form concerning genes and proteins

We set the basic Concentration-Time ODEs form of a certain protein α as below:

in which ХPromoter is an Eigen function of the promoter of the circuit lining with α. In other words, when the needed promoter exists, the function value equals to 1; otherwise equals to 0. Depending on the judging mechanism, we can write codes to decide the function mode in different situations. Here we only raise one example of Eigen functions, but actually it is widely used in our detailed models for circuit devices, serving for other judging rules. kα represents the production or translation rate of αprotein, and d represents the degradation rate of αprotein. [αgene], [α] mean the concentration of αgene and αprotein. In the basic form we only consider the production and degradation of protein.

Formulae Form of Promotion and Inhibition Relationships

Promotion and inhibition happen in nearly all the circuits, and if we want to design more complex functions of certain circuits, we usually add specific proteins or chemicals, or build up some inter-inhibited structures to control the protein production as expected. When these human-added control are demonstrated in the formulae, they will be acting as the changes of parameters or the concentration of some variables. This kind of promotion and inhibition are shown in fixed form of terms in formulae, mainly based on the Hill equation form. Examples are given as below:
(The complex LuxU promotes the production of LuxUp, chosen from the model of Device: Receptor)

In the relation we set the parameters β as the production rate of LuxUp Protein, as the degradation rate of LuxUp Protein. The inhibition effect from LuxU to LuxUp is defined in the term , in which kcu represents the repression coefficient of LuxU. Obviously, when the concentration of LuxU increases, the amount of LuxUp will correspondingly increase. (The exterior IPTG inhibits the function of LacI protein, chosen from the model of Device: Biomaker)

In the relation we named LacIF to represent the effective portion of LacI protein, by which we in fact lower the concentration of LacI protein. The concentration of IPTG, which is signaled as [IPTG], is set in the denominator to express its effect of inhibition. The parameter kIPTG is used as standardization, to better measure the IPTG effect on LacI Protein. Obviously, the Hill coefficient here is set as 1.
Hereby we clearly state the easiest form of promotion and inhibition relationships between two parts. But sometimes they will be shown more complex but in structured way, when a chemical is influenced by more than one substance, for example:
(The production of CI λ is inhibited simultaneously by FAdP and Rhamnose, chosen from the model of Device: Kill Switch)

Likewise, we use the form of Hill equation to indicate the relationship between FAdP and CI λ, Rhamnose and CI λ. When FAdP does not exist, the term of FAdP will equals to 1, which does not make influence on the whole function, so does Rhamnose. The multi-one relationship is demonstrated as the product of terms of each factor. If there exists chain reactions in devices (actually it is very common), our measures is to make simultaneous equations to reach the combined solution, which is an important method when dealing with the multi-relationships in devices. Further can be described in details in the example subpage.

We have to mention the situation when a promoter is promoted or inhibited by chemicals, as it actually occupies a significant part of our formulae of devices. We acquiesce that the influence will be forced on the translation and transcription of coding sequences that link with the promoter, by lowering the effective gene amount in functions. Also, as we mentioned before, there will be an Eigen function of the promoter, in order to reach the automatic recognition of it.

Explanation for our work

Our major work in modeling is to rewrite the models of present devices under our mechanism, which includes nearly 50 device models and its related formulae. All the models can be seen in the attachments (PDF). The formulae can be divided into two parts:ones concerning only two parts, while others expressing the variation of the concentration of one central variable and all other linked factors. Each formula may get its distortions considering the multi-solutions whether those influencing factors exist, all of which are calculated and stored in our database. With the former ones our software can automatically draw the circuit graphs in design field, and the latter ones are widely used when users are making numerical simulation of the circuits. We also take into consideration that the devices are put together and users may need to add some extra relations between devices. As we stated before, we already have independent formulae systems for each device, so the software will recognize and put all the related equations together, then finding the combined solutions using SciPy. (More details about the recognition process, see the algorithm in Back-end page.) If users want to add their new relations to the circuits, they need to enter the accurate equations and parameters with their values and units in the form we required, so that the front end can add new lines to the circuit graph, and the back end will store the new equations if succeed, which is also a part of crowdsourcing in the software. Due to the limited time, we have not finish the function to show the specific properties of analytic solutions of those simultaneous ODE equations; we only give their numerical solutions but that is enough to forecast the possible trend of concentration of certain parts. Turn to the example page

Simulation Function

Here we briefly introduce the numerical simulation function in modeling field of our software. Below is a demo of the simulation of a given model, in which concentration-time curves of chosen parts are shown in the same graph. In this graph, CI, UVR8-TetR, GFP, TetR, YFP and UVB curves are drawn in the selected time period.

In the same operating area, users can define the simulation time and initial quantities of the necessary variables in equations. If there is no inputs, all the conditions will be set as fixed values in the software.

Searching Function of Equations

Users can find all the possible equations now existing in the database through the searching function in design area, if it does have. The choosing buttons and answer demo are shown below:

Self-defined Function of Equations

In the design area users can add their self-defined equations, if there is no existing ones, which will be stored in the shared database so that all the users can invoke them. NOTE that the inputs should meet the requirements (rules) in the guidance of Help. Correspondingly, the circuit graph in design function will be updated along with the new relationship lines.