Team:WLC-Milwaukee/Modeling
Modeling
Here is where we should put a brief description of the content that fits under the SURVEYS category:
In order to probe further into the feasibility of our project, we decided we needed to address the fact that bacteria evolve in response to bacteriophages. The basis of our project is that gram-negative bacteria without a working tolC gene (be it knocked-out, misfolding, or incomplete) will be unable to form a complete antibiotic-efflux protein complex. As a result bacteria without a functioning tolC gene will show an increased sensitivity to antibiotics normally resisted through efflux; we demonstrated this experimentally using a Kirby-Bauer assay and the antibiotics Novobiocin and Erythromycin .
Since evolution is driven by random mutations being selected for, we thought this was an achievable simulation. In order to simulate this, we would need a way to simulate the insertion of random non-silent mutations into a tolC gene/protein, a way to predict whether or not these mutations were damaging to the function of the TolC protein, and a way to predict whether or not these would affect TolC-mediated phage binding. For the sake of simplicity we considered only point mutations to the coding-region of the DNA. We divided the mutations into 4 main types:
- A nonsense mutation which turns an amino acid into a stop codon; this will prevent a full TolC protein from being translated, preventing both antibiotic efflux and phage binding.
- A missense mutation which prevents the TolC monomers from forming a trimer. In these we included strongly polar or charged amino acids being mutated into the beta-barrel regions, the insertion of Valine, Threonine, Isoleucine, Serine, Aspartate, Asparagine, or Proline in the alpha helices (these amino acids are not accommodated in an alpha helix) which are critical for TolC trimerization, and in addition 1/4 of any other mutations in the alpha helices (we assume the amino acids on one face of each helix are critical for interaction with other monomers; there are ~3.6 amino acids per turn).
- A base-pair substation in the extracellular loops of the TolC protein. These mutations do not affect the assembly or function and therefore do not affect antibiotic resistance. We assumed any of these would result in phage-resistance.
- Any mutation not fitting into the above three categories we assumed would not affect assembly/function or the ability of bacteriophages to recognize the protein. Therefore these show no effect on phage resistance or antibiotic sensitivity.
Example of mutation program input (E. coli tolC sequence)
With these parameters, we constructed a program to simulate random point mutations. C++ was chosen for its universality as well as our familiarity with it. At the base of the program is a codon class; this contains room for 3 characters to be filled with 3 nucleotides from the DNA sequence. When filled or altered, the class analyzes itself to determine which amino acid it represents, its polarity, and its charge. Codons have the ability to mutate themselves (pick one of their three constituent bases and change it to one of the other three nucleotides), as well as the ability to return their contents, polarity, charge, and amino acid. Codons were organized and used in a single-linked list controlled by a polypeptide class. The polypeptide class stored a string of higher-level information about each codon in a parallel single-linked list; the information stored here was used to store what secondary structure the specific amino acid was in. When told to mutate, the Polypeptide class used a random number generator (rand() from stdio.h) seeded with the current time to pick which amino acid will be mutated; the randomly selected amino acid is then told to mutate.
Example of the file used to associate secondary structures with codons
Handling Input/Output as well as analysis of the mutations (for us, this included predicting function destruction and if it affected phage binding) was done in a main class, so the underlying classes could easily be adapted for other projects. Input of the DNA sequence was done by parsing a plain-text sequence from a file. The amino acid properties is input from a plain text file and parsed as an integer and a string in which the string is the property of the amino acid and the integer designates how many amino acids it applies to. The user inputs how many mutations are desired through the CLI. Information about each mutation is outputted into a csv file (comma separated values) and at the end the program prints the frequency of amino acids changing their charge (or lack of it), changing their polarity, turning into premature stops as well as the percentage of mutations that are predicted to destroy function of the TolC and the percentage of mutations which would interfere with phage binding. Assigning the tolC codons to a specific secondary structure was based on previous research .
Here is where we should put a brief description of the content that fits under the RESULTS category:
Limitations
Our model is fairly simplistic in its nature. It is limited in the scope of mutations, only handeling missense and nonsense mutations. Insertions and deletions/frameshift mutations would add considerable layer of complexity in programming both how to generate these mutations as well as analyzing the effects of these mutations. Our model also limited itself to the coding portions of the genes. The effect of mutations on any control sequences (promoter, RBS, ect…) would be more difficult to predict because their effects are not due to amino acids, but rather their interactions with other molecules in the cell. One could hypothesize that a mutation either upregulating or downregulating gene expression could work in favor of our project; downregulation (or stopping transcription) would likely make the bacteria hypersensistive to the antibiotics, and an upregulation could increase the likely hood that that individual bacterium would be infected by a bacteriophage (more receptors = more binding opportunities) . We, however, are not prepared to make that claim.
ResultsResults are output as a .csv file which can be opened using a program like Microsoft Excel or Apple’s Numbers for further analysis. On a run of 1000 mutants on E. coli’s tolC gene we got the following numbers:
Changes of charge | 358/1000 |
Polarity changes | 338/1000 |
premature stops added | 46/1000 |
Predicted Function Destroying Mutations | 395.25/1000 |
Predicted mutations preventing phage binding | 446.25/1000 |
Meaning out of every 1000 tolC mutants we would predict approximately 446.25 to be resistant to a TolC-binding bacteriophage, and 553.75 to be infectable. Of the phage-insensitive mutants, we would predict 395.5 (88.6%) to be hypersensitive to (what would be under normal circumstances) efflux-resisted antibiotics, and 50.75 (11.4%) to be both phage insensitive as well as possessing the normal antibiotic tolerance.
Screenshot of the output .csv file opened in Excel
Here is where we should put a brief description of the content that fits under the Explore category:
A model is only as informative as it is accurate. Knowing this we had a plan to test whether or not our model was actually giving us something resembling the reality. We planned a week-long protocol for evolving and analyzing mutants resistant to the TolC-binding TLS bacteriophage (referred to as TLS phage). The protocol is outlined is as follows:
Day 0 | Take 250µL of overnight grown culture of ΔtolC E. coli (knockout-tolC E. coli) with the pUC57 plasmid containing the E. coli K12 tolC gene mix with 50 µL of TLS phage stock and plate in 4mL 50µg/mL Ampicillin (Luria Broth) Top Agar on an LB-Amp plate (Make 1 per person). |
Day 1 | Pick 10-15 mutant colonies and streak them onto a 1/6 or 1/8 sectioned LB-Amp plate. |
Day 2 |
Pick 1 mutant colony from each colony streaked and grow them overnight in a LB-Amp liquid culture. Grow enough ΔtolC E. coli cells for 10-15 transformations. (Electroporation or Heat Shock) in a LB liquid culture. |
Day 3 |
Miniprep the 10-15 cultures grown from picked colonies, carefully labeling the resulting plasmids so you can distinguish one from another. Prepare the ΔtolC E. coli cells as competent cells (Electrocompetent or Thermocompetent for whichever method you prefer). Transform with the 10-15 plasmids you just miniprepped, being extra sure that you know which transformation is from which plasmid. Plate transformation on LB-Amp plates after the resting period. |
Day 4 |
Make freezer stocks of successfully transformed bacteria, making sure you know which stock goes with which plasmid. Perform cross-streaks on Amp plates with TLS phage crossing every successfully transformed sample. |
Day 5 |
Make stocks of the samples whose bacteria are NOT killed by cross-streaks (these ones we are now sure have a TLS-inhibiting mutation in the E. coli TolC gene) Throw out other samples and their corresponding minipreps. |
The idea is that we would take our knockout tolC (ΔtolC) E. coli strain and transform them with a plasmid containing the E. coli tolC gene (behind a pBAD promoter and a strong E. coli RBS) and an ampicillin-resistance gene. We would then grow them as if we were making a lawn, but mix in an abundance of TLS phage and a working concentration of ampicillin. It was hoped that the ampicillin would prevent any bacteria without the plasmid from growing. The TLS phage would provide the selective pressure to select for phage resistant mutants. One of these plates is pictured.
It was immediately noted that there was considerable variability in the size of colonies (initially presumed to be caused by different mutations. 12 colonies were picked making sure to get a variety of colony sizes; none of these grew on LB-Amp. 15 colonies were picked from a similar plate with identical results. It is highly likely that the colonies picked were satellite colonies that had abandoned their plasmids under selective pressure from the TLS phage.
Attempting the experiment with 2, 4, and 6 times the working concentration of ampicillin resulted in approximately 10% of picked colonies growing. Due to time constraints further experimentation was halted.