Team:Vanderbilt/Project/Sequence
To truly demonstrate that our mutation optimization procedure reduces the physical occurrence of DNA mutations, we had to take an in vitro approach. We needed methods for visualizing the incidence of mutagenic DNA lesions on purified samples of DNA. Ideally, these methods would be compatible with a wide variety of DNA lesions, and would have specificity to the particular lesion that we are trying to quantify.
After searching through the published literature, we were unable to find any single protocol that met all of these criteria. However, we did identify several that each had components of our ideal assay. Therefore, we decided to adapt protocols from several sources in order to develop a reliable, high-resolution method for quantifying virtually any form of DNA damage.
That starting point for our approach was a class of DNA repair enzymes. What these enzymes have in common is that they recognize certain forms of mutagenic lesions on a DNA helix, where they then hydrolyze the DNA strand's sugar-phosphate backbone (in addition to other potential activities, such as excising entire bases). These enzymes were excellently suited to give us the lesion specificity and targeting diversity that we were looking for. By selecting different repair enzymes, we could tailor the range of substrates that are targeted: for example, NEIL1 recognizes most forms of oxidized DNA bases, while FPG is restricted to 8-oxoguanine. There are enzymes for oxidation, ultraviolet radiation, alkylation, and more, which spans effectively all of the hotspots that we were interested in studying.
To visualize and quantify the enzymatically-processed DNA samples, we investigated multiple options. One of these was looking at the conversion of plasmids from their usual supercoiled confirmation to a nicked confirmation. Another utilizes alkaline denaturing electrophoresis to separate the DNA into single strands, which then can be assayed for fragmentation, which appears as long smears on a gel. We have also considered approaches that utilize S1 Nuclease to convert single strand breaks into double strand breaks.
Our first set of experiments involved pairing alkaline denaturing electrophoresis with the repair enzyme T4 Pyrimidine Dimer Glycosylase (T4 PDG) to reveal whether or sequences with UV mutation sites removed actually exhibit greater resistance to damage by UV irradiation. Because our protocol was highly adapted from a number of sources, we first had to demonstrate the assay produced consistent results under controlled conditions. A large amount of troubleshooting was necessary to optimize both the conditions of enzymatic processing, alkaline denaturation of the DNA samples, running conditions for the alkaline gel, and neutralization and staining conditions for generating images.
["https://2015.igem.org/File:VU15_alkaline_uv_control.png"] ["https://2015.igem.org/File:VU15_alklaine_uv_linearity.png"]
After considerable modifications, we managed to develop a protocol that consistantly 1. Shows little or no fragmented DNA with samples that are not irradiated and not processed with enzyme 2. Has no significant difference between unirradiated and unprocessed control samples and samples that are either irradiated without enzymatic processing or enzymatically processed without irradiation 3. Is responsive to dosage, meaning samples with greater irradiation show greater fragmentation
As the images and quantification above demonstrate, our final protocol satisfies all of these conditions, thus establishing it as a effective method for quantifying UV-induced DNA damage. The linearity seen in response to increasing UV dosage further allows us to state that differences in measured fragmentations between control and mutation-optimized RFP reflect reductions in mutagenic lesions that are equivalently linear. By that logic, an optimized sequence that produces a ten percent reduction in fragmentation intensity relative to control should have ten percent fewer mutagenic lesions than the control.
During our early experiments we used a UV-mutation optimized sequence, labeled RFPyy, with 12% of its UV mutation sites eliminated. Quantification of RFPyy's rate of fragmentation relative to the E1010 RFP control sequence revealed a trending to significant decrease in UV-induced fragmentation on the mutation optimized sequence (two-sample t-test: n=7, t*two tailed=2.57, p=0.13). The results are depicted graphically as bar graphs with standard error bars, alongside dot plots of raw data. For each data point, the "background" seen in the no enzyme negative control lane was subtracted from the fragmentation smear intensity to correct for differences in amounts loaded, fragmentation of the stock sample, and other factors. Two representative alkaline gel images are also shown, in greyscale and inverted. At 1000 mJ/cm2, it was found that non-specific DNA shearing occurs at such high magnitudes that there is no longer appreciable differences between enzyme treated and untreated samples. The graph shown is for assays conducted at the 500 mJ/cm2 UV dosage.
https://2015.igem.org/File:VU15_RFPyy_alkaline_quantification.png
https://2015.igem.org/File:VU15_RFPyy_alkaline_500_1000.png
https://2015.igem.org/File:VU15_RFPyy_alkaline_500.png
Although statistical analysis of RFPyy failed to produce statistical significance according to the conventional definitions of alpha levels, the modest 12% reduction in mutation-prone sites may not be sufficient to produce significant results because of the intrinsic variability of the technique, which combines variation from irradiation, enzymatic treatment, denaturation, electrophoresis, and staining. Despite all of these factors and the variability they introduce, we nevertheless observe a strong trend that may be capable of significance with further replication.
While these assays were ongoing, the mutation optimization algorithm was being constantly improved. One large improvement that was made was incorporating more precise values for the relate mutagenicity of each type of UV hotspot relative to each other. Our updated algorithm is now capable of optimizing RFP twice as well as had been done with RFPyy. Our improved optimized RFP has a 35% reduction in UV-prone sites, which, based on how strong our results were with RFPyy, would almost certainly produce a result that is both statistically significant and of considerable magnitude.
To evaluate our technique's performance with respect to oxidation sites, we primarily used the conversion of supercoiled plasmid to nicked plasmid as our form of visualization. An alkaline gel assay was run, and produced results roughly in accordance with the data found by the plasmid confirmation method. However, the alkaline gel assay requires a minimum of ten hours to complete, and requires specialized equipment and extra care to set up.
https://2015.igem.org/File:VU15_RFP_ox_assay_validation.png
Our optimized RFP for these experiments had a 49.5% reduction in oxidation hotspots. Our repair enzyme was Formamidopyrimidine DNA glycosylase (FPG), which is specific in its activity for the 8-oxoguanine DNA lesions that those hotspots are correlated to. For oxidizing our DNA in vitro, we initially tried methylene blue. The advantage of methylene blue is that it releases oxidizing radicals in response to exposure to light, which would allow precisely timed oxidation exposures. Unfortunately, we found the presence of methylene blue interfered with electrophoresis in our samples, so we transitioned to a protocol based on the Fenton reaction, which utilizes trace amounts of iron (II) particles to catalyze the conversion of hydrogen peroxide (H2O2) into oxidizing radical species.
Just as with our highly-modified assay for detecting UV DNA damage, we had to demonstrate the same three conditions were true for the oxidation assay we developed. At a dose of 50 μM H2O2, we had very good results, showing the conversion of supercoiled plasmid to a nicked form. As with extreme amounts of UV irradiation, we found 500 μM H2O2 will produce damage severe enough to convert plasmids to their nicked form without FPG incubation.
After validating our protocol, we began experimentation on our mutation-optimized sequences. With our RFP sequences, both control and optimized, we saw a third band present that was not visible in the original validation of the assay. Based on established information on the relative motilities of relaxed, nicked, and supercoiled DNA, we theorized that the newly-appeared top band represented plasmid in a relaxed conformation. Since the RFP plasmids were extracted in parallel, but the plasmid for the initial validation was not, variation in the negative control case of no mutagen and no enzyme would be expected. In line with this assessment, the relaxed plasmids behave as expected: they exactly parallel the intensity of the supercoiled plasmid, being converted like the supercoiled plasmid is upon oxidation and enzymatic treatment.
For quantification, we quantified the ratio of the intensity of the nicked band to the supercoiled band. The relaxed band was not included for the data reported below, although because it parallels the supercoiled bands, factoring it in does not greatly alter our results. Samples with less oxidation would have a lower ratio, since they would retain more supercoiled plasmid while having less plasmid converted to nicked. The same trends are seen when quantification is instead done looking only at decreases in the intensity of the supercoiled conformation.
https://2015.igem.org/File:VU15_rfp_ox_quantification.png
https://2015.igem.org/File:VU15_RFP_ox_nick.png
Due predominantly to a low sample size, the reduction found in the RFP with minimized oxidation hotspots was not significant (two-sample t-test: n=4, t*two tailed=4.30, p=0.29). Nevertheless, there is replication of a trend, which could yield more conclusive results with more testing. As the promising results with the UV-optimized sequence demonstrates the principle behind reducing mutation with rational, algorithmic strategies, there is reason to be hopefully the same principles will apply to oxidation hotspots as well, as the previously noted trend suggests.
Given the large decrease in oxidation sites in the optimized RFP, it is still necessary to conclude that the observed improvement in sequence stability with oxidation was lower than seen in the ultraviolet experiments. One probable explanation for this is that different hotspots contribute differently to overall mutagenic potential. For oxidation hotspots, each has a certain elevated probability of undergoing mutation, but those probabilities are likely lower than the corresponding probabilities for UV dimer hotspots. The different technique used also would explain part of the discrepancy.
In total, these experimental results strongly suggest that the optimizations that we make at the sequence level to reduce the number of mutation hotspots do indeed translate to lowered rates of mutation at a biochemical level. Despite the intrinsic variation of these techniques, they were still able to consistently detect the occurrence of DNA damage, and were at least sensitive enough to produce apparent trends in mutation reduction following optimization. As these protocols were only just developed in their current form by our team, there is undoubtedly opportunities to optimize these protocols further. Given the advantages of these techniques to detect DNA damage, as previously outlined, there is both the means and the incentive to develop them further as research tools for both testing our own hypothesis about various forms of mutation-optimization as well as for other researchers more generally.
Because the current paradigm for optimizing synthetic gene sequences is codon adaptation index (CAI), which is purported to correlate to expression levels, one of our team's main priorities was demonstrating that our novel optimization method based on lowering mutation did not interfere with gene expression. While the initial form of RFP that we generated to have resistance to UV mutation was confirmed to still produce RFP, there was a several-fold decrease in expression levels relative to the CAI-optimized control. This was despite the fact that the sequence was optimized to select the codon with a higher CAI associated with it when neither of two or more codons had fewer hotspots than the other. In response to this setback, we made substantial revisions to the algorithm, this time factoring in "limiting codons" instead of standard CAI values. These limiting codons are from tRNAs of exceptionally low abundance in an organism, although varying definitions exist for how low the abundance must be for this classification.
https://2015.igem.org/File:VU15_improved_RFP_expression.png
Using this new approach based on limiting codons instead of CAI, we were able to increase the expression of our UV-optimized by 200%. Upon further research and refinement of our definition of what constitutes a limiting codon, in addition to re-introducing a CAI-like metric to the mutation optimization algorithm's decision making process, we were able to further significantly increase expression, reaching levels close to but slightly below the RFP E1010 sequence on the Registry. Through our collaboration with the iGEM team at the University of Washington St. Louis, we have achieved an independent replication of our results, under a range of IPTG inducer concentrations. https://2015.igem.org/File:VU15_WashU_RFP_IPTG_data.png
https://2015.igem.org/File:VU15_improved_expression_visual.png
These findings have vastly improved the expression of our optimized genes, and in the process has revealed some insights into the correlation of metrics like CAI and limiting codons to actual gene expression. With further refinement of our algorithm's criteria, it is reasonable to expect that we will be able to continue this ever-improving trend, and constantly produce genes with comparable expression to ones optimized by traditional methods.
Interestingly, the differences in expression between our most recent iterations of mutation-optimized genes and their traditionally optimized versions are not directly attributable to CAI. When CAI scores are calculated for mutation-optimized and traditionally optimized sequences, they rarely differ. As an example, the figure below shows a graphical representation of the CAI for comparison with our mutation-optimized sequences.
https://2015.igem.org/File:VU15_CAI_with_optimization.png
</p>