Team:GeorgiaTech/Background

Background

Click Chemistry

Nobel Laureate Barry Sharpless coined the term “click chemistry” in 1998 to describe a class of chemical reactions that revolutionized the field of organic chemistry by placing heavy emphasis on reactions that are “modular, wide in scope, give very high yields, generate only inoffensive by-products that can be removed by nonchromatographic methods, and be stereospecific (but not necessarily enantioselective)” (Sharpless, Finn, Kolb, 2001). Many click reactions are also insensitive to oxygen and water and require only readily available reagents and solvents. This new approach to organic synthesis presents opportunities for the advancement of materials science, drug design, bioconjugation.

Figure 1. Copper-catalyzed azide-alkyne cycloaddition reaction. The colored components indicate the corresponding atoms and bonds through the transformation from reactants to products. A terminal alkyne in the presence of Cu will react with an azide to form a stable triazole product.

The copper-catalyzed azide-alkyne cycloaddition (CuAAC) reaction (Figure 1) is the ideal example of a click reaction, having all of the desirable characteristics of click chemistry. The CuAAC reaction is modular because the azide and alkyne groups easily attach to a wide variety of organic R groups, the “modules”, that need to be joined. This reaction has the power to simplify the synthesis of a diverse array of complex compounds. Furthermore, the azide and alkyne are unobtrusive and do not affect the function of their R1 and R2 groups. This reaction also exhibits the rare combination of producing a thermodynamic product but at an extremely slow reaction rate that can be dramatically accelerated with a catalyst. The azide and alkyne carry a lot a of potential energy and will reliably proceed towards the extremely stable heteroatom triazole ring, but this requires heat, time, and high concentration. When copper ions are present, however, the reaction is nearly immediate (Sharpless, Finn, Kolb, 2001). The CuAAC reaction is also bioorthogonal, meaning that the reaction does not interfere with any normally occurring biological reactions. For these reasons, the CuAAC reaction has a high potential for applications in the delivery of drugs with few side effects. One possible application is a “pro drug” delivery method where a pharmaceutical would be delivered in two separate inactive “halves” or precursors that could be joined by an azide and alkyne at a desired location to form an active drug without any negative side-effects.

Before the CuAAC reaction can be implemented in vivo, we must address the scarcity of free copper(I) ions in cells. Even though copper(I) is a requirement for living systems, copper is cytotoxic at the concentrations required for CuAAC because it reacts with oxygen to form peroxide and superoxide radicals (Pham, et al., 2013) that are prone to initiate radical chain reactions that propagate and destroy proteins and cell membranes. As a result, free copper ions are only present at sub-picomolar concentrations in cells, and most of a cell’s copper is sequestered by the copper-binding proteins that need copper in order to perform their respective functions (Dean, Qin, Palmer, 2012). Although copper is not readily available to catalyze the CuAAC reaction within cells, we hypothesize that an “unnatural” enzyme, derived from a copper-binding protein, can be evolved through phage display.

Phage Biology

A bacteriophage is a type of virus that infects bacteria, and bacteriophages are composed of the genetic material that encodes for the specific proteins that encapsulate the phage. Specifically, the M13 filamentous phage particles that we will use consist of a 6400-nucleotide circular ssDNA plasmid with “a dozen or fewer closely packed genes and an intergenic (IG) region that contains sequences necessary for DNA replication and encapsidation.” (Russel, Lowman, Clackson, 2004). This genetic material is wrapped in five types of coat proteins: pVII, pIX, pVIII, pIII, and pVI and the body of the phage is extremely small, about 6.5 nm wide and about 900 nm tall.

The phage reproductive cycle includes infection of an E. coli host cell, replication of the phage’s DNA, assembly of the progeny, and lysogenic secretion from the E. coli. First, the pIII coat protein binds to the pili of an E. coli, and Tol proteins Q, R, and A of the E. coli depolymerize the phage coat proteins, allowing translocation of the viral ssDNA into the bacterial cytoplasm (Russel, Lowman, Clackson, 2004). Once inside the E. coli host, the ssDNA is converted into dsDNA by host polymerases. Initially, only dsDNA is generated, but as more of the pV protein is synthesized, it becomes available to bind to newly made single strands and blocks host polymerase from assembling complementary base pairs. After all of the genes are expressed, the phage progeny exit the E. coli lysogenically.

Phage Display

Phage display is a well-established technique that uses bacteriophages to study protein-protein and protein-ligand interactions. “Foreign” genetic code is fused to the bacteriophage genome, causing the bacteriophage to display the corresponding “unnatural” peptides or proteins on its coat.

The pIII coat protein is the most commonly used protein for phage display and can be performed with commercially available kits. One important advantage of pIII phage display is its compatibility with a variety of proteins of different sizes. In addition, pIII protein is relatively easy to modify.

Figure 2. Phage display. Phages with varying genetic sequences express varying proteins with differing binding affinities to the substrate. The non-binders are washed, and the binders are subjected to additional cycles of this process and ultimately analyzed (sequenced). Adapted from a figure created by Dr. Graham Beards.

Phage populations can be encoded with entire libraries of varying mutations of a genetic sequence, resulting in the display of a diverse protein collection that can be screened for binding (Figure 2). Non-binders are washed away while binders are eluted, and the selected variants can be further mutated. This cycle is repeated until the library is enriched with only the strongest binders. In the end, all the proteins that exhibit desired binding properties can be efficiently identified and sequenced, making phage display a powerful tool for a wide array of evolutionarily driven protein-discovery research efforts.

In addition to binding-based screening, phage display can also be used for activity-based screening. For example, our project will require a given displayed protein to successfully facilitate the CuAAC reaction if it is to avoid being washed away. More specifically, we hope to use the in vitro selection capabilities presented by phage display to artificially evolve naturally occurring copper-binding proteins into unnatural enzymes that can aid the CuAAC reaction in acquiring copper for catalysis in copper-deficient environments.

Starting Proteins

Our search for an unnatural enzyme began with research on natural copper-binding proteins. Our selection process demanded proteins that were small (less than 80 kDa) in order to be successfully displayed on the pIII coat protein (Russel, Lowman, Clackson, 2004). Additionally, these proteins had to have the ability to bind to either one or two copper(I) ions in order to satisfy the requirements of the CuAAC reaction (Fokin, Worrell, Malik, 2013). We selected the following six proteins: Atx1, CusF, plastocyanin, tyrosinase, Ace1, and the N-terminus domain of MNK. We also worked with three additional tyrosinase homologues which were 80%, 70%, and 70% homologous to wild-type tyrosinase. These homologous genes were necessary for DNA shuffling, as explained below. We hypothesized that using these candidate proteins’ copper binding geometry combined with further CuAAC reaction-specific optimization would allow us to more efficiently search the “protein universe” and generate more diverse libraries; therefore, we would increase our chances of discovering a functional “clickase”.

Protein Library Generation

A fundamental component of our evolution-directed experiment is to generate a library of mutated proteins. After selecting six starting proteins, our next task would be to generate a large number of derivatives with similar but different properties. An ideal mutant would retain the high binding affinity for copper while also being able to exchange copper with an alkyne to trigger the CuAAC reaction. We used two different techniques to generate used our library of mutants: error-prone polymerase chain reaction (PCR) and DNA shuffling.

Natural condition PCR has three main steps: denaturation, annealing, and extension. During denaturation, the reaction mixture (containing DNA, primer, dNTPs, buffer, and DNA polymerase) is heated, allowing the double-stranded DNA to “melt” into single-stranded DNA. Specialized primers anneal to specific sites (usually the beginning and end of the target sequence) on the single stranded “template” DNA in accordance with complementary base pairing. Then Taq polymerase (a heat-resistant DNA polymerase) begins assembling base pairs along the entire template DNA strand, extending it until a complete dsDNA molecule is formed. After the 3 steps are complete, the cycle repeats again to duplicate the two parent DNA strands. Each cycle theoretically doubles the amount of DNA present; PCR generally exhibits exponential growth patterns, allowing for the rapid generation of large amounts of DNA replicates.

Figure 3. Error-prone PCR. Modifications to the standard PCR conditions are required to achieve different error rates of the PCR.

Error-prone PCR (Figure 3) is essentially the same as canonical PCR, but with different reaction conditions, including varied ratios of available dNTPs, elevated concentrations of already-present magnesium chloride, and addition of manganese chloride. In small amounts, magnesium chloride acts as a cofactor for DNA polymerase, but in large amounts, magnesium chloride will stabilize non-complementary base pairing. Manganese chloride also stabilizes non-complementary base pairing (Cirino, Mayer, Umeno 2003).

Figure 4. DNA Shuffling. This technique involves digestion and reassembly of homologous genes to form unnatural chimeric genes with possible new activities.

In addition to EP-PCR, DNA shuffling can also be used to generate a diverse protein library. DNA shuffling is another widely used molecular evolution technique developed by Willem Stemmer in 1994. The protocol calls for a set of naturally occurring homologous genes that code for the proteins of interest (Stemmer, 1994).

Figure 4 illustrates the general process of DNA shuffling. The genes are digested by DNase1, and the resulting fragments are sorted by gel electrophoresis. Fragments of the desired length are selected for subsequent purification. The purified fragments are subjected to a “PCR-like” reaction in the absence of primers (Stemmer, 1994). Instead homologous fragments to anneal according to complementary base pairing, resulting in the formation of chimeric gene sequences. Starting with naturally occurring homologous sequences leverages billions of years of evolution to combine the most useful mutations from individual genes.

While proteins with homologous sequences can be directly used in DNA shuffling, when homologous gene sequences are not readily available, a protein library can still be built by using a combination of EP-PCR and DNA shuffling. These genes can first be subjected to EP-PCR to generate sequence diversity prior to DNA shuffling.

Bibliography

Cirino, P. C.; Mayer, K. M.; Umeno, D. Methods in Molecular Biology 2003 231: 3-9
Clackson, T.; Lowman, H. B. Phage Display: A Practical Approach; Oxford University Press: New York, 2004.
Dean, K. M.; Qin, Y.; Palmer, A. E. Biochimica et Biophysica Acta 2012 1823(9), 1406–1415.
Fokin, V.V.; Worrell, B. T.; Malik, J. A. Science 2013 340(6131), 457-460
Kolb, H. C.; Finn, M. G.; Sharpless, K. B. Angew. Chem. Int. Ed. 2001, 40: 2004–2021.
Pham, A. N.; Xing, G.; Miller, C. J.; Waite, T. D. Journal of Catalysis 2013 301: 54-64
Stemmer, W.C. Letters to Nature 1994 370