Methodology Cell-Free Expression

Why Do We Use Cell-Free Expression?

Cell-based protein expression is a well established method to obtain large amounts of a target protein. It enables accumulation and purification of the protein in quantities sufficient for various in vitro applications. Nevertheless, it is a tedious task to generate all the genetically modified organisms if many different proteins need to be expressed. Additionally, purification cannot be performed with a generalized protocol, but usually requires separate optimization for each protein.
Cell-free expression offers a possibility to overcome several challenges of conventional protein expression and has many advantages, in particular for our project.¹⁾
Generally speaking, it saves a lot of time and money to avoid the generation of genetically modified organisms for every protein. Suitable DNA sequences are only constructed once, for example by custom synthesis by a company, and can be stored until needed. At the purification step, cell-free expression avoids the need for cell lysis and therefore circumvents this harsh procedure, thus preserving the integrity of the protein. In cell-based expression, too strong induction often results in aggregation of protein, rendering it non-functional. This risk is minimized by using cell-free expression, since the expressed protein is dispersed in a far larger volume than the intracellular space.²⁾
For translating DNA templates into protein microarrays in a microfluidic set-up, cell-free expression is the method of choice. This system is capable of expressing many different sequences at once. Additionally, the microfluidic setup provides the opportunity to constantly supplement the expression. Replacing depleted components, like dNTPs, amino acids or energy sources like creatine phosphate, allows higher protein yields.

Basics of Cell-Free Expression

Figure 1: Basic Components of a Cell-Free Expression Mix A standard cell-free expression systems consist of a DNA template, the cell extract and additives containing all the components needed for energy regeneration. buffering and generation of the protein.

Two basic components are needed to conduct in vitro protein expression:

the genetic template (mRNA or DNA) encoding the target protein
a reaction solution containing the necessary transcriptional and translational molecular machinery

Cell extracts supply the reaction with most of the molecules, including:

RNA polymerases for mRNA transcription
ribosomes for polypeptide translation
tRNA and amino acids
enzymatic cofactors and an energy source
cellular components essential for proper protein folding

Regular cell lysate already contains most of the components needed for cell-free expression. The machinery that usually conducts the translation and transcription of various proteins in the organism can be "reprogrammed" to produce only the protein of choice. Our building blocks, the amino acids and NTPs are already present but are also added in an increased amount to raise the efficiency of our system. As the normal energy regeneration system is missing in a cell lysate an artifical one is added. Here for example creatine phosphokinase is used. The other components present in an cell-free expression mix buffer the sensitive system and imitate a cytosolic environment.

Figure 1: Prokaryotic vs eukaryotic cell-free expression text. This is a test if more text will eventually make this text box wider and the page look better.

Besides bacterial based systems, there are several other options with specific advantages and disadvantages. An E. coli system convinces with easyness, cost-efficiency and high yield. However, for the production of complex eukaryotic proteins, it might not be the best option. In contrast, eucaryotic cell-free lysates provide post-translational modifications such as core glycosylation or phosphorylation, and natural membrane components (e.g. microsomes) for membrane protein insertion. These systems are based on wheat germ embryos, rabbit reticulozytes, insect cells and various other animal and human cell lines.

Figure 1: Batch vs Dialysis mode text. This is a test if more text will eventually make this text box wider and the page look better.

There are two ways of performing a cell-free reaction. In batch reactions, transcription and translation are carried out in a reaction vessel containing all necessary components. Due to different reasons like fast depletion of energy supply, degradation of components like nucleotides and decreasing concentrations of free Mg²⁺-ions, the reaction in the batch system usually reaches a plateau after about 1-2 hours.
In a dialysis mode, the cell-free transcription and translation reaction is carried out in a small reaction chamber that is seperated by a dialysis membrane from a 10-20x larger reservoir containing low molecular weight reagents. The reservoir supplies the reaction chamber with ions, energy substrates, nucleotides and amino acids. In turn, the low molecular weight by-products are efficiently diluted via the membrane from the reaction chamber into the reservoir. This setup will drive the reaction longer (usually 20-24 hrs) and results in higher levels of produced protein.

A further aspect that has to be discussed is the design of the DNA template.
In general plasmids, linear templates from for example PCR production or RNA can be used. The structural design of the DNA or RNA is in every case of great importance for yield and quality of produced protein. In bacterial 5′-UTRs such optimization can be achieved by placing the Shine-Dalgarno (SD) sequence at an optimal distance, typically eight nucleotides, from the AUG start codon. The SD sequence interacts with the 3′-end of 16S ribosomal RNA (rRNA) that is part of the small ribosomal (30S) subunit, to allow efficient initiation. Furthermore, the integration of an A/U-rich enhancer sequence further upstream allows the SD sequence to interact more effectively with the rRNA. Another attempt in this context is the use of different tags to enhance expression as reported by Haberstock et al. (2012)³⁾. In eukaryotes, the Internal Ribosome Entry Site (IRES), is a highly structured element found within viral mRNA that is able to induce eukaryotic initiation. This solves the problem of ineffective capping, which presents a major restriction of high production. For example, the IRES of the Cricket Paralysis Virus (CrPV) showed the ability to increase expression substantially across several species. ⁴⁾

Our DiaMIX

We developed our own low-budget protocol for expressing proteins in a prokaryotic cell-free expression system based on an Escherichia coli lysate. Most of the established cell-free expression systems are based on rabbit reticulocytes, wheat germ cells, insect cells or E. coli cells. Due to the prokaryotic origin of most of the proteins expressed during our project, we decided to develop our own low-budget prokaryotic cell-free expression system, the DiaMIX, based on an E. coli cell culture.

Figure 3: Immobilization of DNA PCR product coding for GFP with a Cy3 and an amino label was spotted onto the activated PDMS slide. Binding of the amino group to the specific surface allows direct DNA immobilization.

To produce the DiaMIX we used the E. coli strain BL21, as it is carrying a gene coding for the T7 RNA polymerase, which facilitates an improved expression of proteins during the cell-free expression. The cells were induced with IPTG prior to further treatment to initialize T7 polymerase expression, therefore allowing a high rate of protein synthesis. Since all the coding sequences used in our project were manufactured synthetically, the E. coli BL21 cells were additionally transformed with a plasmid (pRARE2) LINK Plasmids containing the coding sequence of tRNAs for the translation of rare codons (AGA, AGG; AUA, CUA, GGA, CCC and CGG). According to an optimized protocol adapted from the European Molecular Biology Lab (EMBL) Heidelberg, the lysate was prepared from E. coli BL21 cells, frozen in liquid nitrogen and afterwards stored at -80°C. This lysate contains the whole transcription and translation apparatus of the cell, allowing a successful expression of proteins from DNA templates.

However, the E. coli lysate is just one of the main components of the DiaMIX. To finally be able to express proteins, the system needs additional buffers, amino acids, nucleotides and energy sources. We obtained an L-amino acid kit from Sigma-Aldrich and prepared an accurate mix ourselves. The energy generating system based on creatine phosphate, phosphokinases and buffers as well as the mix containing nucleotides, amino acids and buffers (see protocol) were prepared and added to the lysate directly before expression.

After adding all three components the complete cell-free expression mix was subsequently used to express proteins. Therefore, the reaction was started by adding DNA coding for the respective proteins. To verify the functionality of our system we compared it to a commercially available kit.

Verification of Protein Expression

To validate the expression of correctly folded proteins applying the DiaMIX we used various methods like fluorescence measurements, luciferase assays and Western Blots. One of our first approaches was the expression of GFP, which can be detected via its fluorescence. For expression of GFP we used a plasmid template (LINK) that was added to the self-prepared expression mix. The expression was then visualized via a microplate reader measurement. Measurement of the increase in emitted light at a wavelength of 520 nm monitored the progression of expression. To verify the obtained data of this measurement a Western Blot was additionally performed. We used GFP antibodies to show the expression of GFP in our mix.
For further results of this experiment, have a look here.
As a second test of our system we performed a luciferase assay. In contrast to the expression of GFP which can be followed by a gradually rising of GFP over time, the luciferase assay is based on the emittance of light when luciferin is conversed into oxyluciferin. This reaction is catalyzed by the enzyme luciferase and needs to be induced with luciferin as substrate. Therefore it does not allow a tracking of expression over time. Moreover it is not dependent on correct folding and offers detection within seconds. We used the coding sequence of the firefly luciferase (pBESTluc^TM) to express the protein with our cell-free expression system. The substrate luciferin was added to the mix after expression was finished. Then, the luminescence was measured at once using a microplate reader.
For more information about the outcome of this analysis, check out our results. This analysis helped us to evaluate the concentration of expressed protein in the cell-free expression system.

Figure 2: Luciferase reaction. The enzyme luciferase catalyzes the reaction of D-luciferin to D-luciferyl using ATP as an energy source. D-luciferyl is instantly converted into oxyluciferin, whereby energy is emitted as light. The intensity of bioluminescence is thus proportional to the amount of expressed luciferase.

Step by Step Validation

Since we aim to use cell-free expression in the DiaCHIP, we decided to validate the process in single steps. Starting with expression in a tube, we spotted readily expressed proteins on iRIf slides to check, whether we expressed detectable antigens that interacted with our antibodies at hand. As a next step, we spotted our expression system on the slide without starting the reaction beforehand. By letting it incubate on a specific surface we could check for further obstacles. Lastly, we performed a cell-free expression from immobilized DNA in the finished setup of glass and PDMS slide.

Figure 3: Immobilization of DNA PCR product coding for GFP with a Cy3 and an amino label was spotted onto the activated PDMS slide. Binding of the amino group to the specific surface allows direct DNA immobilization.

Immobilizing DNA on PDMS

One of the core components of the DiaCHIP is the copying mechanism that allows production of a protein microarray from a DNA template. In summary, we established a cell-free expression system that can be used to translate proteins from DNA immobilized on a PDMS slide.
The first step needed for copying DNA arrays is to genetically fuse antigen coding sequences to a His tag that is used for surface immobilization later on. The whole expression cassette including promoter and terminator regions is amplified by PCR using an amino-labeled reverse primer. Via this amino group, the DNA is immobilized on an activated PDMS surface. The forward primer used for this PCR is labeled with Cy3.

Figure 4: Spotting of expressed GFP Cellfree expressed GFP-Lysate was spotted on the activated glass slide after expression was performed for 2 hours at 37°C. For detailed reaction check our labjournal.

Spotting of Expressed GFP on Slide

To test, whether our cell-free expression system reaches protein concentration high enough to be detected with iRIf, we spotted cell-free expressed GFP on iRIf slides. We then measured the binding of anti-GFP to the cell-free expressed GFP, to gather information about the amount of expressed protein and also about its folding status.

Figure 5: On-slide expression of GFP Cell-free expression mix was spotted on the activated glass slide and expression was performed. For detailed reaction check our labjournal.

On-slide Expression of GFP

To estimate the specificity of our surface we evaluated the proportion of cell-free expressed GFP that bound to the surface. To investigate how many non-target proteins bound unspecifical to the surface, we measured the slide with iRIf using anti-GFP antibodies. Unspecific binding of proteins present in the DiaMIX would lead to decrease in signal, whereas a highly specific surface will only bind GFP and thus show a strong signal for the GFP spot.

Figure 6: In-chamber expression of GFP Cell-free expression mix was pipetted into the flowchamber with immobilied DNA on the silicone slide and an activated glass slide. Then the expression was performed. For detailed reaction check our labjournal.

In-chamber Expression of GFP

This step would show if our entire system, from cell-free expression to detection of the expressed proteins with antibodies actually works. Expression of the antigens inside the flowchamber with our DiaMIX and subsequent diffusion of the His-tagged proteins through the chamber will result in a distinct spot on the glass slide. When cell-free expression is completed, the bound antigens would form an antigen array that can be used for detection of antigen-antibody binding events.

Cell-free Expression of Antigens

Figure 6: Tetanus Antigen.

Our biggest goal was to express antigens with the required tags in a concentration and folding state that suffices for an iRIf measurement. Here we focused on a C. tetani epitope flagged by a double His and a Spy-tag. For cloning of this construct we used our own cloning site and a vector that we optimized for cell-free expression.
The antigen was expressed in a tube reaction and afterwards spotted onto a Ni-NTA surface.

Testing Different Conditions

To identify the optimal DNA concentration that results in a maximum protein expression, we testet different conditions. For this experiment, we used DNA coding for the enzyme luciferase. The protein concentration could therefore be measured at once using the luciferase assay.
Another component whose concentration is crucial for optimal cell-free expression, is magnesium. To evaluate the effects of different magnesium concentrations we started multiple cell-free expression mixes using different magnesium starting concentrations. Magnesium was added to the mix in the form of magnesium acetate.
After expression, a luciferase assay could visualize the protein concentration in the different reaction mixes.

info