Team:UESTC Software/Description

Description

Description

Photo From © PEXELS

Background

It is well-known that microbial cell factories are able to produce many valuable products such as biofuels, new drugs for treatment of diseases and industry enzymes. In order to improve the efficiency of cell factories, we use synthetic biology to design the minimal cell[1]. The lower left figure is extracted from the web page[2].

It is widely accepted that building a minimal artificial cell serves as a critical role in synthetic biology. A synthetic minimal cell will supply a proper chassis to integrate functional synthetic parts, devices and systems with functions which can meet different purposes and requirements[3]. More and more researchers have taken the design and construction of a functional minimal genome into great consideration when assembly such a minimal artificial cell or chassis, since all the cell functions can be dates back to genome. For the minimal genome, it represents the infrastructure of a minimal cell and could be taken as the foundation for synthetic biology[4].It is well-known that minimal genome can be used for chassis cell while applying to the industrial manufacture and synthetic biology.The top right figure is designed by us.

The works for evaluating the size and content of minimal genome have been carried out through comparative and experimental approaches. Mushegian and Koonin[5] pioneered the work of determining a minimal gene set by comparative genomic approach. They derived such a repertoire for the first minimal gene set with 256 conserved genes based on the first two completely sequenced small bacterial genomes, Mycoplasma genitalium and Haemophilus influenza, which belong to two ancient bacterial lineages, i.e., Gram-positive and Gram-negative bacteria, respectively. Later, based on comprehensively comparing the published data from comparative genomics and experiments, Gil and his colleagues[6] determined the core of a minimal bacterial gene set with 206 genes, and reconstructed a minimal metabolic machinery necessary to sustain life. In practice, the comparative-genomic approach to minimal gene set derivation is based on the key evolutionary notion of orthology, and the resulting sets of genes should approximate those of ancestral life-forms [7].

It is predictable that with the increasing number of chromosomes from different organisms, the number of orthologous genes would descend. Analysis of about 100 genomes shows that only 63 genes are ubiquitous. That may be caused by the condition, when the genes which perform the same function in two species are non-orthologous, the essential gene will be missed[8]. However, the researchers had to adopt the “ubiquitous” strategy just because at that time only a few bacterial genomes had been completely sequenced.

Overview

In recent years, dozens of bacteria have obtained genome scale essentiality data with experimental validations and they can provide us the feasibility to develop a novel strategy of minimal gene set extraction from the experimental essentiality data. Following the comparative genomic approach, we developed with our instructor’s group the “half-retaining”, by comparing the experimentally-determined essential genes among various reference species. This strategy[9] retains the essential genes persisting in more than half of the reference genomes to obtain conserved essential genes. Conventional methods[5] which retain genes persistent in all reference species as a minimal gene set suffer from the disadvantage that the number of orthologous genes would drop off with the increase of the number of chromosomes from different organisms. In contrast, we overcome this issue in principle and results in a robust gene set, only if the reference species are randomly selected and cover all the primary phylogenetic lineages. In addition, our source data is derived from the initial set of experimentally-determined essential genes, which ensures our results are not redundant. Hence, our results represent that the size and content of minimal gene set are stable and convergent.

Virtues

To construct an updated minimal gene set of bacteria, we renovate systematically the conventional process in the following two points. Firstly, it starts from the experimentally-determined essential genes and thus the result is more reliable and accurate. Secondly, we developed a new strategy called half-retaining to identify essential genes conserved among over half of the reference species, which made the obtained minimal gene set more robust.

Innovation

For innovation, our strategy, half-retaining, has blazed a new trail in three aspects. Based on the increasingly accumulated number of essential gene experimentally determined by genome scale gene in-activation way, we together with instructors’ groups proposed our strategy, half-retaining, to solve this problem[9]. Primarily, we combined comparative genomics approach with data (essential genes) from experiments (gene inactivation) instead of using theoretical method alone to construct a more complete minimal gene set than previous. In the second place, the minimal gene set that we defined would be confirmed according to users' requirements and purpose, rather than the only gene set in the previous. At last, the ubiquitous genes of minimal gene set determined by Koonin suffers from the flaw that enlarging the number of reference species would reduce the amount of genes retained in the minimal gene set. However, our half-retaining strategy overcomes this problem and results in a robust gene set. The number of selected genes is tending towards stability with the increase reference species by our strategy.

Reference

[1] Foley, PL and Shuler, ML. Considerations for the design and construction of a synthetic platform cell for biotechnological applications. Biotechnol Bioeng, 2010, 105(1), 26-36.

[2] http://teacherpages.hallco.org/webpages/trennspies/index.cfm?subpage=1247337.

[3] Juhas M. On the road to synthetic life: the minimal cell and genome-scale engineering. Crit Rev Biotechnol, 2015, July 7. [Epub head of print]

[4] Juhas M, Eberl L, Church GM. Essential genes as antimicrobial targets and cornerstones of synthetic biology. Trends Biotechnol, 2012, 30(11)601-7.

[5] Mushegian AR, Koonin EV. A minimal gene set for cellular life derived by comparison of complete bacterial genomes. Proc Natl Acad Sci USA, 1996, 93(19):10268-73.

[6] Gil R, Silva FJ, Pereto J, Moya A. Determination of the core of a minimal bacterial gene set. Microbiol Mol Biol Rev, 2004, 68(3): 518-37.

[7] Koonin EV. Comparative genomics, minimal gene sets and the last universal common ancestor. Nat Rev Microbiol, 2003, 1(2):127-36.

[8] Zhang LY, Chang SH, Wang J. How to make a minimal genome for synthetic minimal cell. Protein Cell. 2010, 1(5):427-34

[9] Ye YN, Ma BG, Dong C, Zhang H, Chen LL and Guo FB. A novel bacterial minimal gene set and neo-construction of a general minimal metabolic network. Scientific Reports, under review.(From instructors' group)