Difference between revisions of "Team:UFSCar-Brasil/part2.html"

Line 43: Line 43:
  
 
           <p>Cophenetic correlation coefficient, which brings how feaseable the dendogram is, was estimated as 0.84134 and values higher than 0.7 are considered as a good-fit. It can be seen that the combination of "all" chaperones together is farther away from the others, which was expected since its yield reached the highest value. Arrangements between "Ibp+DnaK" and "Ibp+clpB" are together on a branch, indicating that their yield values are closer than others. In addition, they are nearby branch of combination of all chaperones, indicating its yield not as high as the performance of all chaperones together, although close enough. This may indicate combinations "Ibp+DnaK" and "Ibp+clpB" as good substitutes for all three chaperones, when it would not be possible the complete arrangement.</p>
 
           <p>Cophenetic correlation coefficient, which brings how feaseable the dendogram is, was estimated as 0.84134 and values higher than 0.7 are considered as a good-fit. It can be seen that the combination of "all" chaperones together is farther away from the others, which was expected since its yield reached the highest value. Arrangements between "Ibp+DnaK" and "Ibp+clpB" are together on a branch, indicating that their yield values are closer than others. In addition, they are nearby branch of combination of all chaperones, indicating its yield not as high as the performance of all chaperones together, although close enough. This may indicate combinations "Ibp+DnaK" and "Ibp+clpB" as good substitutes for all three chaperones, when it would not be possible the complete arrangement.</p>
           <p>As possible to observe, the protein yield using chaperones arrangement are not trivial, since there will be cross interactions hard to predict its behaviour jointly taking in account the single values. </p>
+
           <p>As possible to observe, the protein yield using chaperones arrangement are not trivial, since there will be cross interactions hard to predict its behaviour jointly taking in account the single values. Para que seja possível tentar prever essa interferência entre elas, equacionamos todas as combinações possíveis de rendimento correspondente para cada combinação e multiplicamos elas por um fator matemático \(x_i\). O objetivo é interpretar os valores dessas constantes, para que fosse possível comparar a influência que cada componente de chaperona isolado influencia no conjunto das combinações. </p> <p>Onde cada índice \(x_i\) representa uma combinação das três chaperonas estudadas, assim:
        <p>Para que seja possível tentar prever essa interferência entre elas, equacionamos todas as combinações possíveis de rendimento correspondente para cada combinação e multiplicamos elas por um fator matemático \(x_i\). O objetivo é interpretar os valores dessas constantes, para que fosse possível comparar a influência que cada componente de chaperona isolado influencia no conjunto das combinações. </p>
+
        <p>Onde cada índice \(x_i\) representa uma combinação das três chaperonas estudadas, assim:
+
 
   
 
   
 
         \(x_1 \rightarrow Ibp\); \(x_2\rightarrow Clbp\);
 
         \(x_1 \rightarrow Ibp\); \(x_2\rightarrow Clbp\);

Revision as of 19:03, 16 September 2015

Protein Solubilization toolkit

Where are the singular effects?

Simulation and analysis

In this modeling step, we will develop a study about different chaperones arrangement efficiency in the proteins folding, using as example our protein of interest, Limonene Synthase. As it was not possible to obtain the complete system of this toolkit, there are no available experimental data to reinforce our findings. Nevertheless, the modeling of this section is extremely significant and could be useful to another teams with some similar problems. In this sense, we decided to model our system starting from a simulated dataset where we use fake data, and in the future, real data could simply replace them.

First of all, the experiment to measure the efficiency of each chaperone and arrangement of them was conceived in the following form:

All chaperones would be tested alone (IbpA+B, ClpB and DnaK), in pairs and all together. The proteins would be encoded in plasmid pSB1C3 under control of a constitutive promoter, allied to our gene of interest under the same promoter. Protein of interest would be produced by some hours, and the cells would be harvested and lysed by ultrasound. Finally, the cell debris would be precipitated by centrifugation and the soluble and insoluble portions would be recovered. After, a SDS-PAGE the band of interest would be quantified with optical densitometry. Chaperone efficiency coefficient would be given as the reatio between soluble to insoluble bands.

Simulated data following this procedures were provided in the table bellow. A control group without any chaperones was included to comparision, and all measurements were made in triplicates. This table shows portion of well folded proteins (soluble) and misfolded ones (insoluble).

Table 1: Simulated data using different chaperone sets and the solubility after the treatment.

Chaperones efficiency coefficient still means the proportion of correct and incorrectly folded proteins. Considering the case of 100% of solubility, this coefficient tends to infinity, since the insoluble portion tends to zero. In this sense, a case of this type would be impossible to work, to prevent this, we decided work with protein yield. Protein yield coefficient would be given as percent (0-100) and calculated as follows:

(Eq. 1) $$Y = 100*[Soluble/(Soluble+Insoluble)]$$

Table 2: Chaperone arrangements related to chaperones efficiency and protein yield coefficients. (TABELA 2)

In order to observe this large set of combinations relate to each other in terms of yield, we used a hierarchical analysis model of clustering available on a free webserver, DENDROUPGMA. It has done the calculations used in the creation of dendogram shown in Figure 1. This analysis basically shows together those data that are mathematically closer and separates those who are further away. As we can see, the values obtained with all chaperones acting together, here named "all", are more isolated from other combinations for being the most efficient treatment of the others and so it is statistically more 'distant' to the other groups. Similarly we see the double "Ibp+clpB" and "Ibp+DnaK" grouped in the same node, so we conclude that their values are closer enought to not differentiate them and therefore cannot be separated. (DENDOGRAMA)

Figure 1: Dendogram of distances using simulated dataset generating a clustering complex behaviour.

Cophenetic correlation coefficient, which brings how feaseable the dendogram is, was estimated as 0.84134 and values higher than 0.7 are considered as a good-fit. It can be seen that the combination of "all" chaperones together is farther away from the others, which was expected since its yield reached the highest value. Arrangements between "Ibp+DnaK" and "Ibp+clpB" are together on a branch, indicating that their yield values are closer than others. In addition, they are nearby branch of combination of all chaperones, indicating its yield not as high as the performance of all chaperones together, although close enough. This may indicate combinations "Ibp+DnaK" and "Ibp+clpB" as good substitutes for all three chaperones, when it would not be possible the complete arrangement.

As possible to observe, the protein yield using chaperones arrangement are not trivial, since there will be cross interactions hard to predict its behaviour jointly taking in account the single values. Para que seja possível tentar prever essa interferência entre elas, equacionamos todas as combinações possíveis de rendimento correspondente para cada combinação e multiplicamos elas por um fator matemático \(x_i\). O objetivo é interpretar os valores dessas constantes, para que fosse possível comparar a influência que cada componente de chaperona isolado influencia no conjunto das combinações.

Onde cada índice \(x_i\) representa uma combinação das três chaperonas estudadas, assim: \(x_1 \rightarrow Ibp\); \(x_2\rightarrow Clbp\); \(x_3\rightarrow Dnak\);\(x_4\rightarrow Ibp+Clbp\); \(x_5\rightarrow Ibp+Dnak\);\(x_6 \rightarrow Clbp+Dnak\); \(x_7 \rightarrow Ibp+Dnak+Clpb\).

O resultado obtido da resolução deste sistema foi :

(Exp. 1) $$x_1=(471/124)x_7-(251/248)x_6 $$
(Exp. 2) $$x_2= -(314/71)x_7 +(502/213)x_6$$
(Exp. 3) $$x_3=(471/109)x_7 - (251/218)x_6$$
(Exp. 4) $$x_4=(251/612)x_6 $$
(Exp. 5) $$x_5=(1884/503)x_7-(502/503)x_6 $$
(Exp. 6) $$x_6=x_6 $$
(Exp. 7) $$x_7=x_7 $$

É possível escrever todos os parâmetros com base em dois valores \(x_6\) e \(x_7\), ou seja, a relação entre todas as chaperonas estava vincula a duas relações específicas, que são “todas” as chaperonas juntas e a combinação “Dnak+clpb”. Assim fixando-se uma das constantes é possível estudar o comportamento das outras por meio de gráficos. (GRAFICO 1 E GRAFICO 2)

Figure 2: KKK.
Figure 3: KKK.

Pode-se perceber que não há uma única solução para nosso problema, e provavelmente mesmo com valores reais talvez não exista uma solução única, mas pelos gráficos é possível, mesmo assim, tirar algumas relações de dependência entre os coeficientes, como por exemplo quando tem-se fixo x7 a componente x2 é crescente com inclinação maior que x1, que decresce conforme se aumenta o valor de x6, isso indica que sua influencia nas combinações de chaperonas da Clpb é maior que a Ibp nestas configurações, porem esse resultado se inverte a partir do momento que aumentamos os valores relativos a x7.

Como não foi possível obtermos resultados experimentais para validar nossas discussões e prever resultados, deixamos esta modelagem como um modelo teórico que pode ser empregado e testado em futuros projetos de nossa ou outra equipe iGEM que queira emprega-la.

Referencias

S. Garcia-Vallve, J. Palau and A. Romeu (1999) Horizontal gene transfer in glycosyl hydrolases inferred from codon usage in Escherichia coli and Bacillus subtilis. Molecular Biology and Evololution 16(9):1125-1134.

Our amazing sponsors!