Difference between revisions of "Team:Minnesota/Web Scrape"

(Created page with "<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en" dir="ltr"> <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> <meta name="gen...")
 
Line 771: Line 771:
  
  
<h1>Attributions</h1><br>
 
  
<b><font size="4"> Insulin: Basic Background </font></b><br>
+
<h1> Biotechnology and the Web </h1><br>
 +
 
 +
<i> &nbsp; &nbsp; &nbsp; &nbsp;  The advent of the internet has conceived a common grounds for the public to rapidly generate and spread their ideas. Although this has undoubtedly shaped and improved our lives, patterns of misinformation spread in these channels have presented serious obstacles to the advancement to biotechnology in societal applications. To address this, we have taken prelimary steps into developing a module power by Google that has the ability to probe the web on both content and temporal ranges to give research a reference for public outlook. </i><br><br>
 +
 
 +
<b><font size="4"><center> The Rising Coalition </font></b><br></center>
 
<br>
 
<br>
 +
&nbsp; &nbsp; &nbsp; &nbsp; The past few years has brought biotechnology to the forefront of science discussion in the public. Vaccination and genetically modified organisms are the greatest examples of this debate and how the public can follow emotional driven arguments above scientific reason. How can we study these behaviors? <br><br>
  
<p>
+
&nbsp; &nbsp; &nbsp; &nbsp; "Web scraping" is a term used in computer science to describe the process of extracting data and information from websites in a highly automated manner. With the effectively endless supplies of opinions, data, and articles on the web, we can clean up text from these websites and computationally sweep them for both objective and emotional content. <br><br>
<i><u><font size="3">• </font>What is the role of Insulin in the Human body?</u></i>
+
 
 +
<img src="https://static.igem.org/mediawiki/2015/a/a9/Registry_1.png" width=100% height=75% align="middle">
 
<br>
 
<br>
&nbsp; &nbsp; &nbsp; Insulin is a hormone produced by the pancreas that removes glucose from the blood. In healthy individuals, excess glucose is readily removed from the blood stream by a proportional production of insulin. In persons with diabetes mellitus however; the body is either resistant to insulin, or it has a reduced capacity to produce insulin. Those individuals require an external source of insulin.
 
<br><br>
 
  
 +
<b><font size="4"><center>Today's Registry</font></b><br></center>
 +
 +
&nbsp; &nbsp; &nbsp; &nbsp; Maybe there still is value in the registry, and maybe it does save groups synthesis costs and time. After all, current saves researchers $325 on average a day. But when we return to this conversation in 5 years and the savings is only about $80 a day? The National Science Foundation (NSF, $37 million over 10 years), and Defense Advanced Research Projects Agency (DARPA, undisclosed grant), and National Institutes of Health (NIH, undisclosed grant) funnel hefty grants as well as the International Genetically Engineered Machines (iGEM) competition to develop and contribute to the physical stock of parts. Should we be investing in this type of technology? <br><br>
 +
 +
&nbsp; &nbsp; &nbsp; &nbsp; In addition to the physical database, we were interested in the meta-analysis of biobricks. Are parts highly used in other projects? What do the usage statistics look like? We contacted the iGEM staff to determine whether they would release the complete list of Biobricks with usage data and our request was denied. <br><br>
 +
 +
&nbsp; &nbsp; &nbsp; &nbsp;But we had computers and we had website scraping tools, so we scraped all  parts web pages for every iGEM team part from 2005 to 2014 for their usage information. Processing this information, we can get a snapshot of what is happening with the all the information this organization has accumulated. Biobricks are unconnected to any other Biobrick 72% the time, and 92% are used less than once a year. 61% of part usage is within the same team in the same year. Despite these trends, the superb parts had an entirely different story to tell. <br><br>
  
<i><u><font size="3">• </font>Why are we expressing human Insulin?</u></i>
 
 
<br>
 
<br>
&nbsp; &nbsp; &nbsp; The ability to produce recombinant human Insulin cheaply has long been a lucrative goal. There are millions of people worldwide who are dependent on Insulin derived from production methods that make the product expensive -and further yet- potentially dangerous.Our team thinks that the current production methods for human Insulin are inefficient and can be optimized by being expressed in Pichia pastoris.  
+
<img src="https://static.igem.org/mediawiki/2015/4/45/Registry_2.png" width=60% height=100% align="middle">
<br><br>
+
<br>
  
 +
<b><font size="4"><center> The Best of the Registry </font></b><br></center>
 +
<br>
 +
 +
&nbsp; &nbsp; &nbsp; &nbsp;To the (right), the top 1.3% of Biobricks by usage were mapped to all Biobricks they were used in and processed in Gephi as a network of information. The resulting image is the marvel of what the Registry of Standard Biological Parts has achieved. With each point representing a Biobrick, each color representing a year of competition, you can see the interconnectivity of iGEM at its best. Teams across the world and years building from others work and linking their work into this network of information.<br><br>
 +
 +
 +
&nbsp; &nbsp; &nbsp; &nbsp;And this is what we believe the best direction is for the registry and iGEM as a whole. There has been this emphasis on forcing projects into this physical standard of biobricks and punished teams for omitting submission of a registry part. These leaves massive portions of the registry dark, untouched. The registry was a revolutionary idea initially, but the registry must evolve to coexist with the modern scientific environment. Drop the physical system and commit the registry to be a haven of synthetic biology research focusing on the scientific network, not the part number.<br><br>
  
<img src="#" width=96% height=72%>
 
 
<br>
 
<br>
 +
<img src="https://static.igem.org/mediawiki/2015/1/17/Registry_3.png" width=70% height=100% align="middle">
 +
<br>
 +
  
 
<!---END-->
 
<!---END-->

Revision as of 19:32, 18 September 2015

Team:Minnesota/Project/Insulin - 2015.igem.org

 

Team:Minnesota/Project/Insulin

From 2015.igem.org

Team:Minnesota - Main Style Template Team:Minnesota - Template

Biotechnology and the Web


        The advent of the internet has conceived a common grounds for the public to rapidly generate and spread their ideas. Although this has undoubtedly shaped and improved our lives, patterns of misinformation spread in these channels have presented serious obstacles to the advancement to biotechnology in societal applications. To address this, we have taken prelimary steps into developing a module power by Google that has the ability to probe the web on both content and temporal ranges to give research a reference for public outlook.

The Rising Coalition

        The past few years has brought biotechnology to the forefront of science discussion in the public. Vaccination and genetically modified organisms are the greatest examples of this debate and how the public can follow emotional driven arguments above scientific reason. How can we study these behaviors?

        "Web scraping" is a term used in computer science to describe the process of extracting data and information from websites in a highly automated manner. With the effectively endless supplies of opinions, data, and articles on the web, we can clean up text from these websites and computationally sweep them for both objective and emotional content.


Today's Registry
        Maybe there still is value in the registry, and maybe it does save groups synthesis costs and time. After all, current saves researchers $325 on average a day. But when we return to this conversation in 5 years and the savings is only about $80 a day? The National Science Foundation (NSF, $37 million over 10 years), and Defense Advanced Research Projects Agency (DARPA, undisclosed grant), and National Institutes of Health (NIH, undisclosed grant) funnel hefty grants as well as the International Genetically Engineered Machines (iGEM) competition to develop and contribute to the physical stock of parts. Should we be investing in this type of technology?

        In addition to the physical database, we were interested in the meta-analysis of biobricks. Are parts highly used in other projects? What do the usage statistics look like? We contacted the iGEM staff to determine whether they would release the complete list of Biobricks with usage data and our request was denied.

       But we had computers and we had website scraping tools, so we scraped all parts web pages for every iGEM team part from 2005 to 2014 for their usage information. Processing this information, we can get a snapshot of what is happening with the all the information this organization has accumulated. Biobricks are unconnected to any other Biobrick 72% the time, and 92% are used less than once a year. 61% of part usage is within the same team in the same year. Despite these trends, the superb parts had an entirely different story to tell.



The Best of the Registry

       To the (right), the top 1.3% of Biobricks by usage were mapped to all Biobricks they were used in and processed in Gephi as a network of information. The resulting image is the marvel of what the Registry of Standard Biological Parts has achieved. With each point representing a Biobrick, each color representing a year of competition, you can see the interconnectivity of iGEM at its best. Teams across the world and years building from others work and linking their work into this network of information.

       And this is what we believe the best direction is for the registry and iGEM as a whole. There has been this emphasis on forcing projects into this physical standard of biobricks and punished teams for omitting submission of a registry part. These leaves massive portions of the registry dark, untouched. The registry was a revolutionary idea initially, but the registry must evolve to coexist with the modern scientific environment. Drop the physical system and commit the registry to be a haven of synthetic biology research focusing on the scientific network, not the part number.