University of Missouri--Columbia, Cyberinfrastructure Council
Publication date
01/01/2016
Field of study
Executive Summary: It would be an understatement to say that "next-generation" sequencing technology has been revolutionary. Over the last 10 years, sequencing has created a paradigm shift in biological sciences where more and more a component of research involves "just sequence it". This is because the types of data, applications and resulting insights are expanding every year. Further, the volume and speed of data generation are growing exponentially, while the costs to generate these data are decreasing exponentially. The Human Genome Project completed the first draft genome sequence in 2001 at an estimated cost of 3billion.Next−generationsequencingbecamemainstreamaround2007andenabledthere−sequencingofahumangenomeatacostofapproximately50,000. In late 2015, Illumina announced the availability of their X10 sequencer for use on non-human samples enabling the re-sequencing of a mammalian (human, cow, dog etc.) genome for approximately 1,500andwithanannualthroughputof10,000genomesperyear.Theease,rapidityandcosteffectivenessofgeneratingsequencedatahascreatedacomputationalanalysisbottleneck.ThegrowthofcomputationalresourcesontheMUcampushasnotkeptpacewiththegrowthindatagenerationcapability.InorderforMizzoutomaintainacompetitiveresearchenvironment,weneedtoexpandthecomputationalresourcesavailableforbioinformaticsanalysisoflargedatawhichincludesequencedata.Itwillrequireaninitialinvestmentof619,000 in early 2016 to build the needed core infrastructure and will require ongoing funding to maintain and expand this infrastructure. Initial investments (cost share of 231,000)madebyMizzouin2005tobringnext−generationsequencingtothiscampushavebeenreturnedmany−fold.BasedonasurveysenttoMUresearchersinNovember2015,atotalof66grantshavebeenawardedinvolvingsequencingforatotalof87.5M. 7.6Mofthatisdirectlyattributabletosequencedatageneration/analysis.Inaddition,another7.9M in grant funding has been submitted and remains pending. This research has led to 173 refereed journal articles in top-tier journals producing over 6,000 citations. Additionally, 19 M.S., 62 Ph.D. and 21 postdocs have been trained as a result of these sequence related research projects. Plant and animal researchers at MU have been at the forefront of the next-generation sequencing revolution. However, based on the diversity of grants and papers gathered by the survey, sequence analysis provides a common foundation that ties together many disciplines on campus. As such, investment in computational capacity directed at sequence data analysis will serve the entire campus and provide technological ties between disciplines. The following is a detailed description of the history of sequencing/bioinformatics, a description of the computation resources required, and a model for sustainability and an analysis of the impacts of next-generation sequencing at Mizzou