Search CORE

18 research outputs found

The cowboy wrangles data into publication without analyzing it properly.

Author: Andrea H. Bild (506308)
Jeffrey T. Chang (6154)
Stephen R. Piccolo (353881)
W. Evan Johnson (242079)
Publication venue
Publication date
Field of study

Researchers should beware of potential confounding effects and statistical biases that could lead to inappropriate conclusions. In silico and mechanistic validations can also overcome cowboy tendencies. Image credit: Dan Madsen.</p

FigShare

The jailer guards research data and tools under lock and key to maintain her competitive advantage, even though sharing would advance general scientific progress.

Author: Andrea H. Bild (506308)
Jeffrey T. Chang (6154)
Stephen R. Piccolo (353881)
W. Evan Johnson (242079)
Publication venue
Publication date
Field of study

Having published, researchers should openly share their methods and data with the community. Image credit: Dan Madsen and Devika Joglekar.</p

FigShare

The gold miner keeps digging until a “significant” result surfaces.

Author: Andrea H. Bild (506308)
Jeffrey T. Chang (6154)
Stephen R. Piccolo (353881)
W. Evan Johnson (242079)
Publication venue
Publication date
Field of study

Researchers should stay true to their original experimental design, use positive and negative control experiments, and be open about the approaches that were attempted but failed. Image credit: Dan Madsen.</p

FigShare

The farmer builds a vast storehouse of genomic data but falls short on experimental design.

Author: Andrea H. Bild (506308)
Jeffrey T. Chang (6154)
Stephen R. Piccolo (353881)
W. Evan Johnson (242079)
Publication venue
Publication date
Field of study

Prior to “planting,” researchers should define clear objectives, identify suitable analytical approaches, and consider sample-size requirements, confounding variables, and evaluation measurements. Image credit: Dan Madsen.</p

FigShare

The master has unreasonable expectations about the expertise and time required to complete genomics research tasks; and the servant submits too willingly to those expectations.

Author: Andrea H. Bild (506308)
Jeffrey T. Chang (6154)
Stephen R. Piccolo (353881)
W. Evan Johnson (242079)
Publication venue
Publication date
Field of study

Front-line researchers should insist on adequate training and supervision, whereas mentors should take the long view on scientific training needs. Image credit: Dan Madsen.</p

FigShare

The hermit insists on scientific isolation and fails to realize that, in most cases, success in genomics research hinges upon collaboration among a broad range of scientists.

Author: Andrea H. Bild (506308)
Jeffrey T. Chang (6154)
Stephen R. Piccolo (353881)
W. Evan Johnson (242079)
Publication venue
Publication date
Field of study

Open-mindedness toward the conventions and idiosyncrasies of researchers from other domains is key to avoiding the hermit's existence. Image credit: Dan Madsen and Devika Joglekar.</p

FigShare

OmniScope: a Computational Pipeline for Metagenomic Species Identification Using Reference and de novo Assembly

Author: Eduardo Castro-Nallar (763161)
Keith A. Crandall (121812)
Matthieu J. Miossec (4488517)
Sandro L. Valenzuela (778243)
W. Evan Johnson (242079)
Publication venue
Publication date
Field of study

Metagenomics has revolutionized the field of microbiology and promises to impact clinical practice as well. While the number of genomes available for reference-based metagenomic pathogen identification keeps increasing, it is still difficult to classify most of the reads from a metagenomic experiment due to intra-species diversity and uncharacterized pathogens. Here, we propose to combine reference-based metagenomic profiling (faster) with de novo metagenomic assembly (more accurate) to maximize the number of used reads and allow for the discovery of novel species in the data that are not identified by reference-based methods. We take advantage of the fact that homologous sequences among related but different species form detectable peaks in coverage. Reads belonging to those peaks are then extracted and assembled into contigs. Finally, using a de novo strategy that involves storing the DeBruijn graph in bloom filters, we take the unmapped reads and, together with the contigs, create a hybrid assembly that increases the number of species discovered. We provide a proof of concept and discuss potential applications both for clinical and environmental samples. Test data and code is freely available in GitHub at www.github.com/mjmiossec/omniscope

FigShare

Evaluation of Computational Methods for Human Microbiome Analysis Using Simulated Data

Author: Domenico Simone (484053)
Eduardo Castro-Nallar (763161)
Keith A. Crandall (121812)
Marcos Pérez-Losada (159353)
Sandro L. Valenzuela (778243)
W. Evan Johnson (242079)
Publication venue
Publication date
Field of study

Our understanding of the composition, function, and health implications of human microbiota has been advanced by high-throughput sequencing and the development of new genomic analyses. However, tradeoffs among alternative strategies for the acquisition and analysis of sequence data remain understudied. How do sequencing layout, sample complexity, and analysis pipeline affect taxonomic profiles? In order to approach this, we simulated metagenomic datasets reflecting different read lengths (75-1000 bp), sequencing depths (100 k-10 M), and number of species (10-426). Likewise, we simulated different database composition scenarios including presence/absence of dominant microbes in the database. The resulting simulation design yielded ~144 datasets analyzed using six pipelines (MetaPhlan2; metaMix, PathoScope2, Sigma, Kraken, and ConStrains). We evaluated pipeline performance based on ROC analysis (specificity/sensitivity), relative root mean square error, and average relative error.Our study enables researchers to make informed decisions relative to strengths and weaknesses of current taxonomic profiling methods, and adjust their sequencing experiments accordingly. All datasets and parameter values used in the study are freely available to ensure reproducibility and future pipeline benchmarking.</p

FigShare

PathoScope reads alignment summaries.

Author: Benjamin J. Krajacich (702631)
Brian D. Foy (702634)
Doug E. Brackney (177174)
Fatorma K. Bolay (417274)
Gregory D. Ebel (177185)
Joe W. Diclaro II (702633)
Lawrence S. Fakoli III (702632)
Nathan D. Grubaugh (446564)
Supriya Sharma (546610)
W. Evan Johnson (242079)
Publication venue
Publication date
Field of study

EBV, Epstein-Barr virus; CDV, canine distemper virus.aControl pool generated from laboratory-raised An. gambiae mosquitoes that fed upod sheep’s blood.bDenotes percentage after PathoQC.cDenotes reads aligning to the sheep reference library.dDenotes reads aligned to EBV strain B95–8 (GenBank V01555.2)eDenotes reads aligned to CDV strain Uy251 (GenBank KM280689.1)PathoScope reads alignment summaries.</p

FigShare

Xenosurveillance: A Novel Mosquito-Based Approach for Examining the Human-Pathogen Landscape

Author: Benjamin J. Krajacich (702631)
Brian D. Foy (702634)
Doug E. Brackney (177174)
Fatorma K. Bolay (417274)
Gregory D. Ebel (177185)
Joe W. Diclaro II (702633)
Lawrence S. Fakoli III (702632)
Nathan D. Grubaugh (446564)
Supriya Sharma (546610)
W. Evan Johnson (242079)
Publication venue
Publication date: 01/03/2015
Field of study

<div>BackgroundGlobally, regions at the highest risk for emerging infectious diseases are often the ones with the fewest resources. As a result, implementing sustainable infectious disease surveillance systems in these regions is challenging. The cost of these programs and difficulties associated with collecting, storing and transporting relevant samples have hindered them in the regions where they are most needed. Therefore, we tested the sensitivity and feasibility of a novel surveillance technique called xenosurveillance. This approach utilizes the host feeding preferences and behaviors of Anopheles gambiae, which are highly anthropophilic and rest indoors after feeding, to sample viruses in human beings. We hypothesized that mosquito bloodmeals could be used to detect vertebrate viral pathogens within realistic field collection timeframes and clinically relevant concentrations.Methodology/Principal FindingsTo validate this approach, we examined variables influencing virus detection such as the duration between mosquito blood feeding and mosquito processing, the pathogen nucleic acid stability in the mosquito gut and the pathogen load present in the host’s blood at the time of bloodmeal ingestion using our laboratory model. Our findings revealed that viral nucleic acids, at clinically relevant concentrations, could be detected from engorged mosquitoes for up to 24 hours post feeding by qRT-PCR. Subsequently, we tested this approach in the field by examining blood from engorged mosquitoes from two field sites in Liberia. Using next-generation sequencing and PCR we were able to detect the genetic signatures of multiple viral pathogens including Epstein-Barr virus and canine distemper virus.Conclusions/SignificanceTogether, these data demonstrate the feasibility of xenosurveillance and in doing so validated a simple and non-invasive surveillance tool that could be used to complement current biosurveillance efforts.</div

Directory of Open Access Journals

PubMed Central

FigShare

<i>The cowboy</i> wrangles data into publication without analyzing it properly.

<i>The jailer</i> guards research data and tools under lock and key to maintain her competitive advantage, even though sharing would advance general scientific progress.

<i>The gold miner</i> keeps digging until a “significant” result surfaces.

<i>The farmer</i> builds a vast storehouse of genomic data but falls short on experimental design.

<i>The master</i> has unreasonable expectations about the expertise and time required to complete genomics research tasks; and <i>the servant</i> submits too willingly to those expectations.

<i>The hermit</i> insists on scientific isolation and fails to realize that, in most cases, success in genomics research hinges upon collaboration among a broad range of scientists.

OmniScope: a Computational Pipeline for Metagenomic Species Identification Using Reference and de novo Assembly

Evaluation of Computational Methods for Human Microbiome Analysis Using Simulated Data

PathoScope reads alignment summaries.

Xenosurveillance: A Novel Mosquito-Based Approach for Examining the Human-Pathogen Landscape