268 research outputs found
Serial evolutionary networks of within-patient HIV-1 sequences reveal patterns of evolution of X4 strains
<p>Abstract</p> <p>Background</p> <p>The HIV virus is known for its ability to exploit numerous genetic and evolutionary mechanisms to ensure its proliferation, among them, high replication, mutation and recombination rates. Sliding MinPD, a recently introduced computational method <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, was used to investigate the patterns of evolution of serially-sampled HIV-1 sequence data from eight patients with a special focus on the emergence of X4 strains. Unlike other phylogenetic methods, Sliding MinPD combines distance-based inference with a nonparametric bootstrap procedure and automated recombination detection to reconstruct the evolutionary history of longitudinal sequence data. We present serial evolutionary networks as a longitudinal representation of the mutational pathways of a viral population in a within-host environment. The longitudinal representation of the evolutionary networks was complemented with charts of clinical markers to facilitate correlation analysis between pertinent clinical information and the evolutionary relationships.</p> <p>Results</p> <p>Analysis based on the predicted networks suggests the following:: significantly stronger recombination signals (p = 0.003) for the inferred ancestors of the X4 strains, recombination events between different lineages and recombination events between putative reservoir virus and those from a later population, an early star-like topology observed for four of the patients who died of AIDS. A significantly higher number of recombinants were predicted at sampling points that corresponded to peaks in the viral load levels (p = 0.0042).</p> <p>Conclusion</p> <p>Our results indicate that serial evolutionary networks of HIV sequences enable systematic statistical analysis of the implicit relations embedded in the topology of the structure and can greatly facilitate identification of patterns of evolution that can lead to specific hypotheses and new insights. The conclusions of applying our method to empirical HIV data support the conventional wisdom of the new generation HIV treatments, that in order to keep the virus in check, viral loads need to be suppressed to almost undetectable levels.</p
The Power of Explainability in Forecast-Informed Deep Learning Models for Flood Mitigation
Floods can cause horrific harm to life and property. However, they can be
mitigated or even avoided by the effective use of hydraulic structures such as
dams, gates, and pumps. By pre-releasing water via these structures in advance
of extreme weather events, water levels are sufficiently lowered to prevent
floods. In this work, we propose FIDLAR, a Forecast Informed Deep Learning
Architecture, achieving flood management in watersheds with hydraulic
structures in an optimal manner by balancing out flood mitigation and
unnecessary wastage of water via pre-releases. We perform experiments with
FIDLAR using data from the South Florida Water Management District, which
manages a coastal area that is highly prone to frequent storms and floods.
Results show that FIDLAR performs better than the current state-of-the-art with
several orders of magnitude speedup and with provably better pre-release
schedules. The dramatic speedups make it possible for FIDLAR to be used for
real-time flood management. The main contribution of this paper is the
effective use of tools for model explainability, allowing us to understand the
contribution of the various environmental factors towards its decisions
Large scale microbiome profiling in the cloud
Motivation
Bacterial metagenomics profiling for metagenomic whole sequencing (mWGS) usually starts by aligning sequencing reads to a collection of reference genomes. Current profiling tools are designed to work against a small representative collection of genomes, and do not scale very well to larger reference genome collections. However, large reference genome collections are capable of providing a more complete and accurate profile of the bacterial population in a metagenomics dataset. In this paper, we discuss a scalable, efficient and affordable approach to this problem, bringing big data solutions within the reach of laboratories with modest resources. Results
We developed FLINT, a metagenomics profiling pipeline that is built on top of the Apache Spark framework, and is designed for fast real-time profiling of metagenomic samples against a large collection of reference genomes. FLINT takes advantage of Spark’s built-in parallelism and streaming engine architecture to quickly map reads against a large (170 GB) reference collection of 43 552 bacterial genomes from Ensembl. FLINT runs on Amazon’s Elastic MapReduce service, and is able to profile 1 million Illumina paired-end reads against over 40 K genomes on 64 machines in 67 s—an order of magnitude faster than the state of the art, while using a much larger reference collection. Streaming the sequencing reads allows this approach to sustain mapping rates of 55 million reads per hour, at an hourly cluster cost of $8.00 USD, while avoiding the necessity of storing large quantities of intermediate alignments
Recommended from our members
The Regulatory Repertoire of Pseudomonas aeruginosa AmpC Ăź-Lactamase Regulator AmpR Includes Virulence Genes
In Enterobacteriaceae, the transcriptional regulator AmpR, a member of the LysR family, regulates the expression of a chromosomal β-lactamase AmpC. The regulatory repertoire of AmpR is broader in Pseudomonas aeruginosa, an opportunistic pathogen responsible for numerous acute and chronic infections including cystic fibrosis. In addition to regulating ampC, P. aeruginosa AmpR regulates the sigma factor AlgT/U and production of some quorum sensing (QS)-regulated virulence factors. In order to better understand the ampR regulon, we compared the transcriptional profile generated using DNA microarrays of the prototypic P. aeruginosa PAO1 strain with its isogenic ampR deletion mutant, PAOΔampR. Transcriptome analysis demonstrates that the AmpR regulon is much more extensive than previously thought, with the deletion of ampR influencing the differential expression of over 500 genes. In addition to regulating resistance to β-lactam antibiotics via AmpC, AmpR also regulates non-β-lactam antibiotic resistance by modulating the MexEF-OprN efflux pump. Other virulence mechanisms including biofilm formation and QS-regulated acute virulence factors are AmpR-regulated. Real-time PCR and phenotypic assays confirmed the microarray data. Further, using a Caenorhabditis elegans model, we demonstrate that a functional AmpR is required for P. aeruginosa pathogenicity. AmpR, a member of the core genome, also regulates genes in the regions of genome plasticity that are acquired by horizontal gene transfer. Further, we show differential regulation of other transcriptional regulators and sigma factors by AmpR, accounting for the extensive AmpR regulon. The data demonstrates that AmpR functions as a global regulator in P. aeruginosa and is a positive regulator of acute virulence while negatively regulating biofilm formation, a chronic infection phenotype. Unraveling this complex regulatory circuit will provide a better understanding of the bacterial response to antibiotics and how the organism coordinately regulates a myriad of virulence factors in response to antibiotic exposure
Adapting Flint for Calculating Bacterial Replication Rates in Microbiomes
We extend Flint, a Spark-based metagenomic profiling tool, to efficiently measure bacterial growth rates for large data sets. The tool bPTR for bacterial growth rate measurement from metagenomic samples [Brown et al., Nat Biotech, 2016] was adapted and integrated into Flint’s MapReduce framework in order to take advantage of Flint\u27s efficient read alignments and mapping, thus enabling the creation of bacterial abundance profiles that are enhanced with growth-rate information.To show the viability of our method we analyzed whole metagenome sequence data from a longitudinal study of sampled preterm infants [Gibson et al., Nat Micro, 2016], computing the abundance profile enhanced with growth rate information. The conclusions shed light on the new perspective obtained on antibiotics treatments and antibiotic resistance by looking at replication rates
ATria: a novel centrality algorithm applied to biological networks
Background The notion of centrality is used to identify ?important? nodes in social networks. Importance of nodes is not well-defined, and many different notions exist in the literature. The challenge of defining centrality in meaningful ways when network edges can be positively or negatively weighted has not been adequately addressed in the literature. Existing centrality algorithms also have a second shortcoming, i.e., the list of the most central nodes are often clustered in a specific region of the network and are not well represented across the network. Methods We address both by proposing Ablatio Triadum (ATria), an iterative centrality algorithm that uses the concept of ?payoffs? from economic theory. Results We compare our algorithm with other known centrality algorithms and demonstrate how ATria overcomes several of their shortcomings. We demonstrate the applicability of our algorithm to synthetic networks as well as biological networks including bacterial co-occurrence networks, sometimes referred to as microbial social networks. Conclusions We show evidence that ATria identifies three different kinds of ?important? nodes in microbial social networks with different potential roles in the community
Causal Inference in Microbiomes Using Intervention Calculus
Inferring causal effects is critically important in biomedical research as it allows us to move from the typical paradigm of associational studies to causal inference, and can impact treatments and therapeutics. Association patterns can be coincidental and may lead to wrong inferences in complex systems. Microbiomes are highly complex, diverse, and dynamic environments. Microbes are key players in health and diseases. Hence knowledge of genuine causal relationships among the entities in a microbiome, and the impact of internal and external factors on microbial abundance and interactions are essential for understanding disease mechanisms and making treatment recommendations. In this paper, we investigate fundamental causal inference techniques to measure the causal effects of various entities in a microbiome. In particular, we show how to use these techniques on microbiome datasets to study the rise and impact of antibiotic-resistance in microbiomes. Our main contributions include the following. We introduce a novel pipeline for microbiome studies, new ideas for experimental design under weaker assumptions, and data augmentation by context embedding. Our pipeline is robust, different from traditional approaches, and able to predict interventional effects without any controlled experiments. Our work shows the advantages of causal inference in identifying potential pathogenic, beneficial, and antibiotic-resistant bacteria. We validate our results using results that were previously published
- …