245 research outputs found
Analysis of complex metabolic behavior through pathway decomposition
<p>Abstract</p> <p>Background</p> <p>Understanding complex systems through decomposition into simple interacting components is a pervasive paradigm throughout modern science and engineering. For cellular metabolism, complexity can be reduced by decomposition into pathways with particular biochemical functions, and the concept of elementary flux modes provides a systematic way for organizing metabolic networks into such pathways. While decomposition using elementary flux modes has proven to be a powerful tool for understanding and manipulating cellular metabolism, its utility, however, is severely limited since the number of modes in a network increases exponentially with its size.</p> <p>Results</p> <p>Here, we present a new method for decomposition of metabolic flux distributions into elementary flux modes. Our method can easily operate on large, genome-scale networks since it does not require all relevant modes of the metabolic network to be generated. We illustrate the utility of our method for metabolic engineering of <it>Escherichia coli </it>and for understanding the survival of <it>Mycobacterium tuberculosis </it>(MTB) during infection.</p> <p>Conclusions</p> <p>Our method can achieve computational time improvements exceeding 2000-fold and requires only several seconds to generate elementary mode decompositions on genome-scale networks. These improvements arise from not having to generate all relevant elementary modes prior to initiating the decomposition. The decompositions from our method are useful for understanding complex flux distributions and debugging genome-scale models.</p
Deep clustering of bacterial tree images
The field of genomic epidemiology is rapidly growing as many jurisdictions begin to deploy whole-genome sequencing (WGS) in their national or regional pathogen surveillance programmes. WGS data offer a rich view of the shared ancestry of a set of taxa, typically visualized with phylogenetic trees illustrating the clusters or subtypes present in a group of taxa, their relatedness and the extent of diversification within and between them. When methicillin-resistant Staphylococcus aureus (MRSA) arose and disseminated widely, phylogenetic trees of MRSA-containing types of S. aureus had a distinctive 'comet' shape, with a 'comet head' of recently adapted drug-resistant isolates in the context of a 'comet tail' that was predominantly drug-sensitive. Placing an S. aureus isolate in the context of such a 'comet' helped public health laboratories interpret local data within the broader setting of S. aureus evolution. In this work, we ask what other tree shapes, analogous to the MRSA comet, are present in bacterial WGS datasets. We extract trees from large bacterial genomic datasets, visualize them as images and cluster the images. We find nine major groups of tree images, including the 'comets', star-like phylogenies, 'barbell' phylogenies and other shapes, and comment on the evolutionary and epidemiological stories these shapes might illustrate. This article is part of a discussion meeting issue 'Genomic population structures of microbial pathogens'
COVID-19 in Schools: Mitigating Classroom Clusters in the Context of Variable Transmission
Widespread school closures occurred during the COVID-19 pandemic. Because closures are costly and damaging, many jurisdictions have since reopened schools with control measures in place. Early evidence indicated that schools were low risk and children were unlikely to be very infectious, but it is becoming clear that children and youth can acquire and transmit COVID-19 in school settings and that transmission clusters and outbreaks can be large. We describe the contrasting literature on school transmission, and argue that the apparent discrepancy can be reconciled by heterogeneity, or “overdispersion” in transmission, with many exposures yielding little to no risk of onward transmission, but some unfortunate exposures causing sizeable onward transmission. In addition, respiratory viral loads are as high in children and youth as in adults, pre- and asymptomatic transmission occur, and the possibility of aerosol transmission has been established. We use a stochastic individual-based model to find the implications of these combined observations for cluster sizes and control measures. We consider both individual and environment/activity contributions to the transmission rate, as both are known to contribute to variability in transmission. We find that even small heterogeneities in these contributions result in highly variable transmission cluster sizes in the classroom setting, with clusters ranging from 1 to 20 individuals in a class of 25. None of the mitigation protocols we modeled, initiated by a positive test in a symptomatic individual, are able to prevent large transmission clusters unless the transmission rate is low (in which case large clusters do not occur in any case). Among the measures we modeled, only rapid universal monitoring (for example by regular, onsite, pooled testing) accomplished this prevention. We suggest approaches and the rationale for mitigating these larger clusters, even if they are expected to be rare
Selecting likely causal risk factors from high-throughput experiments using multivariable Mendelian randomization
Modern high-throughput experiments provide a rich resource to investigate causal determinants of disease risk. Mendelian randomization (MR) is the use of genetic variants as instrumental variables to infer the causal effect of a specific risk factor on an outcome. Multivariable MR is an extension of the standard MR framework to consider multiple potential risk factors in a single model. However, current implementations of multivariable MR use standard linear regression and hence perform poorly with many risk factors. Here, we propose a two-sample multivariable MR approach based on Bayesian model averaging (MR-BMA) that scales to high-throughput experiments. In a realistic simulation study, we show that MR-BMA can detect true causal risk factors even when the candidate risk factors are highly correlated. We illustrate MR-BMA by analysing publicly-available summarized data on metabolites to prioritise likely causal biomarkers for age-related macular degeneration
Recommended from our members
A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens
Whole genome sequencing is increasingly used to study phenotypic variation among infectious pathogens and to evaluate their relative transmissibility, virulence, and immunogenicity. To date, relatively little has been published on how and how many pathogen strains should be selected for studies associating phenotype and genotype. There are specific challenges when identifying genetic associations in bacteria which often comprise highly structured populations. Here we consider general methodological questions related to sampling and analysis focusing on clonal to moderately recombining pathogens. We propose that a matched sampling scheme constitutes an efficient study design, and provide a power calculator based on phylogenetic convergence. We demonstrate this approach by applying it to genomic datasets for two microbial pathogens: Mycobacterium tuberculosis and Campylobacter species. Electronic supplementary material The online version of this article (doi:10.1186/s13073-014-0101-7) contains supplementary material, which is available to authorized users
The cost-effectiveness of alternative vaccination strategies for polyvalent meningococcal vaccines in Burkina Faso: A transmission dynamic modeling study.
BACKGROUND: The introduction of a conjugate vaccine for serogroup A Neisseria meningitidis has dramatically reduced disease in the African meningitis belt. In this context, important questions remain about the performance of different vaccine policies that target remaining serogroups. Here, we estimate the health impact and cost associated with several alternative vaccination policies in Burkina Faso. METHODS AND FINDINGS: We developed and calibrated a mathematical model of meningococcal transmission to project the disability-adjusted life years (DALYs) averted and costs associated with the current Base policy (serogroup A conjugate vaccination at 9 months, as part of the Expanded Program on Immunization [EPI], plus district-specific reactive vaccination campaigns using polyvalent meningococcal polysaccharide [PMP] vaccine in response to outbreaks) and three alternative policies: (1) Base Prime: novel polyvalent meningococcal conjugate (PMC) vaccine replaces the serogroup A conjugate in EPI and is also used in reactive campaigns; (2) Prevention 1: PMC used in EPI and in a nationwide catch-up campaign for 1-18-year-olds; and (3) Prevention 2: Prevention 1, except the nationwide campaign includes individuals up to 29 years old. Over a 30-year simulation period, Prevention 2 would avert 78% of the meningococcal cases (95% prediction interval: 63%-90%) expected under the Base policy if serogroup A is not replaced by remaining serogroups after elimination, and would avert 87% (77%-93%) of meningococcal cases if complete strain replacement occurs. Compared to the Base policy and at the PMC vaccine price of US51 (-US490), US97, US246 (-US703) per DALY averted, respectively, if strain replacement does not occur. An important potential limitation of our study is the simplifying assumption that all circulating meningococcal serogroups can be aggregated into a single group; while this assumption is critical for model tractability, it would compromise the insights derived from our model if the effectiveness of the vaccine differs markedly between serogroups or if there are complex between-serogroup interactions that influence the frequency and magnitude of future meningitis epidemics. CONCLUSIONS: Our results suggest that a vaccination strategy that includes a catch-up nationwide immunization campaign in young adults with a PMC vaccine and the addition of this new vaccine into EPI is cost-effective and would avert a substantial portion of meningococcal cases expected under the current World Health Organization-recommended strategy of reactive vaccination. This analysis is limited to Burkina Faso and assumes that polyvalent vaccines offer equal protection against all meningococcal serogroups; further studies are needed to evaluate the robustness of this assumption and applicability for other countries in the meningitis belt
Genome-Wide Association with Uncertainty in the Genetic Similarity Matrix
Genome-wide association studies (GWASs) are often confounded by population stratification and structure. Linear mixed models (LMMs) are a powerful class of methods for uncovering genetic effects, while controlling for such confounding. LMMs include random effects for a genetic similarity matrix, and they assume that a true genetic similarity matrix is known. However, uncertainty about the phylogenetic structure of a study population may degrade the quality of LMM results. This may happen in bacterial studies in which the number of samples or loci is small, or in studies with low-quality genotyping. In this study, we develop methods for linear mixed models in which the genetic similarity matrix is unknown and is derived from Markov chain Monte Carlo estimates of the phylogeny. We apply our model to a GWAS of multidrug resistance in tuberculosis, and illustrate our methods on simulated data
Are Survey-Based Estimates of the Burden of Drug Resistant TB Too Low? Insight from a Simulation Study
Background: The emergence of tuberculosis resistant to multiple first- and second-line antibiotics poses challenges to a global control strategy that relies on standard drug treatment regimens. Highly drug-resistant strains of Mycobacterium tuberculosis have been implicated in outbreaks and have been found throughout the world; a comprehensive understanding the magnitude of this threat requires an accurate assessment of the worldwide burden of resistance. Unfortunately, in many settings where resistance is emerging, laboratory capacity is limited and estimates of the burden of resistance are obtained by performing drug sensitivity testing on a sample of incident cases rather than through the use of routine surveillance. Methodology/Principal Findings: Using an individual-based dynamic tuberculosis model to simulate surveillance strategies for drug resistance, we found that current surveys may underestimate the total burden of resistant tuberculosis because cases of acquired resistance are undercounted and resistance among prevalent cases is not assessed. We explored how this bias is affected by the maturity of the epidemic and by the introduction of interventions that target the emergence and spread of resistant tuberculosis. Conclusions: Estimates of drug resistant tuberculosis based on samples of incident cases should be viewed as a lower bound of the total burden of resistance
Transmission analysis of a large tuberculosis outbreak in London:a mathematical modelling study using genomic data
Outbreaks of tuberculosis (TB) - such as the large isoniazid-resistant outbreak centred on London, UK, which originated in 1995 - provide excellent opportunities to model transmission of this devastating disease. Transmission chains for TB are notoriously difficult to ascertain, but mathematical modelling approaches, combined with whole-genome sequencing data, have strong potential to contribute to transmission analyses. Using such data, we aimed to reconstruct transmission histories for the outbreak using a Bayesian approach, and to use machine-learning techniques with patient-level data to identify the key covariates associated with transmission. By using our transmission reconstruction method that accounts for phylogenetic uncertainty, we are able to identify 21 transmission events with reasonable confidence, 9 of which have zero SNP distance, and a maximum distance of 3. Patient age, alcohol abuse and history of homelessness were found to be the most important predictors of being credible TB transmitters
- …