123 research outputs found

    Identifying Structural Variation in Haploid Microbial Genomes from Short-Read Resequencing Data Using Breseq

    Get PDF
    Mutations that alter chromosomal structure play critical roles in evolution and disease, including in the origin of new lifestyles and pathogenic traits in microbes. Large-scale rearrangements in genomes are often mediated by recombination events involving new or existing copies of mobile genetic elements, recently duplicated genes, or other repetitive sequences. Most current software programs for predicting structural variation from short-read DNA resequencing data are intended primarily for use on human genomes. They typically disregard information in reads mapping to repeat sequences, and significant post-processing and manual examination of their output is often required to rule out false-positive predictions and precisely describe mutational events. Results: We have implemented an algorithm for identifying structural variation from DNA resequencing data as part of the breseq computational pipeline for predicting mutations in haploid microbial genomes. Our method evaluates the support for new sequence junctions present in a clonal sample from split-read alignments to a reference genome, including matches to repeat sequences. Then, it uses a statistical model of read coverage evenness to accept or reject these predictions. Finally, breseq combines predictions of new junctions and deleted chromosomal regions to output biologically relevant descriptions of mutations and their effects on genes. We demonstrate the performance of breseq on simulated Escherichia coli genomes with deletions generating unique breakpoint sequences, new insertions of mobile genetic elements, and deletions mediated by mobile elements. Then, we reanalyze data from an E. coli K-12 mutation accumulation evolution experiment in which structural variation was not previously identified. Transposon insertions and large-scale chromosomal changes detected by breseq account for similar to 25% of spontaneous mutations in this strain. In all cases, we find that breseq is able to reliably predict structural variation with modest read-depth coverage of the reference genome (>40-fold). Conclusions: Using breseq to predict structural variation should be useful for studies of microbial epidemiology, experimental evolution, synthetic biology, and genetics when a reference genome for a closely related strain is available. In these cases, breseq can discover mutations that may be responsible for important or unintended changes in genomes that might otherwise go undetected.U.S. National Institutes of Health R00-GM087550U.S. National Science Foundation (NSF) DEB-0515729NSF BEACON Center for the Study of Evolution in Action DBI-0939454Cancer Prevention & Research Institute of Texas (CPRIT) RP130124University of Texas at Austin startup fundsUniversity of Texas at AustinCPRIT Cancer Research TraineeshipMolecular Bioscience

    LASAGNA PLOTS: A SAUCY ALTERNATIVE TO SPAGHETTI PLOTS

    Get PDF
    Longitudinal repeated measures data has often been visualized with spaghetti plots for continuous out- comes. For large datasets, this often leads to over-plotting and consequential obscuring of trends in the data. This is primarily due to overlapping of trajectories. Here, we suggest a framework called lasagna plot ting that constrains the subject-specific trajectories to prevent overlapping and utilizes gradients of color to depict the outcome. Dynamic sorting and visualization is demonstrated as an exploratory data analysis tool. Supplemental material in the form of sample R code additional illustrated examples are available online

    Augmentation Therapy for Severe Alpha-1 Antitrypsin Deficiency Improves Survival and Is Decoupled from Spirometric Decline—A Multinational Registry Analysis

    Full text link
    Rationale: Intravenous plasma-purified alpha-1 antitrypsin (IV-AAT) has been used as therapy for alpha-1 antitrypsin deficiency (AATD) since 1987. Previous trials (RAPID and RAPID-OLE) demonstrated efficacy in preserving computed tomography of lung density but no effect on FEV1. This observational study evaluated 615 people with severe AATD from three countries with socialized health care (Ireland, Switzerland, and Austria), where access to standard medical care was equal but access to IV-AAT was not. Objectives: To assess the real-world longitudinal effects of IV-AAT. Methods: Pulmonary function and mortality data were utilized to perform longitudinal analyses on registry participants with severe AATD. Measurements and Main Results: IV-AAT confers a survival benefit in severe AATD (P < 0.001). We uncovered two distinct AATD phenotypes based on an initial respiratory diagnosis: lung index and non-lung index. Lung indexes demonstrated a more rapid FEV1 decline between the ages of 20 and 50 and subsequently entered a plateau phase of minimal decline from 50 onward. Consequentially, IV-AAT had no effect on FEV1 decline, except in patients with a Global Initiative for Chronic Obstructive Lung Disease (GOLD) stage 2 lung index. Conclusions: This real-world study demonstrates a survival advantage from IV-AAT. This improved survival is largely decoupled from FEV1 decline. The observation that patients with severe AATD fall into two major phenotypes has implications for clinical trial design where FEV1 is a primary endpoint. Recruits into trials are typically older lung indexes entering the plateau phase and, therefore, unlikely to show spirometric benefits. IV-AAT attenuates spirometric decline in lung indexes in GOLD stage 2, a spirometric group commonly outside current IV-AAT commencement recommendations

    Intraspecfic variation in cold-temperature metabolic phenotypes of Arabidopsis lyrata ssp petraea

    Get PDF
    Atmospheric temperature is a key factor in determining the distribution of a plant species. Alongside this, plant populations growing at the margin of their range may exhibit traits that indicate genetic differentiation and adaptation to their local abiotic environment. We investigated whether geographically separated marginal populations of Arabidopsis lyrata ssp. petraea have distinct metabolic phenotypes associated with exposure to cold temperatures. Seeds of A. petraea were obtained from populations along a latitudinal gradient, namely Wales, Sweden and Iceland and grown in a controlled cabinet environment. Mannose, glucose, fructose, sucrose and raffinose concentrations were different between cold treatments and populations, especially in the Welsh population, but polyhydric alcohol concentrations were not. The free amino acid compositions were population specific, with fold differences in most amino acids, especially in the Icelandic populations, with gross changes in amino acids, particularly those associated with glutamine metabolism. Metabolic fingerprints and profiles were obtained. Principal component analysis (PCA) of metabolite fingerprints revealed metabolic characteristic phenotypes for each population and temperature. It is suggested that amino acids and carbohydrates were responsible for discriminating populations within the PCA. Metabolite fingerprinting and profiling has proved to be sufficiently sensitive to identify metabolic differences between plant populations at different atmospheric temperatures. These findings show that there is significant natural variation in cold metabolism among populations of A. l. petraea which may signify plant adaptation to local climates

    Multidrug resistant pulmonary tuberculosis treatment regimens and patient outcomes: an individual patient data meta-analysis of 9,153 patients.

    Get PDF
    Treatment of multidrug resistant tuberculosis (MDR-TB) is lengthy, toxic, expensive, and has generally poor outcomes. We undertook an individual patient data meta-analysis to assess the impact on outcomes of the type, number, and duration of drugs used to treat MDR-TB

    Gene content evolution in the arthropods

    Get PDF
    Arthropods comprise the largest and most diverse phylum on Earth and play vital roles in nearly every ecosystem. Their diversity stems in part from variations on a conserved body plan, resulting from and recorded in adaptive changes in the genome. Dissection of the genomic record of sequence change enables broad questions regarding genome evolution to be addressed, even across hyper-diverse taxa within arthropods. Using 76 whole genome sequences representing 21 orders spanning more than 500 million years of arthropod evolution, we document changes in gene and protein domain content and provide temporal and phylogenetic context for interpreting these innovations. We identify many novel gene families that arose early in the evolution of arthropods and during the diversification of insects into modern orders. We reveal unexpected variation in patterns of DNA methylation across arthropods and examples of gene family and protein domain evolution coincident with the appearance of notable phenotypic and physiological adaptations such as flight, metamorphosis, sociality, and chemoperception. These analyses demonstrate how large-scale comparative genomics can provide broad new insights into the genotype to phenotype map and generate testable hypotheses about the evolution of animal diversity

    Global population divergence and admixture of the brown rat (Rattus norvegicus)

    Get PDF
    Native to China and Mongolia, the brown rat (Rattus norvegicus) now enjoys a worldwide distribution. While black rats and the house mouse tracked the regional development of human agricultural settlements, brown rats did not appear in Europe until the 1500s, suggesting their range expansion was a response to relatively recent increases in global trade. We inferred the global phylogeography of brown rats using 32 k SNPs, and detected 13 evolutionary clusters within five expansion routes. One cluster arose following a southward expansion into Southeast Asia. Three additional clusters arose from two independent eastward expansions: one expansion from Russia to the Aleutian Archipelago, and a second to western North America. Westward expansion resulted in the colonization of Europe from which subsequent rapid colonization of Africa, the Americas and Australasia occurred, and multiple evolutionary clusters were detected. An astonishing degree of fine-grained clustering between and within sampling sites underscored the extent to which urban heterogeneity shaped genetic structure of commensal rodents. Surprisingly, few individuals were recent migrants, suggesting that recruitment into established populations is limited. Understanding the global population structure of R. norvegicus offers novel perspectives on the forces driving the spread of zoonotic disease, and aids in development of rat eradication programmes

    Impact of exposure measurement error in air pollution epidemiology: effect of error type in time-series studies

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Two distinctly different types of measurement error are Berkson and classical. Impacts of measurement error in epidemiologic studies of ambient air pollution are expected to depend on error type. We characterize measurement error due to instrument imprecision and spatial variability as multiplicative (i.e. additive on the log scale) and model it over a range of error types to assess impacts on risk ratio estimates both on a per measurement unit basis and on a per interquartile range (IQR) basis in a time-series study in Atlanta.</p> <p>Methods</p> <p>Daily measures of twelve ambient air pollutants were analyzed: NO<sub>2</sub>, NO<sub>x</sub>, O<sub>3</sub>, SO<sub>2</sub>, CO, PM<sub>10 </sub>mass, PM<sub>2.5 </sub>mass, and PM<sub>2.5 </sub>components sulfate, nitrate, ammonium, elemental carbon and organic carbon. Semivariogram analysis was applied to assess spatial variability. Error due to this spatial variability was added to a reference pollutant time-series on the log scale using Monte Carlo simulations. Each of these time-series was exponentiated and introduced to a Poisson generalized linear model of cardiovascular disease emergency department visits.</p> <p>Results</p> <p>Measurement error resulted in reduced statistical significance for the risk ratio estimates for all amounts (corresponding to different pollutants) and types of error. When modelled as classical-type error, risk ratios were attenuated, particularly for primary air pollutants, with average attenuation in risk ratios on a per unit of measurement basis ranging from 18% to 92% and on an IQR basis ranging from 18% to 86%. When modelled as Berkson-type error, risk ratios per unit of measurement were biased away from the null hypothesis by 2% to 31%, whereas risk ratios per IQR were attenuated (i.e. biased toward the null) by 5% to 34%. For CO modelled error amount, a range of error types were simulated and effects on risk ratio bias and significance were observed.</p> <p>Conclusions</p> <p>For multiplicative error, both the amount and type of measurement error impact health effect estimates in air pollution epidemiology. By modelling instrument imprecision and spatial variability as different error types, we estimate direction and magnitude of the effects of error over a range of error types.</p
    corecore