487 research outputs found

    Bulk Segregant Analysis by High-Throughput Sequencing Reveals a Novel Xylose Utilization Gene from Saccharomyces cerevisiae

    Get PDF
    Fermentation of xylose is a fundamental requirement for the efficient production of ethanol from lignocellulosic biomass sources. Although they aggressively ferment hexoses, it has long been thought that native Saccharomyces cerevisiae strains cannot grow fermentatively or non-fermentatively on xylose. Population surveys have uncovered a few naturally occurring strains that are weakly xylose-positive, and some S. cerevisiae have been genetically engineered to ferment xylose, but no strain, either natural or engineered, has yet been reported to ferment xylose as efficiently as glucose. Here, we used a medium-throughput screen to identify Saccharomyces strains that can increase in optical density when xylose is presented as the sole carbon source. We identified 38 strains that have this xylose utilization phenotype, including strains of S. cerevisiae, other sensu stricto members, and hybrids between them. All the S. cerevisiae xylose-utilizing strains we identified are wine yeasts, and for those that could produce meiotic progeny, the xylose phenotype segregates as a single gene trait. We mapped this gene by Bulk Segregant Analysis (BSA) using tiling microarrays and high-throughput sequencing. The gene is a putative xylitol dehydrogenase, which we name XDH1, and is located in the subtelomeric region of the right end of chromosome XV in a region not present in the S288c reference genome. We further characterized the xylose phenotype by performing gene expression microarrays and by genetically dissecting the endogenous Saccharomyces xylose pathway. We have demonstrated that natural S. cerevisiae yeasts are capable of utilizing xylose as the sole carbon source, characterized the genetic basis for this trait as well as the endogenous xylose utilization pathway, and demonstrated the feasibility of BSA using high-throughput sequencing

    Evolutionary dynamics and structural consequences of de novo beneficial mutations and mutant lineages arising in a constant environment

    Get PDF
    Background: Microbial evolution experiments can be used to study the tempo and dynamics of evolutionary change in asexual populations, founded from single clones and growing into large populations with multiple clonal lineages. High-throughput sequencing can be used to catalog de novo mutations as potential targets of selection, determine in which lineages they arise, and track the fates of those lineages. Here, we describe a long-term experimental evolution study to identify targets of selection and to determine when, where, and how often those targets are hit. Results: We experimentally evolved replicate Escherichia coli populations that originated from a mutator/nonsense suppressor ancestor under glucose limitation for between 300 and 500 generations. Whole-genome, whole-population sequencing enabled us to catalog 3346 de novo mutations that reached \u3e 1% frequency. We sequenced the genomes of 96 clones from each population when allelic diversity was greatest in order to establish whether mutations were in the same or different lineages and to depict lineage dynamics. Operon-specific mutations that enhance glucose uptake were the first to rise to high frequency, followed by global regulatory mutations. Mutations related to energy conservation, membrane biogenesis, and mitigating the impact of nonsense mutations, both ancestral and derived, arose later. New alleles were confined to relatively few loci, with many instances of identical mutations arising independently in multiple lineages, among and within replicate populations. However, most never exceeded 10% in frequency and were at a lower frequency at the end of the experiment than at their maxima, indicating clonal interference. Many alleles mapped to key structures within the proteins that they mutated, providing insight into their functional consequences. Conclusions: Overall, we find that when mutational input is increased by an ancestral defect in DNA repair, the spectrum of high-frequency beneficial mutations in a simple, constant resource-limited environment is narrow, resulting in extreme parallelism where many adaptive mutations arise but few ever go to fixation

    Time-varying coefficient models for the analysis of air pollution and health outcome data

    Get PDF
    In this article a time-varying coefficient model is developed to examine the relationship between adverse health and short-term (acute) exposure to air pollution. This model allows the relative risk to evolve over time, which may be due to an interaction with temperature, or from a change in the composition of pollutants, such as particulate matter, over time. The model produces a smooth estimate of these time-varying effects, which are not constrained to follow a fixed parametric form set by the investigator. Instead, the shape is estimated from the data using penalized natural cubic splines. Poisson regression models, using both quasi-likelihood and Bayesian techniques, are developed, with estimation performed using an iteratively re-weighted least squares procedure and Markov chain Monte Carlo simulation, respectively. The efficacy of the methods to estimate different types of time-varying effects are assessed via a simulation study, and the models are then applied to data from four cities that were part of the National Morbidity, Mortality, and Air Pollution Study

    APJ1 and GRE3 Homologs Work in Concert to Allow Growth in Xylose in a Natural Saccharomyces sensu stricto Hybrid Yeast

    Get PDF
    Creating Saccharomyces yeasts capable of efficient fermentation of pentoses such as xylose remains a key challenge in the production of ethanol from lignocellulosic biomass. Metabolic engineering of industrial Saccharomyces cerevisiae strains has yielded xylose-fermenting strains, but these strains have not yet achieved industrial viability due largely to xylose fermentation being prohibitively slower than that of glucose. Recently, it has been shown that naturally occurring xylose-utilizing Saccharomyces species exist. Uncovering the genetic architecture of such strains will shed further light on xylose metabolism, suggesting additional engineering approaches or possibly even enabling the development of xylose-fermenting yeasts that are not genetically modified. We previously identified a hybrid yeast strain, the genome of which is largely Saccharomyces uvarum, which has the ability to grow on xylose as the sole carbon source. To circumvent the sterility of this hybrid strain, we developed a novel method to genetically characterize its xylose-utilization phenotype, using a tetraploid intermediate, followed by bulk segregant analysis in conjunction with high-throughput sequencing. We found that this strain’s growth in xylose is governed by at least two genetic loci, within which we identified the responsible genes: one locus contains a known xylose-pathway gene, a novel homolog of the aldo-keto reductase gene GRE3, while a second locus contains a homolog of APJ1, which encodes a putative chaperone not previously connected to xylose metabolism. Our work demonstrates that the power of sequencing combined with bulk segregant analysis can also be applied to a nongenetically tractable hybrid strain that contains a complex, polygenic trait, and identifies new avenues for metabolic engineering as well as for construction of nongenetically modified xylose-fermenting strains

    GC-Content Normalization for RNA-Seq Data

    Get PDF
    Background: Transcriptome sequencing (RNA-Seq) has become the assay of choice for high-throughput studies of gene expression. However, as is the case with microarrays, major technology-related artifacts and biases affect the resulting expression measures. Normalization is therefore essential to ensure accurate inference of expression levels and subsequent analyses thereof. Results: We focus on biases related to GC-content and demonstrate the existence of strong sample-specific GC-content effects on RNA-Seq read counts, which can substantially bias differential expression analysis. We propose three simple within-lane gene-level GC-content normalization approaches and assess their performance on two different RNA-Seq datasets, involving different species and experimental designs. Our methods are compared to state-of-the-art normalization procedures in terms of bias and mean squared error for expression fold-change estimation and in terms of Type I error and p-value distributions for tests of differential expression. The exploratory data analysis and normalization methods proposed in this article are implemented in the open-source Bioconductor R package EDASeq. Conclusions: Our within-lane normalization procedures, followed by between-lane normalization, reduce GC-content bias and lead to more accurate estimates of expression fold-changes and tests of differential expression. Such results are crucial for the biological interpretation of RNA-Seq experiments, where downstream analyses can be sensitive to the supplied lists of genes

    The Valley-of-Death: Reciprocal sign epistasis constrains adaptive trajectories in a constant, nutrient limiting environment

    Get PDF
    The fitness landscape is a powerful metaphor for describing the relationship between genotype and phenotype for a population under selection. However, empirical data as to the topography of fitness landscapes are limited, owing to difficulties in measuring fitness for large numbers of genotypes under any condition. We previously reported a case of reciprocal sign epistasis (RSE), where two mutations individually increased yeast fitness in a glucose-limited environment, but reduced fitness when combined, suggesting the existence of two peaks on the fitness landscape. We sought to determine whether a ridge connected these peaks so that populations founded by one mutant could reach the peak created by the other, avoiding the low-fitness Valley-of-Death between them. Sequencing clones after 250 generations of further evolution provided no evidence for such a ridge, but did reveal many presumptive beneficial mutations, adding to a growing body of evidence that clonal interference pervades evolving microbial populations

    Revisiting Date and Party Hubs: Novel Approaches to Role Assignment in Protein Interaction Networks

    Get PDF
    The idea of 'date' and 'party' hubs has been influential in the study of protein-protein interaction networks. Date hubs display low co-expression with their partners, whilst party hubs have high co-expression. It was proposed that party hubs are local coordinators whereas date hubs are global connectors. Here we show that the reported importance of date hubs to network connectivity can in fact be attributed to a tiny subset of them. Crucially, these few, extremely central, hubs do not display particularly low expression correlation, undermining the idea of a link between this quantity and hub function. The date/party distinction was originally motivated by an approximately bimodal distribution of hub co-expression; we show that this feature is not always robust to methodological changes. Additionally, topological properties of hubs do not in general correlate with co-expression. Thus, we suggest that a date/party dichotomy is not meaningful and it might be more useful to conceive of roles for protein-protein interactions rather than individual proteins. We find significant correlations between interaction centrality and the functional similarity of the interacting proteins.Comment: 27 pages, 5 main figures, 4 supplementary figure

    Cmr1/WDR76 defines a nuclear genotoxic stress body linking genome integrity and protein quality control

    Get PDF
    DNA replication stress is a source of genomic instability. Here we identify ​changed mutation rate 1 (​Cmr1) as a factor involved in the response to DNA replication stress in Saccharomyces cerevisiae and show that ​Cmr1—together with ​Mrc1/​Claspin, ​Pph3, the chaperonin containing ​TCP1 (CCT) and 25 other proteins—define a novel intranuclear quality control compartment (INQ) that sequesters misfolded, ubiquitylated and sumoylated proteins in response to genotoxic stress. The diversity of proteins that localize to INQ indicates that other biological processes such as cell cycle progression, chromatin and mitotic spindle organization may also be regulated through INQ. Similar to ​Cmr1, its human orthologue ​WDR76 responds to proteasome inhibition and DNA damage by relocalizing to nuclear foci and physically associating with CCT, suggesting an evolutionarily conserved biological function. We propose that ​Cmr1/​WDR76 plays a role in the recovery from genotoxic stress through regulation of the turnover of sumoylated and phosphorylated proteins

    Why social values cannot be changed for the sake of conservation

    Get PDF
    The hope for creating widespread change in social values has endured among conservation professionals since early calls by Aldo Leopold for a “land ethic.” However, there has been little serious attention in conservation to the fields of investigation that address values, how they are formed, and how they change. We introduce a social–ecological systems conceptual approach in which values are seen not only as motivational goals people hold but also as ideas that are deeply embedded in society’s material culture, collective behaviors, traditions, and institutions. Values define and bind groups, organizations, and societies; serve an adaptive role; and are typically stable across generations. When abrupt value changes occur, they are in response to substantial alterations in the social–ecological context. Such changes build on prior value structures and do not result in complete replacement. Given this understanding of values, we conclude that deliberate efforts to orchestrate value shifts for conservation are unlikely to be effective. Instead, there is an urgent need for research on values with a multilevel and dynamic view that can inform innovative conservation strategies for working within existing value structures. New directions facilitated by a systems approach will enhance understanding of the role values play in shaping conservation challenges and improve management of the human component of conservation

    Customizable views on semantically integrated networks for systems biology

    Get PDF
    Motivation: The rise of high-throughput technologies in the post-genomic era has led to the production of large amounts of biological data. Many of these datasets are freely available on the Internet. Making optimal use of these data is a significant challenge for bioinformaticians. Various strategies for integrating data have been proposed to address this challenge. One of the most promising approaches is the development of semantically rich integrated datasets. Although well suited to computational manipulation, such integrated datasets are typically too large and complex for easy visualization and interactive exploration
    corecore