80 research outputs found
Statistical inference of transmission fidelity of DNA methylation patterns over somatic cell divisions in mammals
We develop Bayesian inference methods for a recently-emerging type of
epigenetic data to study the transmission fidelity of DNA methylation patterns
over cell divisions. The data consist of parent-daughter double-stranded DNA
methylation patterns with each pattern coming from a single cell and
represented as an unordered pair of binary strings. The data are technically
difficult and time-consuming to collect, putting a premium on an efficient
inference method. Our aim is to estimate rates for the maintenance and de novo
methylation events that gave rise to the observed patterns, while accounting
for measurement error. We model data at multiple sites jointly, thus using
whole-strand information, and considerably reduce confounding between
parameters. We also adopt a hierarchical structure that allows for variation in
rates across sites without an explosion in the effective number of parameters.
Our context-specific priors capture the expected stationarity, or
near-stationarity, of the stochastic process that generated the data analyzed
here. This expected stationarity is shown to greatly increase the precision of
the estimation. Applying our model to a data set collected at the human FMR1
locus, we find that measurement errors, generally ignored in similar studies,
occur at a nontrivial rate (inappropriate bisulfite conversion error: 1.6%
with 80% CI: 0.9%--2.3%). Accounting for these errors has a substantial
impact on estimates of key biological parameters. The estimated average failure
of maintenance rate and daughter de novo rate decline from 0.04 to 0.024 and
from 0.14 to 0.07, respectively, when errors are accounted for. Our results
also provide evidence that de novo events may occur on both parent and daughter
strands: the median parent and daughter de novo rates are 0.08 (80% CI:
0.04--0.13) and 0.07 (80% CI: 0.04--0.11), respectively. Comment: Published at
http://dx.doi.org/10.1214/09-AOAS297 in the Annals of Applied Statistics
(http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics
(http://www.imstat.org)
A comparative genomics multitool for scientific discovery and conservation
The Zoonomia Project is investigating the genomics of shared and specialized traits in eutherian mammals. Here we provide genome assemblies for 131 species, of which all but 9 are previously uncharacterized, and describe a whole-genome alignment of 240 species of considerable phylogenetic diversity, comprising representatives from more than 80% of mammalian families. We find that regions of reduced genetic diversity are more abundant in species at a high risk of extinction, discern signals of evolutionary selection at high resolution and provide insights from individual reference genomes. By prioritizing phylogenetic diversity and making data available quickly and without restriction, the Zoonomia Project aims to support biological discovery, medical research and the conservation of biodiversity
Analysis of the Human Mucosal Response to Cholera Reveals Sustained Activation of Innate Immune Signaling Pathways
To better understand the innate immune response to Vibrio cholerae infection, we tracked gene expression in the duodenal mucosa of 11 Bangladeshi adults with cholera, using biopsy specimens obtained immediately after rehydration and 30 and 180 days later. We identified differentially expressed genes and performed an analysis to predict differentially regulated pathways and upstream regulators. During acute cholera, there was a broad increase in the expression of genes associated with innate immunity, including activation of the NF-kappaB, mitogen-activated protein kinase (MAPK), and Toll-like receptor (TLR)-mediated signaling pathways, which, unexpectedly, persisted even 30 days after infection. Focusing on early differences in gene expression, we identified 37 genes that were differentially expressed on days 2 and 30 across the 11 participants. These genes included the endosomal Toll-like receptor gene TLR8, which was expressed in lamina propria cells. Underscoring a potential role for endosomal TLR-mediated signaling in vivo, our pathway analysis found that interferon regulatory factor 7 and beta 1 and alpha 2 interferons were among the top upstream regulators activated during cholera. Among the innate immune effectors, we found that the gene for DUOX2, an NADPH oxidase involved in the maintenance of intestinal homeostasis, was upregulated in intestinal epithelial cells during cholera. Notably, the observed increases in DUOX2 and TLR8 expression were also modeled in vitro when Caco-2 or THP-1 cells, respectively, were stimulated with live V. cholerae but not with heat-killed organisms or cholera toxin alone. These previously unidentified features of the innate immune response to V. cholerae extend our understanding of the mucosal immune signaling pathways and effectors activated in vivo following cholera
Combining Citizen Science and Genomics to Investigate Tick, Pathogen, and Commensal Microbiome at Single-Tick Resolution
The prevalence of tickborne diseases worldwide is increasing virtually unchecked due to the lack of effective control strategies. The transmission dynamics of tickborne pathogens are influenced by the tick microbiome, tick co-infection with other pathogens, and environmental features. Understanding this complex system could lead to new strategies for pathogen control, but will require large-scale, high-resolution data. Here, we introduce Project Acari, a citizen science-based project to assay, at single-tick resolution, species, pathogen infection status, microbiome profile, and environmental conditions of tens of thousands of ticks collected from numerous sites across the United States. In the first phase of the project, we collected more than 2,400 ticks wild-caught by citizen scientists and developed high-throughput methods to process and sequence them individually. Applying these methods to 192 Ixodes scapularis ticks collected in a region with a high incidence of Lyme disease, we found that 62% were colonized by Borrelia burgdorferi, the Lyme disease pathogen. In contrast to previous reports, we did not find an association between the microbiome diversity of a tick and its probability of carrying B. burgdorferi. However, we did find undescribed associations between B. burgdorferi carriage and the presence of specific microbial taxa within individual ticks. Our findings underscore the power of coupling citizen science with high-throughput processing to reveal pathogen dynamics. Our approach can be extended for massively parallel screening of individual ticks, offering a powerful tool to elucidate the ecology of tickborne disease and to guide pathogen-control initiatives
Asymmetric Strand Segregation: Epigenetic Costs of Genetic Fidelity?
Asymmetric strand segregation has been proposed as a mechanism to minimize effective mutation rates in epithelial tissues. Under asymmetric strand segregation, the double-stranded molecule that contains the oldest DNA strand is preferentially targeted to the somatic stem cell after each round of DNA replication. This oldest DNA strand is expected to have fewer errors than younger strands because some of the errors that arise on daughter strands during their synthesis fail to be repaired. Empirical findings suggest the possibility of asymmetric strand segregation in a subset of mammalian cell lineages, indicating that it may indeed function to increase genetic fidelity. However, the implications of asymmetric strand segregation for the fidelity of epigenetic information remain unexplored. Here, I explore the impact of strand-segregation dynamics on epigenetic fidelity using a mathematical-modelling approach that draws on the known molecular mechanisms of DNA methylation and existing rate estimates from empirical methylation data. I find that, for a wide range of starting methylation densities, asymmetric, but not symmetric, strand segregation leads to systematic increases in methylation levels if parent strands are subject to de novo methylation events. I conclude that epigenetic fidelity can be compromised when enhanced genetic fidelity is achieved through asymmetric strand segregation. Strand segregation dynamics could thus explain the increased DNA methylation densities that are observed in structured cellular populations during aging and in disease
Statistical Inference of In Vivo Properties of Human DNA Methyltransferases from Double-Stranded Methylation Patterns
DNA methyltransferases establish methylation patterns in cells and transmit these patterns over cell generations, thereby influencing each cell's epigenetic states. Three primary DNA methyltransferases have been identified in mammals: DNMT1, DNMT3A and DNMT3B. Extensive in vitro studies have investigated key properties of these enzymes, namely their substrate specificity and processivity. Here we study these properties in vivo, by applying novel statistical analysis methods to double-stranded DNA methylation patterns collected using hairpin-bisulfite PCR. Our analysis fits a novel Hidden Markov Model (HMM) to the observed data, allowing for potential bisulfite conversion errors, and yields statistical estimates of parameters that quantify enzyme processivity and substrate specificity. We apply this model to methylation patterns established in vivo at three loci in humans: two densely methylated inactive X (Xi)-linked loci (FMR1 and G6PD), and an autosomal locus (LEP), where methylation densities are tissue-specific but moderate. We find strong evidence for a high level of processivity of DNMT1 at FMR1 and G6PD, with the mean association tract length being a few hundred base pairs. Regardless of tissue type, methylation patterns at LEP are dominated by DNMT1 maintenance events, similar to the two Xi-linked loci, but are insufficiently informative to draw any conclusions about processivity at that locus. At all three loci we find that DNMT1 shows a strong preference for adding methyl groups to hemi-methylated CpG sites over unmethylated sites. The data at all three loci also suggest low (possibly 0) association of the de novo methyltransferases, the DNMT3s, and are consequently uninformative about processivity or preference of these enzymes. We also extend our HMM to reanalyze published data on mouse DNMT1 activities in vitro. The results suggest shorter association tracts (and hence weaker processivity), and much longer non-association tracts than human DNMT1 in vivo
Testing the FMR1 Promoter for Mosaicism in DNA Methylation among CpG Sites, Strands, and Cells in FMR1-Expressing Males with Fragile X Syndrome
Variability among individuals in the severity of fragile X syndrome (FXS) is influenced by epigenetic methylation mosaicism, which may also be common in other complex disorders. The epigenetic signal of dense promoter DNA methylation is usually associated with gene silencing, as was initially reported for FMR1 alleles in individuals with FXS. A paradox arose when significant levels of FMR1 mRNA were reported for some males with FXS who had been reported to have predominately methylated alleles. We have used hairpin-bisulfite PCR, validated with molecular batch-stamps and barcodes, to collect and assess double-stranded DNA methylation patterns from these previously studied males. These patterns enable us to distinguish among three possible forms of methylation mosaicism, any one of which could explain FMR1 expression in these males. Our data indicate that cryptic inter-cell mosaicism in DNA methylation can account for the presence of FMR1 mRNA in some individuals with FXS
Transient exposure to low levels of insecticide affects metabolic networks of honeybee larvae
The survival of a species depends on its capacity to adjust to changing environmental conditions and new stressors. Such new, anthropogenic stressors include the neonicotinoid class of crop-protecting agents, which have been implicated in the population declines of pollinating insects, including honeybees (Apis mellifera). The low-dose effects of these compounds on larval development and physiological responses have remained largely unknown. Over a period of 15 days, we provided syrup tainted with low levels (2 µg/L) of the neonicotinoid insecticide imidacloprid to beehives located in the field. We measured transcript levels by RNA sequencing and established lipid profiles using liquid chromatography coupled with mass spectrometry from worker-bee larvae of imidacloprid-exposed (IE) and unexposed, control (C) hives. Within a catalogue of 300 differentially expressed transcripts in larvae from IE hives, we detect significant enrichment of genes functioning in lipid-carbohydrate-mitochondrial metabolic networks. A Myc-involved transcriptional response to exposure to this neonicotinoid is indicated by overrepresentation of E-box elements in the promoter regions of genes with altered expression. RNA levels for a cluster of genes encoding detoxifying P450 enzymes are elevated, with coordinated downregulation of genes in glycolytic and sugar-metabolising pathways. Expression of the environmentally responsive Hsp90 gene is also reduced, suggesting diminished buffering and stability of the developmental program. The multifaceted, physiological response described here may be of importance to our general understanding of pollinator health. Muscles, for instance, work at high glycolytic rates and flight performance could be impacted should low levels of this evolutionarily novel stressor likewise induce downregulation of energy metabolising genes in adult pollinators
A comparative genomics multitool for scientific discovery and conservation
A whole-genome alignment of 240 phylogenetically diverse species of eutherian mammal, including 131 previously uncharacterized species, from the Zoonomia Project provides data that support biological discovery, medical research and conservation. The Zoonomia Project is investigating the genomics of shared and specialized traits in eutherian mammals. Here we provide genome assemblies for 131 species, of which all but 9 are previously uncharacterized, and describe a whole-genome alignment of 240 species of considerable phylogenetic diversity, comprising representatives from more than 80% of mammalian families. We find that regions of reduced genetic diversity are more abundant in species at a high risk of extinction, discern signals of evolutionary selection at high resolution and provide insights from individual reference genomes. By prioritizing phylogenetic diversity and making data available quickly and without restriction, the Zoonomia Project aims to support biological discovery, medical research and the conservation of biodiversity