111 research outputs found

    Genome-Scale Oscillations in DNA Methylation during Exit from Pluripotency

    Get PDF
    Pluripotency is accompanied by the erasure of parental epigenetic memory, with naive pluripotent cells exhibiting global DNA hypomethylation both in vitro and in vivo. Exit from pluripotency and priming for differentiation into somatic lineages is associated with genome-wide de novo DNA methylation. We show that during this phase, co-expression of enzymes required for DNA methylation turnover, DNMT3s and TETs, promotes cell-to-cell variability in this epigenetic mark. Using a combination of single- cell sequencing and quantitative biophysical modeling, we show that this variability is associated with coherent, genome-scale oscillations in DNA methylation with an amplitude dependent on CpG density. Analysis of parallel single-cell transcriptional and epigenetic profiling provides evidence for oscillatory dynamics both in vitro and in vivo. These observations provide insights into the emergence of epigenetic heterogeneity during early embryo development, indicating that dynamic changes in DNA methylation might influence early cell fate decisions

    Phronesis and Automated Science: The Case of Machine Learning and Biology

    Get PDF
    The applications of machine learning (ML) and deep learning to the natural sciences has fostered the idea that the automated nature of algorithmic analysis will gradually dispense human beings from scientific work. In this paper, I will show that this view is problematic, at least when ML is applied to biology. In particular, I will claim that ML is not independent of human beings and cannot form the basis of automated science. Computer scientists conceive their work as being a case of Aristotle’s poiesis perfected by techne, which can be reduced to a number of straightforward rules and technical knowledge. I will show a number of concrete cases where at each level of computational analysis, more is required to ML than just poiesis and techne, and that the work of ML practitioners in biology needs also the cultivation of something analogous to phronesis, which cannot be automated. But even if we knew how to frame phronesis into rules (which is inconsistent with its own definition), still this virtue is deeply entrenched in our biological constitution, which computers lack. Whether computers can fully perform scientific practice (which is the result of the way we are cognitively and biologically) independently of humans (and their cognitive and biological specificities) is an ill-posed question

    PlasFlow: predicting plasmid sequences in metagenomic data using genome signatures

    Get PDF
    Plasmids are mobile genetics elements that play an important role in the environmental adaptation of microorganisms. Although plasmids are usually analyzed in cultured microorganisms, there is a need for methods that allow for the analysis of pools of plasmids (plasmidomes) in environmental samples. To that end, several molecular biology and bioinformatics methods have been developed; however, they are limited to environments with low diversity and cannot recover large plasmids. Here, we present PlasFlow, a novel tool based on genomic signatures that employs a neural network approach for identification of bacterial plasmid sequences in environmental samples. PlasFlow can recover plasmid sequences from assembled metagenomes without any prior knowledge of the taxonomical or functional composition of samples with an accuracy up to 96%. It can also recover sequences of both circular and linear plasmids and can perform initial taxonomical classification of sequences. Compared to other currently available tools, PlasFlow demonstrated significantly better performance on test datasets. Analysis of two samples from heavy metal-contaminated microbial mats revealed that plasmids may constitute an important fraction of their metagenomes and carry genes involved in heavy-metal homeostasis, proving the pivotal role of plasmids in microorganism adaptation to environmental conditions

    DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning.

    Get PDF
    Recent technological advances have enabled DNA methylation to be assayed at single-cell resolution. However, current protocols are limited by incomplete CpG coverage and hence methods to predict missing methylation states are critical to enable genome-wide analyses. We report DeepCpG, a computational approach based on deep neural networks to predict methylation states in single cells. We evaluate DeepCpG on single-cell methylation data from five cell types generated using alternative sequencing protocols. DeepCpG yields substantially more accurate predictions than previous methods. Additionally, we show that the model parameters can be interpreted, thereby providing insights into how sequence composition affects methylation variability

    Improving genomic prediction accuracy for meat tenderness in Nellore cattle using artificial neural networks.

    Get PDF
    The goal of this study was to compare the predictive performance of artificial neural networks (ANNs) with Bayesian ridge regression, Bayesian Lasso, Bayes A, Bayes B and Bayes Cπ in estimating genomic breeding values for meat tenderness in Nellore cattle. The animals were genotyped with the Illumina Bovine HD Bead Chip (HD, 777K from 90 samples) and the GeneSeek Genomic Profiler (GGP Indicus HD, 77K from 485 samples). The quality control for the genotypes was applied on each Chip and comprised removal of SNPs located on non‐autosomal chromosomes, with minor allele frequency 0.8. The FImpute program was used for genotype imputation. Pedigree‐based analyses indicated that meat tenderness is moderately heritable (0.35), indicating that it can be improved by direct selection. Prediction accuracies were very similar across the Bayesian regression models, ranging from 0.20 (Bayes A) to 0.22 (Bayes B) and 0.14 (Bayes Cπ) to 0.19 (Bayes A) for the additive and dominance effects, respectively. ANN achieved the highest accuracy (0.33) of genomic prediction of genetic merit. Even though deep neural networks are recognized to deliver more accurate predictions, in our study ANN with one single hidden layer, 105 neurons and rectified linear unit (ReLU) activation function was sufficient to increase the prediction of genetic merit for meat tenderness. These results indicate that an ANN with relatively simple architecture can provide superior genomic predictions for meat tenderness in Nellore cattl

    Genome-wide analysis of differential transcriptional and epigenetic variability across human immune cell types

    Get PDF
    Abstract Background A healthy immune system requires immune cells that adapt rapidly to environmental challenges. This phenotypic plasticity can be mediated by transcriptional and epigenetic variability. Results We apply a novel analytical approach to measure and compare transcriptional and epigenetic variability genome-wide across CD14+CD16− monocytes, CD66b+CD16+ neutrophils, and CD4+CD45RA+ naïve T cells from the same 125 healthy individuals. We discover substantially increased variability in neutrophils compared to monocytes and T cells. In neutrophils, genes with hypervariable expression are found to be implicated in key immune pathways and are associated with cellular properties and environmental exposure. We also observe increased sex-specific gene expression differences in neutrophils. Neutrophil-specific DNA methylation hypervariable sites are enriched at dynamic chromatin regions and active enhancers. Conclusions Our data highlight the importance of transcriptional and epigenetic variability for the key role of neutrophils as the first responders to inflammatory stimuli. We provide a resource to enable further functional studies into the plasticity of immune cells, which can be accessed from: http://blueprint-dev.bioinfo.cnio.es/WP10/hypervariability

    Statistical and integrative system-level analysis of DNA methylation data

    Get PDF
    Epigenetics plays a key role in cellular development and function. Alterations to the epigenome are thought to capture and mediate the effects of genetic and environmental risk factors on complex disease. Currently, DNA methylation is the only epigenetic mark that can be measured reliably and genome-wide in large numbers of samples. This Review discusses some of the key statistical challenges and algorithms associated with drawing inferences from DNA methylation data, including cell-type heterogeneity, feature selection, reverse causation and system-level analyses that require integration with other data types such as gene expression, genotype, transcription factor binding and other epigenetic information
    corecore