140 research outputs found

    Genome-wide analysis of differential transcriptional and epigenetic variability across human immune cell types

    Get PDF
    Abstract Background A healthy immune system requires immune cells that adapt rapidly to environmental challenges. This phenotypic plasticity can be mediated by transcriptional and epigenetic variability. Results We apply a novel analytical approach to measure and compare transcriptional and epigenetic variability genome-wide across CD14+CD16− monocytes, CD66b+CD16+ neutrophils, and CD4+CD45RA+ naïve T cells from the same 125 healthy individuals. We discover substantially increased variability in neutrophils compared to monocytes and T cells. In neutrophils, genes with hypervariable expression are found to be implicated in key immune pathways and are associated with cellular properties and environmental exposure. We also observe increased sex-specific gene expression differences in neutrophils. Neutrophil-specific DNA methylation hypervariable sites are enriched at dynamic chromatin regions and active enhancers. Conclusions Our data highlight the importance of transcriptional and epigenetic variability for the key role of neutrophils as the first responders to inflammatory stimuli. We provide a resource to enable further functional studies into the plasticity of immune cells, which can be accessed from: http://blueprint-dev.bioinfo.cnio.es/WP10/hypervariability

    Genome-Scale Oscillations in DNA Methylation during Exit from Pluripotency

    Get PDF
    Pluripotency is accompanied by the erasure of parental epigenetic memory, with naive pluripotent cells exhibiting global DNA hypomethylation both in vitro and in vivo. Exit from pluripotency and priming for differentiation into somatic lineages is associated with genome-wide de novo DNA methylation. We show that during this phase, co-expression of enzymes required for DNA methylation turnover, DNMT3s and TETs, promotes cell-to-cell variability in this epigenetic mark. Using a combination of single- cell sequencing and quantitative biophysical modeling, we show that this variability is associated with coherent, genome-scale oscillations in DNA methylation with an amplitude dependent on CpG density. Analysis of parallel single-cell transcriptional and epigenetic profiling provides evidence for oscillatory dynamics both in vitro and in vivo. These observations provide insights into the emergence of epigenetic heterogeneity during early embryo development, indicating that dynamic changes in DNA methylation might influence early cell fate decisions

    Deep Learning for Predicting Refractive Error From Retinal Fundus Images

    Get PDF
    PURPOSE. We evaluate how deep learning can be applied to extract novel information such as refractive error from retinal fundus imaging. METHODS. Retinal fundus images used in this study were 45- and 30-degree field of view images from the UK Biobank and Age-Related Eye Disease Study (AREDS) clinical trials, respectively. Refractive error was measured by autorefraction in UK Biobank and subjective refraction in AREDS. We trained a deep learning algorithm to predict refractive error from a total of 226,870 images and validated it on 24,007 UK Biobank and 15,750 AREDS images. Our model used the ‘‘attention’’ method to identify features that are correlated with refractive error. RESULTS. The resulting algorithm had a mean absolute error (MAE) of 0.56 diopters (95% confidence interval [CI]: 0.55–0.56) for estimating spherical equivalent on the UK Biobank data set and 0.91 diopters (95% CI: 0.89–0.93) for the AREDS data set. The baseline expected MAE (obtained by simply predicting the mean of this population) was 1.81 diopters (95% CI: 1.79–1.84) for UK Biobank and 1.63 (95% CI: 1.60–1.67) for AREDS. Attention maps suggested that the foveal region was one of the most important areas used by the algorithm to make this prediction, though other regions also contribute to the prediction. CONCLUSIONS. To our knowledge, the ability to estimate refractive error with high accuracy from retinal fundus photos has not been previously known and demonstrates that deep learning can be applied to make novel predictions from medical images

    Phronesis and Automated Science: The Case of Machine Learning and Biology

    Get PDF
    The applications of machine learning (ML) and deep learning to the natural sciences has fostered the idea that the automated nature of algorithmic analysis will gradually dispense human beings from scientific work. In this paper, I will show that this view is problematic, at least when ML is applied to biology. In particular, I will claim that ML is not independent of human beings and cannot form the basis of automated science. Computer scientists conceive their work as being a case of Aristotle’s poiesis perfected by techne, which can be reduced to a number of straightforward rules and technical knowledge. I will show a number of concrete cases where at each level of computational analysis, more is required to ML than just poiesis and techne, and that the work of ML practitioners in biology needs also the cultivation of something analogous to phronesis, which cannot be automated. But even if we knew how to frame phronesis into rules (which is inconsistent with its own definition), still this virtue is deeply entrenched in our biological constitution, which computers lack. Whether computers can fully perform scientific practice (which is the result of the way we are cognitively and biologically) independently of humans (and their cognitive and biological specificities) is an ill-posed question

    Improving genomic prediction accuracy for meat tenderness in Nellore cattle using artificial neural networks.

    Get PDF
    The goal of this study was to compare the predictive performance of artificial neural networks (ANNs) with Bayesian ridge regression, Bayesian Lasso, Bayes A, Bayes B and Bayes Cπ in estimating genomic breeding values for meat tenderness in Nellore cattle. The animals were genotyped with the Illumina Bovine HD Bead Chip (HD, 777K from 90 samples) and the GeneSeek Genomic Profiler (GGP Indicus HD, 77K from 485 samples). The quality control for the genotypes was applied on each Chip and comprised removal of SNPs located on non‐autosomal chromosomes, with minor allele frequency 0.8. The FImpute program was used for genotype imputation. Pedigree‐based analyses indicated that meat tenderness is moderately heritable (0.35), indicating that it can be improved by direct selection. Prediction accuracies were very similar across the Bayesian regression models, ranging from 0.20 (Bayes A) to 0.22 (Bayes B) and 0.14 (Bayes Cπ) to 0.19 (Bayes A) for the additive and dominance effects, respectively. ANN achieved the highest accuracy (0.33) of genomic prediction of genetic merit. Even though deep neural networks are recognized to deliver more accurate predictions, in our study ANN with one single hidden layer, 105 neurons and rectified linear unit (ReLU) activation function was sufficient to increase the prediction of genetic merit for meat tenderness. These results indicate that an ANN with relatively simple architecture can provide superior genomic predictions for meat tenderness in Nellore cattl

    Statistical and integrative system-level analysis of DNA methylation data

    Get PDF
    Epigenetics plays a key role in cellular development and function. Alterations to the epigenome are thought to capture and mediate the effects of genetic and environmental risk factors on complex disease. Currently, DNA methylation is the only epigenetic mark that can be measured reliably and genome-wide in large numbers of samples. This Review discusses some of the key statistical challenges and algorithms associated with drawing inferences from DNA methylation data, including cell-type heterogeneity, feature selection, reverse causation and system-level analyses that require integration with other data types such as gene expression, genotype, transcription factor binding and other epigenetic information

    Discriminative modelling of context-specific amino acid substitution probabilities.

    No full text
    Motivation: Protein sequence searching and alignment are fundamental tools of modern biology. Alignments are assessed using their similarity scores, essentially the sum of substitution matrix scores over all pairs of aligned amino acids. We previously proposed a generative probabilistic method that yields scores that take the sequence context around each aligned residue into account. This method showed drastically improved sensitivity and alignment quality compared with standard substitution matrix-based alignment. Results: Here, we develop an alternative discriminative approach to predict sequence context-specific substitution scores. We applied our approach to compute context-specific sequence profiles for Basic Local Alignment Search Tool (BLAST) and compared the new tool (CS-BLASTdis) to BLAST and the previous context-specific version (CS-BLASTgen). On a dataset filtered to 20% maximum sequence identity, CS-BLASTdisis was 51% more sensitive than BLAST and 17% more sensitive than CS-BLASTgenin, detecting remote homologues at 10% false discovery rate. At 30% maximum sequence identity, its alignments contain 21 and 12% more correct residue pairs than those of BLAST and CS-BLASTgen, respectively. Clear improvements are also seen when the approach is combined with PSI-BLAST and HHblits. We believe the context-specific approach should replace substitution matrices wherever sensitivity and alignment quality are critical. Availability: Source code (GNU General Public License, version 3) and benchmark data are available at ftp://toolkit.genzentrum.lmu.de/pub/csblast/
    corecore