35 research outputs found

    Data-driven ocean, atmosphere, and land parameterizations calibrated from indirect data

    No full text
    The multiscale nature of climate necessarily requires approximations within all subcomponents, and may be based on machine learning, physics or both. Our approach to make accurate and trustworthy predictions of future, never-observed climate regimes combines physics-based models with subcomponents learned from accessible data, which are often only indirectly informative about the modeled processes. We have developed a suite of model-agnostic machine learning tools to learn about subcomponent models from such data. These tools rely on ensembles of model simulations, effectively carried out on GPUs in distributed systems; and our framework for assessing uncertainty of calibrations (CalibrateEmulateSample) requires 1,000 times fewer model evaluations than traditional approaches. Our approach has produced several scientific successes, such as a unified turbulence and cloud parameterization calibrated with a library of large-eddy simulations, a neural network snow model trained from station data, and calibration-directed development of a parameterization for upper-ocean turbulence (CATKE).</p

    Additional file 15: of Stratification of candidate genes for Parkinson’s disease using weighted protein-protein interaction network analysis

    No full text
    PD-GWAS gene prioritization. Significant SNPs from PD-GWAS and the number of ORFs in LD (from r2 > 0.5 to r2 > 0.8) are shown. Part A contains significant SNPs as per joint analysis, part B contains significant SNPs as per discovery phase. The candidate genes based on proximity are summarized as suggested in the original GWAS. Newly proposed candidate genes within each are identified on the basis of our analysis of the functionally relevant proteins in the PPI network. Genes previously selected by proximity and now also confirmed by functional analysis of the PD-network are in bold font. The top pie chart represents the distribution of proteins across the different relevant processes. In the final column the cell type with major expression (> 5% of average expression) is reported as calculated from the dataset generated by Zhang et al. [13] (A = mature astrocytes, N = neurons, M = microglia, O = oligodendrocytes and, E = endothelial cells). The bottom pie chart represents the distribution of proteins based on cell type expression in human temporal lobe cortex. (JPG 349 kb

    Additional file 18: of Stratification of candidate genes for Parkinson’s disease using weighted protein-protein interaction network analysis

    No full text
    GO Annotation Frequency. Proteins whose ORF is in the LD blocks around the prioritized SNPs in the PD-GWAS have been evaluate in terms of numbers of GO annotations present in GO for that specific ORF. In red are reported the proteins that correspond to genes that we prioritized with our analysis. In grey, all the other genes are reported. For some of the loci (A) the genes we prioritized was the gene with the maximum number of GO annotations for that locus; in some other cases (B) the genes we prioritized was NOT the gene with the maximum number of GO annotations for that locus. Finally, there are also mixed cases (C). (TIFF 315 kb

    Additional file 17: of Stratification of candidate genes for Parkinson’s disease using weighted protein-protein interaction network analysis

    No full text
    Random Distribution. Distribution of the number of matches obtained in 100,000 simulated experiments in which we matched the relevant PD, process-specific, network proteins to randomly generated gene-sets of the same length as the list of ORFs in LD blocks with the top SNPS in the PD-GWAS. The distribution in blue is generated for random gene-sets of the same length as the list of ORFs in LD r2 ≥ 0.5; the distribution in red is generated for random gene-sets of the same length as the list of ORFs in LD r2 ≥ 0.8. (PDF 36 kb
    corecore