85 research outputs found

    Application of machine learning and deep neural networks for spatial prediction of groundwater nitrate concentration to improve land use management practices

    Get PDF
    The prediction of groundwater nitrate concentration\u27s response to geo-environmental and human-influenced factors is essential to better restore groundwater quality and improve land use management practices. In this paper, we regionalize groundwater nitrate concentration using different machine learning methods (Random forest (RF), unimodal 2D and 3D convolutional neural networks (CNN), and multi-stream early and late fusion 2D-CNNs) so that the nitrate situation in unobserved areas can be predicted. CNNs take into account not only the nitrate values of the grid cells of the observation wells but also the values around them. This has the added benefit of allowing them to learn directly about the influence of the surroundings. The predictive performance of the models was tested on a dataset from a pilot region in Germany, and the results show that, in general, all the machine learning models, after a Bayesian optimization hyperparameter search and training, achieve good spatial predictive performance compared to previous studies based on Kriging and numerical models. Based on the mean absolute error (MAE), the random forest model and the 2DCNN late fusion model performed best with an MAE (STD) of 9.55 (0.367) mg/l, R2 = 0.43 and 10.32 (0.27) mg/l, R2 = 0.27, respectively. The 3DCNN with an MAE (STD) of 11.66 (0.21) mg/l and largest resources consumption is the worst performing model. Feature importance learning from the models was used in conjunction with partial dependency analysis of the most important features to gain greater insight into the major factors explaining the nitrate spatial variability. Large uncertainties in nitrate prediction have been shown in previous studies. Therefore, the models were extended to quantify uncertainty using prediction intervals (PIs) derived from bootstrapping. Knowledge of uncertainty helps the water manager reduce risk and plan more reliably

    Predicative model for investigation high-temperature evaporation processes of water droplets

    Get PDF
    A predicative model has been developed to study the processes of high-temperature heating and evaporation of water droplets, taking into account the main interconnected processes of heat transfer and phase transformations. Typical profiles of temperature and velocity in the system "high-temperature gases - a droplet of water" are established when the temperature of the external gas medium varies from 100 to 800 0С. Various statements of the problem are considered, which differ substantially in the type of processes and factors taken into account. The analysis of temperature regimes of heating and evaporation of water droplets is carried out, at which simplified models can be used, and in which all complex interrelated processes of heat and mass transfer (including convective, conductive and radiant heat transfer in droplets, and also in the near-surface vapor-gas layer) need to be taken into account

    Systematic permutation testing in GWAS pathway analyses: identification of genetic networks in dilated cardiomyopathy and ulcerative colitis

    Get PDF
    Background: Genome wide association studies (GWAS) are applied to identify genetic loci, which are associated with complex traits and human diseases. Analogous to the evolution of gene expression analyses, pathway analyses have emerged as important tools to uncover functional networks of genome-wide association data. Usually, pathway analyses combine statistical methods with a priori available biological knowledge. To determine significance thresholds for associated pathways, correction for multiple testing and over-representation permutation testing is applied. Results: We systematically investigated the impact of three different permutation test approaches for over-representation analysis to detect false positive pathway candidates and evaluate them on genome-wide association data of Dilated Cardiomyopathy (DCM) and Ulcerative Colitis (UC). Our results provide evidence that the gold standard - permuting the case–control status – effectively improves specificity of GWAS pathway analysis. Although permutation of SNPs does not maintain linkage disequilibrium (LD), these permutations represent an alternative for GWAS data when case–control permutations are not possible. Gene permutations, however, did not add significantly to the specificity. Finally, we provide estimates on the required number of permutations for the investigated approaches. Conclusions: To discover potential false positive functional pathway candidates and to support the results from standard statistical tests such as the Hypergeometric test, permutation tests of case control data should be carried out. The most reasonable alternative was case–control permutation, if this is not possible, SNP permutations may be carried out. Our study also demonstrates that significance values converge rapidly with an increasing number of permutations. By applying the described statistical framework we were able to discover axon guidance, focal adhesion and calcium signaling as important DCM-related pathways and Intestinal immune network for IgA production as most significant UC pathway

    Marathon-Induced Cardiac Strain as Model for the Evaluation of Diagnostic microRNAs for Acute Myocardial Infarction

    Get PDF
    Background: The current gold standard biomarker for myocardial infarction (MI), cardiac troponin (cTn), is recognized for its high sensitivity and organ specificity; however, it lacks diagnostic specificity. Numerous studies have introduced circulating microRNAs as potential biomarkers for MI. This study investigates the MI-specificity of these serum microRNAs by investigating myocardial stress/injury due to strenuous exercise. Methods: MicroRNA biomarkers were retrieved by compre hensive review of 109 publications on diagnostic serum microRNAs for MI. MicroRNA levels were first measured by next-generation sequencing in pooled sera from runners (n = 46) before and after conducting a full competitive marathon. Hereafter, reverse transcription quantitative real-time PCR (qPCR) of 10 selected serum microRNAs in 210 marathon runners was performed (>10,000 qPCR measurements). Results: 27 potential diagnostic microRNA for MI were retrieved by the literature review. Eight microRNAs (miR-1-3p, miR-21-5p, miR-26a-5p, miR-122-5p, miR-133a-3p, miR-142-5p, miR-191-5p, miR-486-3p) showed positive correlations with cTnT in marathon runners, whereas two miRNAs (miR-134-5p and miR-499a-5p) showed no correlations. Upregulation of miR-133a-3p (p = 0.03) and miR-142-5p (p = 0.01) went along with elevated cTnT after marathon. Conclusion: Some MI-associated microRNAs (e.g., miR-133a-3p and miR-142-5p) have similar kinetics under strenuous exercise and MI as compared to cTnT, which suggests that their diagnostic specificity could be lim ited. In contrast, several MI-associated microRNAs (miR-26a-5p, miR-134-5p, miR-191-5p) showed different release behavior; hence, combining cTnT with these microRNAs within a multi-marker strategy may add diagnostic accuracy in MI

    Marathon-Induced Cardiac Strain as Model for the Evaluation of Diagnostic microRNAs for Acute Myocardial Infarction

    Full text link
    Background: The current gold standard biomarker for myocardial infarction (MI), cardiac troponin (cTn), is recognized for its high sensitivity and organ specificity; however, it lacks diagnostic specificity. Numerous studies have introduced circulating microRNAs as potential biomarkers for MI. This study investigates the MI-specificity of these serum microRNAs by investigating myocardial stress/injury due to strenuous exercise. Methods: MicroRNA biomarkers were retrieved by comprehensive review of 109 publications on diagnostic serum microRNAs for MI. MicroRNA levels were first measured by next-generation sequencing in pooled sera from runners (n = 46) before and after conducting a full competitive marathon. Hereafter, reverse transcription quantitative real-time PCR (qPCR) of 10 selected serum microRNAs in 210 marathon runners was performed (>10,000 qPCR measurements). Results: 27 potential diagnostic microRNA for MI were retrieved by the literature review. Eight microRNAs (miR-1-3p, miR-21-5p, miR-26a-5p, miR-122-5p, miR-133a-3p, miR-142-5p, miR-191-5p, miR-486-3p) showed positive correlations with cTnT in marathon runners, whereas two miRNAs (miR-134-5p and miR-499a-5p) showed no correlations. Upregulation of miR-133a-3p (p = 0.03) and miR-142-5p (p = 0.01) went along with elevated cTnT after marathon. Conclusion: Some MI-associated microRNAs (e.g., miR-133a-3p and miR-142-5p) have similar kinetics under strenuous exercise and MI as compared to cTnT, which suggests that their diagnostic specificity could be limited. In contrast, several MI-associated microRNAs (miR-26a-5p, miR-134-5p, miR-191-5p) showed different release behavior; hence, combining cTnT with these microRNAs within a multi-marker strategy may add diagnostic accuracy in MI

    Energy Metabolites as Biomarkers in Ischemic and Dilated Cardiomyopathy

    Get PDF
    With more than 25 million people affected, heart failure (HF) is a global threat. As energy production pathways are known to play a pivotal role in HF, we sought here to identify key metabolic changes in ischemic- and non-ischemic HF by using a multi-OMICS approach. Serum metabolites and mRNAseq and epigenetic DNA methylation profiles were analyzed from blood and left ventricular heart biopsy specimens of the same individuals. In total we collected serum from n = 82 patients with Dilated Cardiomyopathy (DCM) and n = 51 controls in the screening stage. We identified several metabolites involved in glycolysis and citric acid cycle to be elevated up to 5.7-fold in DCM (p = 1.7 × 10−6 ). Interestingly, cardiac mRNA and epigenetic changes of genes encoding rate-limiting enzymes of these pathways could also be found and validated in our second stage of metabolite assessment in n = 52 DCM, n = 39 ischemic HF and n = 57 controls. In conclusion, we identified a new set of metabolomic biomarkers for HF. We were able to identify underlying biological cascades that potentially represent suitable intervention targets

    Genomic structural variations lead to dysregulation of important coding and non-coding RNA species in dilated cardiomyopathy

    Get PDF
    The transcriptome needs to be tightly regulated by mechanisms that include transcription factors, enhancers, and repressors as well as non-coding RNAs. Besides this dynamic regulation, a large part of phenotypic variability of eukaryotes is expressed through changes in gene transcription caused by genetic variation. In this study, we evaluate genome-wide structural genomic variants (SVs) and their association with gene expression in the human heart. We detected 3,898 individual SVs affecting all classes of gene transcripts (e.g., mRNA, miRNA, lncRNA) and regulatory genomic regions (e.g., enhancer or TFBS). In a cohort of patients (n = 50) with dilated cardiomyopathy (DCM), 80,635 non-protein-coding elements of the genome are deleted or duplicated by SVs, containing 3,758 long non-coding RNAs and 1,756 protein-coding transcripts. 65.3% of the SV-eQTLs do not harbor a significant SNV-eQTL, and for the regions with both classes of association, we find similar effect sizes. In case of deleted protein-coding exons, we find downregulation of the associated transcripts, duplication events, however, do not show significant changes over all events. In summary, we are first to describe the genomic variability associated with SVs in heart failure due to DCM and dissect their impact on the transcriptome. Overall, SVs explain up to 7.5% of the variation of cardiac gene expression, underlining the importance to study human myocardial gene expression in the context of the individual genome. This has immediate implications for studies on basic mechanisms of cardiac maladaptation, biomarkers, and (gene) therapeutic studies alike

    Breakdown of the Endothelial Barrier Function in Tumor Cell Transmigration

    Get PDF
    The ability of tumor cells to metastasize is associated with a poor prognosis for cancer. During the process of metastasis, tumor cells circulating in the blood or lymph vessels can adhere to, and potentially transmigrate through, the endothelium and invade the connective tissue. We studied the effectiveness of the endothelium as a barrier against the invasion of 51 tumor cell lines into a three-dimensional collagen matrix. Only nine tumor cell lines showed attenuated invasion in the presence of an endothelial cell monolayer, whereas 17 cell lines became invasive or showed a significantly increased invasion. Endothelial cells cocultured with invasive tumor cells increased chemokine gene expression of IL-8 and Gro-β. Expression of the IL-8 and Gro-β receptor, CXCR2, was upregulated in invasive tumor cells. Addition of IL-8 or Gro-β increased tumor cell invasiveness by more than twofold. Tumor cell variants selected for high CXCR2 expression were fourfold more invasive in the presence of an endothelial cell layer, whereas CXCR2 siRNA knock-down cells were fivefold less invasive. We demonstrate that Gro-β and IL-8 secreted by endothelial cells, together with CXCR2 receptor expression on invasive tumor cells, contribute to the breakdown of the endothelial barrier by enhancing tumor cell force generation and cytoskeletal remodeling dynamics

    Atlas of the clinical genetics of human dilated cardiomyopathy

    Get PDF
    [Abstract] Aim. Numerous genes are known to cause dilated cardiomyopathy (DCM). However, until now technological limitations have hindered elucidation of the contribution of all clinically relevant disease genes to DCM phenotypes in larger cohorts. We now utilized next-generation sequencing to overcome these limitations and screened all DCM disease genes in a large cohort. Methods and results. In this multi-centre, multi-national study, we have enrolled 639 patients with sporadic or familial DCM. To all samples, we applied a standardized protocol for ultra-high coverage next-generation sequencing of 84 genes, leading to 99.1% coverage of the target region with at least 50-fold and a mean read depth of 2415. In this well characterized cohort, we find the highest number of known cardiomyopathy mutations in plakophilin-2, myosin-binding protein C-3, and desmoplakin. When we include yet unknown but predicted disease variants, we find titin, plakophilin-2, myosin-binding protein-C 3, desmoplakin, ryanodine receptor 2, desmocollin-2, desmoglein-2, and SCN5A variants among the most commonly mutated genes. The overlap between DCM, hypertrophic cardiomyopathy (HCM), and channelopathy causing mutations is considerably high. Of note, we find that >38% of patients have compound or combined mutations and 12.8% have three or even more mutations. When comparing patients recruited in the eight participating European countries we find remarkably little differences in mutation frequencies and affected genes. Conclusion. This is to our knowledge, the first study that comprehensively investigated the genetics of DCM in a large-scale cohort and across a broad gene panel of the known DCM genes. Our results underline the high analytical quality and feasibility of Next-Generation Sequencing in clinical genetic diagnostics and provide a sound database of the genetic causes of DCM.Hôpitaux de Paris; PHRC AOM0414

    Deletion of the Pichia pastoris KU70 Homologue Facilitates Platform Strain Generation for Gene Expression and Synthetic Biology

    Get PDF
    Targeted gene replacement to generate knock-outs and knock-ins is a commonly used method to study the function of unknown genes. In the methylotrophic yeast Pichia pastoris, the importance of specific gene targeting has increased since the genome sequencing projects of the most commonly used strains have been accomplished, but rapid progress in the field has been impeded by inefficient mechanisms for accurate integration. To improve gene targeting efficiency in P. pastoris, we identified and deleted the P. pastoris KU70 homologue. We observed a substantial increase in the targeting efficiency using the two commonly known and used integration loci HIS4 and ADE1, reaching over 90% targeting efficiencies with only 250-bp flanking homologous DNA. Although the ku70 deletion strain was noted to be more sensitive to UV rays than the corresponding wild-type strain, no lethality, severe growth retardation or loss of gene copy numbers could be detected during repetitive rounds of cultivation and induction of heterologous protein production. Furthermore, we demonstrated the use of the ku70 deletion strain for fast and simple screening of genes in the search of new auxotrophic markers by targeting dihydroxyacetone synthase and glycerol kinase genes. Precise knock-out strains for the well-known P. pastoris AOX1, ARG4 and HIS4 genes and a whole series of expression vectors were generated based on the wild-type platform strain, providing a broad spectrum of precise tools for both intracellular and secreted production of heterologous proteins utilizing various selection markers and integration strategies for targeted or random integration of single and multiple genes. The simplicity of targeted integration in the ku70 deletion strain will further support protein production strain generation and synthetic biology using P. pastoris strains as platform hosts
    corecore