    Performance of regression models as a function of experiment noise

    A challenge in developing machine learning regression models is that it is difficult to know whether maximal performance has been reached on a particular dataset, or whether further model improvement is possible. In biology this problem is particularly pronounced because sample labels (response variables) are typically obtained through experiments and therefore carry experimental noise. Such label noise places a fundamental limit on the performance attainable by regression models. We address this challenge by deriving a theoretical upper bound for the coefficient of determination (R²) of regression models. This bound depends only on the noise associated with the response variable in a dataset and on the variance of that response variable. The upper bound estimate was validated via Monte Carlo simulations and then used as a tool to bootstrap the performance of regression models trained on biological datasets, including protein sequence data, transcriptomic data, and genomic data. Although we study biological datasets in this work, the new upper bound estimates hold for regression models from any research field or application area where response variables have associated noise.
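
The idea of a noise-limited R² can be illustrated with a small Monte Carlo sketch. This is not the derivation from the paper: it assumes additive, zero-mean Gaussian noise that is independent of the true response, and it scores a hypothetical "perfect" model that predicts the noise-free value exactly. Under those assumptions the expected R² against noisy labels is approximately 1 - (noise variance) / (variance of the observed labels). The function names and the parameter values below are illustrative choices, not taken from the source.

```python
import numpy as np


def r2_score(y_obs, y_pred):
    """Coefficient of determination of predictions against observed labels."""
    ss_res = np.sum((y_obs - y_pred) ** 2)
    ss_tot = np.sum((y_obs - np.mean(y_obs)) ** 2)
    return 1.0 - ss_res / ss_tot


def r2_upper_bound(noise_var, observed_var):
    """Approximate ceiling on R^2 when labels carry additive noise.

    Assumption: observed_var is the variance of the noisy labels,
    i.e. true-signal variance plus noise variance.
    """
    return 1.0 - noise_var / observed_var


def simulate_perfect_model_r2(n_samples=20000, true_sd=1.0, noise_sd=0.5,
                              n_trials=50, seed=0):
    """Average R^2 of a perfect predictor evaluated on noisy labels."""
    rng = np.random.default_rng(seed)
    scores = []
    for _ in range(n_trials):
        y_true = rng.normal(0.0, true_sd, n_samples)          # noise-free response
        y_obs = y_true + rng.normal(0.0, noise_sd, n_samples)  # measured labels
        scores.append(r2_score(y_obs, y_true))                 # model predicts y_true
    return float(np.mean(scores))


if __name__ == "__main__":
    true_sd, noise_sd = 1.0, 0.5
    bound = r2_upper_bound(noise_sd ** 2, true_sd ** 2 + noise_sd ** 2)
    simulated = simulate_perfect_model_r2(true_sd=true_sd, noise_sd=noise_sd)
    print(f"analytic bound: {bound:.3f}, simulated: {simulated:.3f}")
```

In this toy setting, a model scoring close to the bound leaves little room for improvement, while a model far below it may still benefit from further work.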

    A participatory physical and psychosocial intervention for balancing the demands and resources among industrial workers (PIPPI): study protocol of a cluster-randomized controlled trial

    Background: Need for recovery and work ability are strongly associated with high employee turnover, well-being and sickness absence. However, scientific knowledge on effective interventions to improve work ability and decrease need for recovery is scarce. Thus, the present study aims to describe the background, design and protocol of a cluster-randomized controlled trial evaluating the effectiveness of an intervention to reduce need for recovery and improve work ability among industrial workers. Methods/Design: A two-year cluster-randomized controlled design will be used, in which controls will also receive the intervention in year two. The aim is to cluster-randomize more than 400 workers from three companies in Denmark into intervention and control groups, with at least 200 workers (at least 9 work teams) in each group. An organizational resources audit and a subsequent action-planning workshop will be carried out to map the existing resources and act upon initiatives not functioning as intended. Workshops will be conducted to train leaders and health and safety representatives in supporting and facilitating the intervention activities. Group- and individual-level participatory visual mapping sessions will be carried out, allowing team members to discuss current physical and psychosocial work demands and resources and to develop action plans to minimize strain and, if possible, optimize the resources. At all levels, the intervention will be integrated into the existing organization of work schedules. An extensive process and effect evaluation of need for recovery and work ability will be carried out via questionnaires, observations, interviews and organizational data assessed at several time points throughout the intervention period. Discussion: This study primarily aims to develop, implement and evaluate an intervention based on the above-mentioned features, which may improve the work environment, available resources and health of industrial workers, and hence their need for recovery and work ability.

    Sources of inaction in household finance: evidence from the Danish mortgage markets

    We build an empirical model to attribute delays in mortgage refinancing to psychological costs inhibiting refinancing until incentives are sufficiently strong; and behavior, potentially attributable to information-gathering costs, lowering the probability of household refinancing per unit time at any incentive. We estimate the model on administrative panel data from Denmark, where mortgage refinancing without cash-out is unconstrained. Middle-aged and wealthy households act as if they have high psychological refinancing costs; but older, poorer, and less-educated households refinance with lower probability irrespective of incentives, thereby achieving lower savings. We use the model to understand frictions in the mortgage channel of monetary policy transmission

    mbs: modifying Hudson's ms software to generate samples of DNA sequences with a biallelic site under selection

    Background: The pattern of single nucleotide polymorphisms (SNPs) contains a tremendous amount of information about the mechanisms of the micro-evolutionary process of a species. Inferring the roles of these mechanisms, including natural selection, relies heavily on computer simulations. A coalescent simulation is extremely powerful in generating a large number of samples of DNA sequences from a population (species) when all mutations are neutral, and Hudson's ms software is frequently used for this purpose. However, it has been difficult to incorporate natural selection into the coalescent framework. Results: We herein present a software application to generate samples of DNA sequences when there is a biallelic site targeted by selection. This software application, referred to as mbs, is developed by modifying Hudson's ms. The mbs software is flexible enough to incorporate arbitrary histories of population size changes and any mode of selection, as long as selection is operating on a biallelic site. Conclusion: mbs provides opportunities to investigate the effect of any mode of selection on the pattern of SNPs under various demographic histories.
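
As a rough illustration of what a neutral coalescent simulation does, the sketch below draws the time to the most recent common ancestor (TMRCA) of a sample under the standard Kingman coalescent. It is not the algorithm used by ms or mbs: it assumes a constant population size, no recombination and no selection, and it measures time in units where each pair of lineages coalesces at rate 1. All function names are hypothetical.

```python
import numpy as np


def simulate_tmrca(n_samples, rng):
    """Draw one TMRCA for n_samples lineages under the neutral Kingman coalescent.

    While k lineages remain, the waiting time to the next coalescence is
    exponential with rate k * (k - 1) / 2 (one unit of rate per pair).
    """
    t = 0.0
    k = n_samples
    while k > 1:
        rate = k * (k - 1) / 2.0
        t += rng.exponential(1.0 / rate)  # NumPy takes the scale, i.e. 1 / rate
        k -= 1                            # two lineages merge into one
    return t


def mean_tmrca(n_samples=10, n_reps=20000, seed=0):
    """Monte Carlo estimate of the expected TMRCA."""
    rng = np.random.default_rng(seed)
    return float(np.mean([simulate_tmrca(n_samples, rng) for _ in range(n_reps)]))


def expected_tmrca(n_samples):
    """Analytic expectation: sum over k of 2 / (k * (k - 1)) = 2 * (1 - 1 / n)."""
    return 2.0 * (1.0 - 1.0 / n_samples)


if __name__ == "__main__":
    n = 10
    print(f"simulated mean TMRCA: {mean_tmrca(n):.3f}")
    print(f"analytic expectation: {expected_tmrca(n):.3f}")
```

Adding selection at a biallelic site, as the abstract describes, requires conditioning the genealogy on the frequency trajectory of the selected allele, which this neutral sketch does not attempt.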

    Scaling of Entanglement close to a Quantum Phase Transitions

    In this Letter we discuss the entanglement near a quantum phase transition by analyzing the properties of the concurrence for a class of exactly solvable models in one dimension. We find that entanglement can be classified in the framework of scaling theory. Further, we reveal a profound difference between classical correlations and the non-local quantum correlation, entanglement: the correlation length diverges at the phase transition, whereas entanglement in general remains short ranged.

    An investigation of horizontal transfer of feed introduced DNA to the aerobic microbiota of the gastrointestinal tract of rats

    Background: Horizontal gene transfer through natural transformation of members of the microbiota of the lower gastrointestinal tract (GIT) of mammals has not yet been described. Insufficient DNA sequence similarity for homologous recombination to occur has been identified as the major barrier to interspecies transfer of chromosomal DNA in bacteria. In this study we determined whether regions of high DNA similarity between the genomes of the indigenous bacteria in the GIT of rats and feed-introduced DNA could lead to homologous recombination and acquisition of antibiotic resistance genes. Results: Plasmids carrying two resistance genes (nptII and aadA) and regions of high DNA similarity to the 16S rRNA and 23S rRNA genes found in a broad range of bacterial species in the GIT were constructed and added to standard rat feed. Six rats with a normal microbiota were fed DNA-containing pellets daily over four days before the microbiota was sampled from the different GI compartments (stomach, small intestine, cecum and colon). In addition, two rats were included as negative controls. Antibiotic-resistant colonies growing on selective media were screened for recombination with feed-introduced DNA by PCR targeting unique sites in the putatively recombined regions. Conclusions: The analyses showed that extensive ingestion of DNA (100 µg plasmid per day) did not lead to increased proportions of kanamycin-resistant bacteria, nor did it produce detectable transformants among the aerobic microbiota examined for 6 rats (detection limit <1 transformant per 1.1 × 10⁸ cultured bacteria). The key methodological challenges to HGT detection in animal feeding trials are identified and discussed.

    Serological markers of extracellular matrix remodeling predict transplant‐free survival in primary sclerosing cholangitis

    BACKGROUND: Primary sclerosing cholangitis is a progressive liver disease with a remarkably variable course. Biomarkers of disease activity or prognostic models predicting outcome at an individual level are currently not established. AIM: To evaluate the prognostic utility of four biomarkers of basement membrane and interstitial extracellular matrix remodeling in patients with primary sclerosing cholangitis. METHODS: Serum samples were available from 138 large‐duct primary sclerosing cholangitis patients (of which 102 [74%] with IBD) recruited 2008‐2012 and 52 ulcerative colitis patients (controls). The median follow‐up time was 2.2 (range 0‐4.3) years. Specific biomarkers of type III and V collagen formation (PRO‐C3 and PRO‐C5, respectively) and type III and IV collagen degradation (C3M and C4M, respectively) were assessed. The Enhanced Liver Fibrosis test, including procollagen type III N‐terminal peptide, tissue inhibitor of metalloproteinase‐1 and hyaluronic acid was assessed for comparison. RESULTS: All markers were elevated in primary sclerosing cholangitis compared to ulcerative colitis patients (P < 0.001). PRO‐C3 showed the largest difference between the two groups with a threefold increase in primary sclerosing cholangitis compared to ulcerative colitis patients. Patients with high baseline serum levels of all markers, except C3M, had shorter survival compared to patients with low baseline serum levels (P < 0.001). Combining PRO‐C3 and PRO‐C5 the odds ratio for predicting transplant‐free survival was 47 compared to the Enhanced Liver Fibrosis test's odds ratio of 11. CONCLUSIONS: Extracellular matrix remodeling is elevated in primary sclerosing cholangitis patients compared to ulcerative colitis patients. Furthermore, the interstitial matrix marker PRO‐C3 was identified as a potent prognostic marker and an independent predictor of transplant‐free survival in primary sclerosing cholangitis

    Identification of a novel type of spacer element required for imprinting in fission yeast

    Asymmetrical segregation of differentiated sister chromatids is thought to be important for cellular differentiation in higher eukaryotes. Similarly, in fission yeast, cellular differentiation involves the asymmetrical segregation of a chromosomal imprint. This imprint has been shown to consist of two ribonucleotides that are incorporated into the DNA during lagging-strand synthesis in response to a replication pause, but the underlying mechanism remains unknown. Here we present key novel discoveries important for unravelling this process. Our data show that cis-acting sequences within the mat1 cassette mediate pausing of replication forks at the proximity of the imprinting site, and the results suggest that this pause dictates specific priming at the position of imprinting in a sequence-independent manner. Also, we identify a novel type of cis-acting spacer region important for the imprinting process that affects where subsequent primers are put down after the replication fork is released from the pause. Thus, our data suggest that the imprint is formed by ligation of a not-fully-processed Okazaki fragment to the subsequent fragment. The presented work addresses how differentiated sister chromatids are established during DNA replication through the involvement of replication barriers.