3,407 research outputs found

    Replica analysis of overfitting in regression models for time-to-event data

    Get PDF
    Overfitting, which happens when the number of parameters in a model is too large compared to the number of data points available for determining these parameters, is a serious and growing problem in survival analysis. While modern medicine presents us with data of unprecedented dimensionality, these data cannot yet be used effectively for clinical outcome prediction. Standard error measures in maximum likelihood regression, such as p-values and z-scores, are blind to overfitting, and even for Cox's proportional hazards model (the main tool of medical statisticians), one finds in literature only rules of thumb on the number of samples required to avoid overfitting. In this paper we present a mathematical theory of overfitting in regression models for time-to-event data, which aims to increase our quantitative understanding of the problem and provide practical tools with which to correct regression outcomes for the impact of overfitting. It is based on the replica method, a statistical mechanical technique for the analysis of heterogeneous many-variable systems that has been used successfully for several decades in physics, biology, and computer science, but not yet in medical statistics. We develop the theory initially for arbitrary regression models for time-to-event data, and verify its predictions in detail for the popular Cox model.Comment: 37 pages, 9 figure

    Selective recruitment designs for improving observational studies using electronic health records

    Get PDF
    Large‐scale electronic health records (EHRs) present an opportunity to quickly identify suitable individuals in order to directly invite them to participate in an observational study. EHRs can contain data from millions of individuals, raising the question of how to optimally select a cohort of size n from a larger pool of size N . In this article, we propose a simple selective recruitment protocol that selects a cohort in which covariates of interest tend to have a uniform distribution. We show that selectively recruited cohorts potentially offer greater statistical power and more accurate parameter estimates than randomly selected cohorts. Our protocol can be applied to studies with multiple categorical and continuous covariates. We apply our protocol to a numerically simulated prospective observational study using an EHR database of stable acute coronary disease patients from 82 089 individuals in the U.K. Selective recruitment designs require a smaller sample size, leading to more efficient and cost‐effective studies

    Women and Heart Disease: Neglected Directions for Future Research

    Get PDF
    Before age 65, women have less heart disease than men. For many years, estrogen was the most popular explanation for this female advantage, and observational studies through the 1980s showed a lower risk of heart attacks in postmenopausal women taking “replacement” estrogen. But the Women’s Health Initiative (WHI), the first placebo-controlled trials of hormone therapy with the size and statistical power necessary to study clinical cardiovascular outcomes, did not confirm the hormone-healthy heart hypothesis. Now, at least 5 years later, the most unexpected WHI result may be how resilient the estrogen hypothesis has been. Where, beyond estrogen therapy, should we go from here to explain the striking sex differences in heart disease rates? A broader spectrum of research about the female cardiovascular advantage and its translation is needed

    Vulnerability of refugees with communication disabilities to SGBV: evidence from Rwanda

    Get PDF
    Refugees with communication disabilities are particularly vulnerable to sexual and gender-based violence, in part because of their limited ability to report abuse

    Beliefs and Values About Music in Early Childhood Education and Care: Perspectives From Practitioners

    Get PDF
    This paper reports the findings of a study that aimed to identify the music beliefs and values of educators in early childhood education and care settings in Australia. The aims of the study were 2-fold: to adapt and pilot a survey of music beliefs and values which might be implemented subsequently nationally in childcare settings; and, secondly, to identify the music beliefs and values held by early childhood and care educators concerning music in children's learning. The research questions that guided this component of the study were: What is the profile of early childhood and care educators? What beliefs and values for music engagement are held by early childhood and care educators? What shapes early childhood and care educators' music beliefs and values? Findings indicated that educators' beliefs and values on all items are above the mid-point indicating overall positive attitudes toward music despite the majority having no formal qualifications in music or a history of instrumental performance and/or singing. Given the overall positive attitudes toward music we suggest there is enormous potential within this population for further professional learning and development targeted at music and its potential wider benefits in young children's learning and lives

    Economic costs of minor depression: a population-based study

    Get PDF
    Objective: Although the clinical relevance of minor depression has been demonstrated in many studies, the economic costs are not well explored. In this study, we examine the economic costs of minor depression. Method: In a large-scale, population-based study in the Netherlands (n ¼ 5504) the costs of minor depression were compared with the costs of major depression and dysthymia. Excess costs, i.e. the costs of a disorder over and above the costs attributable to other illnesses, were estimated with help of regression analysis. The direct medical costs, the direct non-medical costs and the indirect non-medical costs were calculated. The year 2003 was used as the reference year. Results: The annual per capita excess costs of minor depression were US2141(95 2141 (95% CI ¼ 753–3529) higher than the base rate costs of US 1023, while the costs of major depression were US$ 3313 (95% CI ¼ 1234–5390) higher than the base rate. The costs of minor depression per 1 million inhabitants were 160 million dollars per year, which is somewhat less than the costs of major depression (192 million dollars per year). Conclusion: The economic costs associated with minor depression are considerable and approach those of major depression

    Regulation of pituitary MT1 melatonin receptor expression by gonadotrophin-releasing hormone (GnRH) and early growth response factor-1 (Egr-1) : in vivo and in vitro studies

    Get PDF
    Copyright: © 2014 Bae et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Funding: This work was funded by the UK Biotechnology and Biological Sciences Research Council (BBSRC; grant BB/F020309/1; http://www.bbsrc.ac.uk/home/home.aspx). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.Peer reviewedPublisher PD

    Death, dying and informatics: misrepresenting religion on MedLine

    Get PDF
    BACKGROUND: The globalization of medical science carries for doctors worldwide a correlative duty to deepen their understanding of patients' cultural contexts and religious backgrounds, in order to satisfy each as a unique individual. To become better informed, practitioners may turn to MedLine, but it is unclear whether the information found there is an accurate representation of culture and religion. To test MedLine's representation of this field, we chose the topic of death and dying in the three major monotheistic religions. METHODS: We searched MedLine using PubMed in order to retrieve and thematically analyze full-length scholarly journal papers or case reports dealing with religious traditions and end-of-life care. Our search consisted of a string of words that included the most common denominations of the three religions, the standard heading terms used by the National Reference Center for Bioethics Literature (NRCBL), and the Medical Subject Headings (MeSH) used by the National Library of Medicine. Eligible articles were limited to English-language papers with an abstract. RESULTS: We found that while a bibliographic search in MedLine on this topic produced instant results and some valuable literature, the aggregate reflected a selection bias. American writers were over-represented given the global prevalence of these religious traditions. Denominationally affiliated authors predominated in representing the Christian traditions. The Islamic tradition was under-represented. CONCLUSION: MedLine's capability to identify the most current, reliable and accurate information about purely scientific topics should not be assumed to be the same case when considering the interface of religion, culture and end-of-life care

    Probing the Heterogeneity of Protein Kinase Activation in Cells by Super-Resolution Microscopy

    Get PDF
    Heterogeneity of mitogen-activated protein kinase (MAPK) activation in genetically identical cells, which occurs in response to epidermal growth factor receptor (EGFR) signaling, remains poorly understood. MAPK cascades integrate signals emanating from different EGFR spatial locations, including the plasma membrane and endocytic compartment. We previously hypothesized that in EGF-stimulated cells the MAPK phosphorylation (pMAPK) level and activity are largely determined by the spatial organization of the EGFR clusters within the cell. For experimental testing of this hypothesis, we used super-resolution microscopy to define EGFR clusters by receptor numbers (N) and average intra-cluster distances (d). From this data, we predicted the extent of pMAPK with 85% accuracy on a cell-to-cell basis with control data returning 54% accuracy (P50nm were most predictive for pMAPK level in cells. Electron microscopy revealed that these large clusters were primarily localized to the limiting membrane of multivesicular bodies (MVB). Many tighter packed dimers/multimers (d<50nm) were found on intraluminal vesicles within MVBs, where they were unlikely to activate MAPK because of the physical separation. Our results suggest that cell-to-cell differences in N and d contain crucial information to predict EGFR-activated cellular pMAPK levels and explain pMAPK heterogeneity in isogenic cells

    Rank-based poverty measures and poverty ordering with an application to Tunisia

    Get PDF
    Using the normative approach, we develop a class of poverty measures that is function of a weighting system. Each particular weighting function corresponds to a particular social judgment. This offers the decision-maker a large selection of social preferences functions, and he can choose the one that best represents his social judgment. We also develop new concepts of a-extended TIP curves. They are used to establish the conditions of the robust and unanimous poverty ranking of our measures. These conditions are in terms of second-and higher-degree TIP dominance. Finally, we provide an empirical illustration using Tunisian data on the 2005–2010 period.info:eu-repo/semantics/publishedVersio
    corecore