325 research outputs found

    Confidence Intervals for Maximin Effects in Inhomogeneous Large-Scale Data

    Full text link
    One challenge of large-scale data analysis is that the assumption of an identical distribution for all samples is often not realistic. An optimal linear regression might, for example, be markedly different for distinct groups of the data. Maximin effects have been proposed as a computationally attractive way to estimate effects that are common across all data without fitting a mixture distribution explicitly. So far just point estimators of the common maximin effects have been proposed in Meinshausen and B\"uhlmann (2014). Here we propose asymptotically valid confidence regions for these effects

    backShift: Learning causal cyclic graphs from unknown shift interventions

    Full text link
    We propose a simple method to learn linear causal cyclic models in the presence of latent variables. The method relies on equilibrium data of the model recorded under a specific kind of interventions ("shift interventions"). The location and strength of these interventions do not have to be known and can be estimated from the data. Our method, called backShift, only uses second moments of the data and performs simple joint matrix diagonalization, applied to differences between covariance matrices. We give a sufficient and necessary condition for identifiability of the system, which is fulfilled almost surely under some quite general assumptions if and only if there are at least three distinct experimental settings, one of which can be pure observational data. We demonstrate the performance on some simulated data and applications in flow cytometry and financial time series. The code is made available as R-package backShift

    THE EFFECTS OF CARDIAC SURGICAL PROCEDURES ON HEALTH – RELATED QUALITY OF LIFE, COGNITIVE PERFORMANCE, AND EMOTIONAL STATUS OUTCOMES: A PROSPECTIVE 6 – MONTH FOLLOW – UP STUDY

    Get PDF
    Introduction: The aim of this study was to assess the course of health – related quality of life, cognitive and emotional change during the six months after elective CABG, and to investigate how cognitive impairments, depression and posttraumatic stress symptoms were related to quality of life. Method: In a prospective study, we followed up for 6 months 138 of the original 147 patients who had undergone elective CABG surgery. Conclusion: Elective CABG is associated with significant improvements in HRQOL relative to the preoperative period, but impairments in HRQOL were found in a subgroup of post – CABG patients with evidence of PTSD, depression, or cognitive impairments at 6 – month follow – up

    Distributionally robust and generalizable inference

    Full text link
    We discuss recently developed methods that quantify the stability and generalizability of statistical findings under distributional changes. In many practical problems, the data is not drawn i.i.d. from the target population. For example, unobserved sampling bias, batch effects, or unknown associations might inflate the variance compared to i.i.d. sampling. For reliable statistical inference, it is thus necessary to account for these types of variation. We discuss and review two methods that allow quantifying distribution stability based on a single dataset. The first method computes the sensitivity of a parameter under worst-case distributional perturbations to understand which types of shift pose a threat to external validity. The second method treats distributional shifts as random which allows assessing average robustness (instead of worst-case). Based on a stability analysis of multiple estimators on a single dataset, it integrates both sampling and distributional uncertainty into a single confidence interval

    One estimator, many estimands: fine-grained quantification of uncertainty using conditional inference

    Full text link
    Statistical uncertainty has many components, such as measurement errors, temporal variation, or sampling. Not all of these sources are relevant when considering a specific application, since practitioners might view some attributes of observations as fixed. We study the statistical inference problem arising when data is drawn conditionally on some attributes. These attributes are assumed to be sampled from a super-population but viewed as fixed when conducting uncertainty quantification. The estimand is thus defined as the parameter of a conditional distribution. We propose methods to construct conditionally valid p-values and confidence intervals for these conditional estimands based on asymptotically linear estimators. In this setting, a given estimator is conditionally unbiased for potentially many conditional estimands, which can be seen as parameters of different populations. Testing different populations raises questions of multiple testing. We discuss simple procedures that control novel conditional error rates. In addition, we introduce a bias correction technique that enables transfer of estimators across conditional distributions arising from the same super-population. This can be used to infer parameters and estimators on future datasets based on some new data. The validity and applicability of the proposed methods are demonstrated on simulated and real-world data.Comment: 60 page
    • …
    corecore