374 research outputs found

    Multiple Imputation for Incomplete Data in Epidemiologic Studies

    Get PDF
    Epidemiologic studies are frequently susceptible to missing information. Omitting observations with missing variables remains a common strategy in epidemiologic studies, yet this simple approach can often severely bias parameter estimates of interest if the values are not missing completely at random. Even when missingness is completely random, complete-case analysis can reduce the efficiency of estimated parameters, because large amounts of available data are simply tossed out with the incomplete observations. Alternative methods for mitigating the influence of missing information, such as multiple imputation, are becoming an increasing popular strategy in order to retain all available information, reduce potential bias, and improve efficiency in parameter estimation. In this paper, we describe the theoretical underpinnings of multiple imputation, and we illustrate application of this method as part of a collaborative challenge to assess the performance of various techniques for dealing with missing data (Am J Epidemiol. 2018;187(3):568–575). We detail the steps necessary to perform multiple imputation on a subset of data from the Collaborative Perinatal Project (1959–1974), where the goal is to estimate the odds of spontaneous abortion associated with smoking during pregnancy

    Generalizing the per-protocol treatment effect: The case of ACTG A5095

    Get PDF
    Background Intention-to-treat comparisons of randomized trials provide asymptotically consistent estimators of the effect of treatment assignment, without regard to compliance. However, decision makers often wish to know the effect of a per-protocol comparison. Moreover, decision makers may also wish to know the effect of treatment assignment or treatment protocol in a user-specified target population other than the sample in which the trial was fielded. Here, we aimed to generalize results from the ACTG A5095 trial to the US recently HIV-diagnosed target population. Methods We first replicated the published conventional intention-to-treat estimate (2-year risk difference and hazard ratio) comparing a four-drug antiretroviral regimen to a three-drug regimen in the A5095 trial. We then estimated the intention-to-treat effect that accounted for informative dropout and the per-protocol effect that additionally accounted for protocol deviations by constructing inverse probability weights. Furthermore, we employed inverse odds of sampling weights to generalize both intention-to-treat and per-protocol effects to a target population comprising US individuals with HIV diagnosed during 2008–2014. Results Of 761 subjects in the analysis, 82 dropouts (36 in the three-drug arm and 46 in the four-drug arm) and 59 protocol deviations (25 in the three-drug arm and 34 in the four-drug arm) occurred during the first 2 years of follow-up. A total of 169 subjects incurred virologic failure or death. The 2-year risks were similar both in the trial and in the US HIV-diagnosed target population for estimates from the conventional intention-to-treat, dropout-weighted intention-to-treat, and per-protocol analyses. In the US target population, the 2-year conventional intention-to-treat risk difference (unit: %) for virologic failure or death comparing the four-drug arm to the three-drug arm was −0.4 (95% confidence interval: −6.2, 5.1), while the hazard ratio was 0.97 (95% confidence interval: 0.70, 1.34); the 2-year risk difference was −0.9 (95% confidence interval: −6.9, 5.3) for the dropout-weighted intention-to-treat comparison (hazard ratio = 0.95, 95% confidence interval: 0.68, 1.32) and −0.7 (95% confidence interval: −6.7, 5.5) for the per-protocol comparison (hazard ratio = 0.96, 95% confidence interval: 0.69, 1.34). Conclusion No benefit of four-drug antiretroviral regimen over three-drug regimen was found from the conventional intention-to-treat, dropout-weighted intention-to-treat or per-protocol estimates in the trial sample or target population

    Inverse-Probability-Weighted Estimation for Monotone and Nonmonotone Missing Data

    Get PDF
    Missing data is a common occurrence in epidemiologic research. In this paper, 3 data sets with induced missing values from the Collaborative Perinatal Project, a multisite US study conducted from 1959 to 1974, are provided as examples of prototypical epidemiologic studies with missing data. Our goal was to estimate the association of maternal smoking behavior with spontaneous abortion while adjusting for numerous confounders. At the same time, we did not necessarily wish to evaluate the joint distribution among potentially unobserved covariates, which is seldom the subject of substantive scientific interest. The inverse probability weighting (IPW) approach preserves the semiparametric structure of the underlying model of substantive interest and clearly separates the model of substantive interest from the model used to account for the missing data. However, IPW often will not result in valid inference if the missing-data pattern is nonmonotone, even if the data are missing at random. We describe a recently proposed approach to modeling nonmonotone missing-data mechanisms under missingness at random to use in constructing the weights in IPW complete-case estimation, and we illustrate the approach using 3 data sets described in a companion article (Am J Epidemiol. 2018;187(3):568–575)

    Principled Approaches to Missing Data in Epidemiologic Studies

    Get PDF
    Principled methods with which to appropriately analyze missing data have long existed; however, broad implementation of these methods remains challenging. In this and 2 companion papers (Am J Epidemiol. 2018;187(3):576–584 and Am J Epidemiol. 2018;187(3):585–591), we discuss issues pertaining to missing data in the epidemiologic literature. We provide details regarding missing-data mechanisms and nomenclature and encourage the conduct of principled analyses through a detailed comparison of multiple imputation and inverse probability weighting. Data from the Collaborative Perinatal Project, a multisite US study conducted from 1959 to 1974, are used to create a masked data-analytical challenge with missing data induced by known mechanisms. We illustrate the deleterious effects of missing data with naive methods and show how principled methods can sometimes mitigate such effects. For example, when data were missing at random, naive methods showed a spurious protective effect of smoking on the risk of spontaneous abortion (odds ratio (OR) = 0.43, 95% confidence interval (CI): 0.19, 0.93), while implementation of principled methods multiple imputation (OR = 1.30, 95% CI: 0.95, 1.77) or augmented inverse probability weighting (OR = 1.40, 95% CI: 1.00, 1.97) provided estimates closer to the “true” full-data effect (OR = 1.31, 95% CI: 1.05, 1.64). We call for greater acknowledgement of and attention to missing data and for the broad use of principled missing-data methods in epidemiologic research

    Large Scale Pressure Fluctuations and Sunyaev-Zel'dovich Effect

    Get PDF
    The Sunyaev-Zel'dovich (SZ) effect associated with pressure fluctuations of the large scale structure gas distribution will be probed with current and upcoming wide-field small angular scale cosmic microwave background experiments. We study the generation of pressure fluctuations by baryons which are present in virialized dark matter halos and by baryons present in small overdensities. For collapsed halos, assuming the gas distribution is in hydrostatic equilibrium with matter density distribution, we predict the pressure power spectrum and bispectrum associated with the large scale structure gas distribution by extending the dark matter halo approach which describes the density field in terms of correlations between and within halos. The projected pressure power spectrum allows a determination of the resulting SZ power spectrum due to virialized structures. The unshocked photoionized baryons present in smaller overdensities trace the Jeans-scale smoothed dark matter distribution. They provide a lower limit to the SZ effect due to large scale structure in the absence of massive collapsed halos. We extend our calculations to discuss higher order statistics, such as bispectrum and skewness in SZ data. The SZ-weak lensing cross-correlation is suggested as a probe of correlations between dark matter and baryon density fields, while the probability distribution functions of peak statistics of SZ halos in wide field CMB data can be used as a probe of cosmology and non-Gaussian evolution of large scale structure pressure fluctuations.Comment: 16 pages, 9 figures; Revised with expanded discussions. Phys. Rev. D. (in press

    Learning physics in context: a study of student learning about electricity and magnetism

    Full text link
    This paper re-centres the discussion of student learning in physics to focus on context. In order to do so, a theoretically-motivated understanding of context is developed. Given a well-defined notion of context, data from a novel university class in electricity and magnetism are analyzed to demonstrate the central and inextricable role of context in student learning. This work sits within a broader effort to create and analyze environments which support student learning in the sciencesComment: 36 pages, 4 Figure

    Influence of indomethacin on lens regeneration in the newt notophthalmus viridescens

    Full text link
    Following lentectomy newts were injected with indomethacin in a variety of carrier solutions at doses ranging from 1.2–120 mg/kg body weight every other day for 15–17 days. The results show that injection of this drug according to the regimen used has no significant effect on regeneration of the lens. The data suggest, but do not prove, that prostaglandins may not play a major role in the early phases of lens regeneration in the newt.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/47503/1/427_2004_Article_BF00848434.pd

    Transportability from Randomized Trials to Clinical Care: On Initial HIV Treatment with Efavirenz and Suicidal Thoughts or Behaviors

    Get PDF
    In an analysis of randomized trials, use of efavirenz for treatment of human immunodeficiency virus (HIV) infection was associated with increased suicidal thoughts/behaviors. However, analyses of observational data have found no evidence of increased risk. To assess whether population differences might explain this divergence, we transported the effect of efavirenz use from these trials to a specific target population. Using inverse odds weights and multiple imputation, we transported the effect of efavirenz on suicidal thoughts/behaviors in these randomized trials (participants were enrolled in 2001-2007) to a trials-eligible cohort of US adults initiating antiretroviral therapy while receiving HIV clinical care at medical centers between 1999 and 2015. Overall, 8,291 cohort participants and 3,949 trial participants were eligible. Prescription of antidepressants (19% vs. 13%) and injection drug history (16% vs. 10%) were more frequent in the cohort than in the trial participants. Compared with the effect in trials, the estimated hazard ratio for efavirenz on suicidal thoughts/behaviors was attenuated in our target population (trials: hazard ratio (HR) = 2.3 (95% confidence interval (CI): 1.2, 4.4); transported: HR = 1.8 (95% CI: 0.9, 4.4)), whereas the incidence rate difference was similar (trials: HR = 5.1 (95% CI: 1.6, 8.7); transported: HR = 5.4 (95% CI:-0.4, 11.4)). In our target population, there was greater than 20% attenuation of the hazard ratio estimate as compared with the trials-only estimate. Transporting results from trials to a target population is informative for addressing external validity

    Fitting the integrated Spectral Energy Distributions of Galaxies

    Full text link
    Fitting the spectral energy distributions (SEDs) of galaxies is an almost universally used technique that has matured significantly in the last decade. Model predictions and fitting procedures have improved significantly over this time, attempting to keep up with the vastly increased volume and quality of available data. We review here the field of SED fitting, describing the modelling of ultraviolet to infrared galaxy SEDs, the creation of multiwavelength data sets, and the methods used to fit model SEDs to observed galaxy data sets. We touch upon the achievements and challenges in the major ingredients of SED fitting, with a special emphasis on describing the interplay between the quality of the available data, the quality of the available models, and the best fitting technique to use in order to obtain a realistic measurement as well as realistic uncertainties. We conclude that SED fitting can be used effectively to derive a range of physical properties of galaxies, such as redshift, stellar masses, star formation rates, dust masses, and metallicities, with care taken not to over-interpret the available data. Yet there still exist many issues such as estimating the age of the oldest stars in a galaxy, finer details ofdust properties and dust-star geometry, and the influences of poorly understood, luminous stellar types and phases. The challenge for the coming years will be to improve both the models and the observational data sets to resolve these uncertainties. The present review will be made available on an interactive, moderated web page (sedfitting.org), where the community can access and change the text. The intention is to expand the text and keep it up to date over the coming years.Comment: 54 pages, 26 figures, Accepted for publication in Astrophysics & Space Scienc

    Recent natural selection causes adaptive evolution of an avian polygenic trait

    Get PDF
    We used extensive data from a long-term study of great tits (Parus major) in the United Kingdom and Netherlands to better understand how genetic signatures of selection translate into variation in fitness and phenotypes. We found that genomic regions under differential selection contained candidate genes for bill morphology and used genetic architecture analyses to confirm that these genes, especially the collagen gene COL4A5, explained variation in bill length. COL4A5 variation was associated with reproductive success, which, combined with spatiotemporal patterns of bill length, suggested ongoing selection for longer bills in the United Kingdom. Last, bill length and COL4A5 variation were associated with usage of feeders, suggesting that longer bills may have evolved in the United Kingdom as a response to supplementary feeding
    • 

    corecore