804 research outputs found

    Temperature variations from Hubble Space Telescope imagery and spectroscopy of NGC 7009

    Get PDF
    We present new Hubble Space Telescope (HST)/WFPC2 imagery and STIS long-slit spectroscopy of the planetary nebula NGC 7009. The primary goal was to obtain high spatial resolution of the intrinsic line ratio [O III] 4364/5008 and thereby evaluate the electron temperature (Te) and the fractional mean-square Te variation (tA2)across the nebula. The WFPC2 Te map is rather uniform; almost all values are between 9000–11 000 K, with the higher Te values closely coinciding with the inner He++ zone. The results indicate very small values–≲0.01– for tA2 throughout. Our STIS data allow an even more direct determination of Te and tA2, albeit for a much smaller area than with WFPC2. We present results from binning the data along the slit into tiles that are 0.5-arcsec square (matching the slit width). The average [O III] temperature using 45 tiles (excluding the central star and STIS fiducial bars) is 10 139 K; tA2 is 0.0035. The measurements of Te reported here are an average along each line of sight. Therefore, despite finding remarkably low tA2, we cannot completely rule out temperature fluctuations along the line of sight as the cause of the large abundance discrepancy between heavy element abundances inferred from collisionally excited emission lines compared to those derived from recombination lines

    Comparison of techniques for handling missing covariate data within prognostic modelling studies: a simulation study

    Get PDF
    Background: There is no consensus on the most appropriate approach to handle missing covariate data within prognostic modelling studies. Therefore a simulation study was performed to assess the effects of different missing data techniques on the performance of a prognostic model. Methods: Datasets were generated to resemble the skewed distributions seen in a motivating breast cancer example. Multivariate missing data were imposed on four covariates using four different mechanisms; missing completely at random (MCAR), missing at random (MAR), missing not at random (MNAR) and a combination of all three mechanisms. Five amounts of incomplete cases from 5% to 75% were considered. Complete case analysis (CC), single imputation (SI) and five multiple imputation (MI) techniques available within the R statistical software were investigated: a) data augmentation (DA) approach assuming a multivariate normal distribution, b) DA assuming a general location model, c) regression switching imputation, d) regression switching with predictive mean matching (MICE-PMM) and e) flexible additive imputation models. A Cox proportional hazards model was fitted and appropriate estimates for the regression coefficients and model performance measures were obtained. Results: Performing a CC analysis produced unbiased regression estimates, but inflated standard errors, which affected the significance of the covariates in the model with 25% or more missingness. Using SI, underestimated the variability; resulting in poor coverage even with 10% missingness. Of the MI approaches, applying MICE-PMM produced, in general, the least biased estimates and better coverage for the incomplete covariates and better model performance for all mechanisms. However, this MI approach still produced biased regression coefficient estimates for the incomplete skewed continuous covariates when 50% or more cases had missing data imposed with a MCAR, MAR or combined mechanism. When the missingness depended on the incomplete covariates, i.e. MNAR, estimates were biased with more than 10% incomplete cases for all MI approaches. Conclusion: The results from this simulation study suggest that performing MICE-PMM may be the preferred MI approach provided that less than 50% of the cases have missing data and the missing data are not MNAR

    Comparison of methods for handling missing data on immunohistochemical markers in survival analysis of breast cancer

    Get PDF
    Background:Tissue micro-arrays (TMAs) are increasingly used to generate data of the molecular phenotype of tumours in clinical epidemiology studies, such as studies of disease prognosis. However, TMA data are particularly prone to missingness. A variety of methods to deal with missing data are available. However, the validity of the various approaches is dependent on the structure of the missing data and there are few empirical studies dealing with missing data from molecular pathology. The purpose of this study was to investigate the results of four commonly used approaches to handling missing data from a large, multi-centre study of the molecular pathological determinants of prognosis in breast cancer.Patients and Methods:We pooled data from over 11 000 cases of invasive breast cancer from five studies that collected information on seven prognostic indicators together with survival time data. We compared the results of a multi-variate Cox regression using four approaches to handling missing data-complete case analysis (CCA), mean substitution (MS) and multiple imputation without inclusion of the outcome (MI) and multiple imputation with inclusion of the outcome (MI). We also performed an analysis in which missing data were simulated under different assumptions and the results of the four methods were compared.Results:Over half the cases had missing data on at least one of the seven variables and 11 percent had missing data on 4 or more. The multi-variate hazard ratio estimates based on multiple imputation models were very similar to those derived after using MS, with similar standard errors. Hazard ratio estimates based on the CCA were only slightly different, but the estimates were less precise as the standard errors were large. However, in data simulated to be missing completely at random (MCAR) or missing at random (MAR), estimates for MI were least biased and most accurate, whereas estimates for CCA were most biased and least accurate.Conclusion:In this study, empirical results from analyses using CCA, MS, MI and MI were similar, although results from CCA were less precise. The results from simulations suggest that in general MI is likely to be the best. Given the ease of implementing MI in standard statistical software, the results of MI and CCA should be compared in any multi-variate analysis where missing data are a problem. © 2011 Cancer Research UK. All rights reserved

    Imputation of continuous variables missing at random using the method of simulated scores

    Get PDF
    For multivariate datasets with missing values, we present a procedure of statistical inference and state its "optimal" properties. Two main assumptions are needed: (1) data are missing at random (MAR); (2) the data generating process is a multivariate normal linear regression. Disentangling the problem of convergence of the iterative estimation/imputation procedure, we show that the estimator is a "method of simulated scores" (a particular case of McFadden's "method of simulated moments"); thus the estimator is equivalent to maximum likelihood if the number of replications is conveniently large, and the whole procedure can be considered an optimal parametric technique for imputation of missing data

    Quantum states made to measure

    Full text link
    Recent progress in manipulating quantum states of light and matter brings quantum-enhanced measurements closer to prospective applications. The current challenge is to make quantum metrologic strategies robust against imperfections.Comment: 4 pages, 3 figures, Commentary for Nature Photonic

    Multiple Imputation Ensembles (MIE) for dealing with missing data

    Get PDF
    Missing data is a significant issue in many real-world datasets, yet there are no robust methods for dealing with it appropriately. In this paper, we propose a robust approach to dealing with missing data in classification problems: Multiple Imputation Ensembles (MIE). Our method integrates two approaches: multiple imputation and ensemble methods and compares two types of ensembles: bagging and stacking. We also propose a robust experimental set-up using 20 benchmark datasets from the UCI machine learning repository. For each dataset, we introduce increasing amounts of data Missing Completely at Random. Firstly, we use a number of single/multiple imputation methods to recover the missing values and then ensemble a number of different classifiers built on the imputed data. We assess the quality of the imputation by using dissimilarity measures. We also evaluate the MIE performance by comparing classification accuracy on the complete and imputed data. Furthermore, we use the accuracy of simple imputation as a benchmark for comparison. We find that our proposed approach combining multiple imputation with ensemble techniques outperform others, particularly as missing data increases

    Neonatal-onset multisystem inflammatory disease responsive to interleukin-1 beta inhibition

    Get PDF
    BACKGROUND:Neonatal-onset multisystem inflammatory disease is characterized by fever, urticarial rash, aseptic meningitis, deforming arthropathy, hearing loss, and mental retardation. Many patients have mutations in the cold-induced autoinflammatory syndrome 1 (CIAS1) gene, encoding cryopyrin, a protein that regulates inflammation.METHODS:We selected 18 patients with neonatal-onset multisystem inflammatory disease (12 with identifiable CIAS1 mutations) to receive anakinra, an interleukin-1-receptor antagonist (1 to 2 mg per kilogram of body weight per day subcutaneously). In 11 patients, anakinra was withdrawn at three months until a flare occurred. The primary end points included changes in scores in a daily diary of symptoms, serum levels of amyloid A and C-reactive protein, and the erythrocyte sedimentation rate from baseline to month 3 and from month 3 until a disease flare.RESULTS:All 18 patients had a rapid response to anakinra, with disappearance of rash. Diary scores improved (P<0.001) and serum amyloid A (from a median of 174 mg to 8 mg per liter), C-reactive protein (from a median of 5.29 mg to 0.34 mg per deciliter), and the erythrocyte sedimentation rate decreased at month 3 (all P<0.001), and remained low at month 6. Magnetic resonance imaging showed improvement in cochlear and leptomeningeal lesions as compared with baseline. Withdrawal of anakinra uniformly resulted in relapse within days; retreatment led to rapid improvement. There were no drug-related serious adverse events.CONCLUSIONS:Daily injections of anakinra markedly improved clinical and laboratory manifestations in patients with neonatal-onset multisystem inflammatory disease, with or without CIAS1 mutations

    Is gender encoded in the smile? A computational framework for the analysis of the smile driven dynamic face for gender recognition

    Get PDF
    YesAutomatic gender classification has become a topic of great interest to the visual computing research community in recent times. This is due to the fact that computer-based automatic gender recognition has multiple applications including, but not limited to, face perception, age, ethnicity, identity analysis, video surveillance and smart human computer interaction. In this paper, we discuss a machine learning approach for efficient identification of gender purely from the dynamics of a person’s smile. Thus, we show that the complex dynamics of a smile on someone’s face bear much relation to the person’s gender. To do this, we first formulate a computational framework that captures the dynamic characteristics of a smile. Our dynamic framework measures changes in the face during a smile using a set of spatial features on the overall face, the area of the mouth, the geometric flow around prominent parts of the face and a set of intrinsic features based on the dynamic geometry of the face. This enables us to extract 210 distinct dynamic smile parameters which form as the contributing features for machine learning. For machine classification, we have utilised both the Support Vector Machine and the k-Nearest Neighbour algorithms. To verify the accuracy of our approach, we have tested our algorithms on two databases, namely the CK+ and the MUG, consisting of a total of 109 subjects. As a result, using the k-NN algorithm, along with tenfold cross validation, for example, we achieve an accurate gender classification rate of over 85%. Hence, through the methodology we present here, we establish proof of the existence of strong indicators of gender dimorphism, purely in the dynamics of a person’s smile

    Statistical power considerations in genotype-based recall randomized controlled trials

    Get PDF
    Randomized controlled trials (RCT) are often underpowered for validating gene-treatment interactions. Using published data from the Diabetes Prevention Program (DPP), we examined power in conventional and genotype-based recall (GBR) trials. We calculated sample size and statistical power for genemetformin interactions (vs. placebo) using incidence rates, gene-drug interaction effect estimates and allele frequencies reported in the DPP for the rs8065082 SLC47A1 variant, a metformin transported encoding locus. We then calculated statistical power for interactions between genetic risk scores (GRS), metformin treatment and intensive lifestyle intervention (ILI) given a range of sampling frames, clinical trial sample sizes, interaction effect estimates, and allele frequencies; outcomes were type 2 diabetes incidence (time-to-event) and change in small LDL particles (continuous outcome). Thereafter, we compared two recruitment frameworks: GBR (participants recruited from the extremes of a GRS distribution) and conventional sampling (participants recruited without explicit emphasis on genetic characteristics). We further examined the influence of outcome measurement error on statistical power. Under most simulated scenarios, GBR trials have substantially higher power to observe gene-drug and gene-lifestyle interactions than same-sized conventional RCTs. GBR trials are becoming popular for validation of gene-treatment interactions; our analyses illustrate the strengths and weaknesses of this design
    • …
    corecore