Search CORE

804 research outputs found

Temperature variations from Hubble Space Telescope imagery and spectroscopy of NGC 7009

Author: Balick B
Barlow MJ
Bhatt NJ
Buckalew BA
Dufour RJ
Ferland GJ
Harrington JP
Liu XW
Martin PG
Rubin RH
Storey PJ
Publication venue: WILEY-BLACKWELL
Publication date: 01/08/2002
Field of study

We present new Hubble Space Telescope (HST)/WFPC2 imagery and STIS long-slit spectroscopy of the planetary nebula NGC 7009. The primary goal was to obtain high spatial resolution of the intrinsic line ratio [O III] 4364/5008 and thereby evaluate the electron temperature (Te) and the fractional mean-square Te variation (tA2)across the nebula. The WFPC2 Te map is rather uniform; almost all values are between 9000–11 000 K, with the higher Te values closely coinciding with the inner He++ zone. The results indicate very small values–≲0.01– for tA2 throughout. Our STIS data allow an even more direct determination of Te and tA2, albeit for a much smaller area than with WFPC2. We present results from binning the data along the slit into tiles that are 0.5-arcsec square (matching the slit width). The average [O III] temperature using 45 tiles (excluding the central star and STIS fiducial bars) is 10 139 K; tA2 is 0.0035. The measurements of Te reported here are an average along each line of sight. Therefore, despite finding remarkably low tA2, we cannot completely rule out temperature fluctuations along the line of sight as the cause of the large abundance discrepancy between heavy element abundances inferred from collisionally excited emission lines compared to those derived from recombination lines

UCL Discovery

Comparison of techniques for handling missing covariate data within prognostic modelling studies: a simulation study

Author: A Burton
A Burton
A Marshall
AH Herring
AM Wood
Andrea Marshall
DB Rubin
DB Rubin
DB Rubin
Douglas G Altman
F Barzi
FE Harrell
FE Harrell
FH Kong
HY Chen
I White
J Schafer
J Scheffer
JL Schafer
JL Schafer
JL Schafer
JL Schafer
JL Schafer
KH Li
LM Collins
LQ Tang
M Hu
N Schenker
NJ Horton
P Royston
Patrick Royston
PD Faris
R Bender
R Development Core Team
R Oostenbrink
RJA Little
Roger L Holder
S Demissie
S Greenland
S van Buuren
S van Buuren
SR Lipsitz
SR Lipsitz
TG Clark
W Sauerbrei
W Vach
XL Meng
XL Meng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Background: There is no consensus on the most appropriate approach to handle missing covariate data within prognostic modelling studies. Therefore a simulation study was performed to assess the effects of different missing data techniques on the performance of a prognostic model. Methods: Datasets were generated to resemble the skewed distributions seen in a motivating breast cancer example. Multivariate missing data were imposed on four covariates using four different mechanisms; missing completely at random (MCAR), missing at random (MAR), missing not at random (MNAR) and a combination of all three mechanisms. Five amounts of incomplete cases from 5% to 75% were considered. Complete case analysis (CC), single imputation (SI) and five multiple imputation (MI) techniques available within the R statistical software were investigated: a) data augmentation (DA) approach assuming a multivariate normal distribution, b) DA assuming a general location model, c) regression switching imputation, d) regression switching with predictive mean matching (MICE-PMM) and e) flexible additive imputation models. A Cox proportional hazards model was fitted and appropriate estimates for the regression coefficients and model performance measures were obtained. Results: Performing a CC analysis produced unbiased regression estimates, but inflated standard errors, which affected the significance of the covariates in the model with 25% or more missingness. Using SI, underestimated the variability; resulting in poor coverage even with 10% missingness. Of the MI approaches, applying MICE-PMM produced, in general, the least biased estimates and better coverage for the incomplete covariates and better model performance for all mechanisms. However, this MI approach still produced biased regression coefficient estimates for the incomplete skewed continuous covariates when 50% or more cases had missing data imposed with a MCAR, MAR or combined mechanism. When the missingness depended on the incomplete covariates, i.e. MNAR, estimates were biased with more than 10% incomplete cases for all MI approaches. Conclusion: The results from this simulation study suggest that performing MICE-PMM may be the preferred MI approach provided that less than 50% of the cases have missing data and the missing data are not MNAR

Crossref

Springer - Publisher Connector

University of Birmingham Research Portal

PubMed Central

UCL Discovery

Warwick Research Archives Portal Repository

Oxford University Research Archive

Imputation of Continuous Variables Missing at Random using the Method of Simulated Scores

Author: C Gourieroux
D Fadden Mc
DB Rubin
DB Rubin
DB Rubin
DB Rubin
G Calzolari
JL Schafer
NJ Horton
RA Thisted
RJA Little
TE Raghunathan
V Hajivassiliou
WH Greene
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2002
Field of study

Crossref

Comparison of methods for handling missing data on immunohistochemical markers in survival analysis of breast cancer

Author: A Marshall
AR Donders
DB Rubin
DB Rubin
DB Rubin
G Ambler
G Van der Heijden
IR White
JA Sterne
JL Schafer
JL Schafer
JL Schafer
JM Engels
JO Kim
KG Moons
NJ Horton
RA Little
RA Little
S Greenland
S van Buuren
SJ Dawson
TE Bodner
W Vach
Publication venue: Nature Publishing Group
Publication date: 01/01/2011
Field of study

Background:Tissue micro-arrays (TMAs) are increasingly used to generate data of the molecular phenotype of tumours in clinical epidemiology studies, such as studies of disease prognosis. However, TMA data are particularly prone to missingness. A variety of methods to deal with missing data are available. However, the validity of the various approaches is dependent on the structure of the missing data and there are few empirical studies dealing with missing data from molecular pathology. The purpose of this study was to investigate the results of four commonly used approaches to handling missing data from a large, multi-centre study of the molecular pathological determinants of prognosis in breast cancer.Patients and Methods:We pooled data from over 11 000 cases of invasive breast cancer from five studies that collected information on seven prognostic indicators together with survival time data. We compared the results of a multi-variate Cox regression using four approaches to handling missing data-complete case analysis (CCA), mean substitution (MS) and multiple imputation without inclusion of the outcome (MI) and multiple imputation with inclusion of the outcome (MI). We also performed an analysis in which missing data were simulated under different assumptions and the results of the four methods were compared.Results:Over half the cases had missing data on at least one of the seven variables and 11 percent had missing data on 4 or more. The multi-variate hazard ratio estimates based on multiple imputation models were very similar to those derived after using MS, with similar standard errors. Hazard ratio estimates based on the CCA were only slightly different, but the estimates were less precise as the standard errors were large. However, in data simulated to be missing completely at random (MCAR) or missing at random (MAR), estimates for MI were least biased and most accurate, whereas estimates for CCA were most biased and least accurate.Conclusion:In this study, empirical results from analyses using CCA, MS, MI and MI were similar, although results from CCA were less precise. The results from simulations suggest that in general MI is likely to be the best. Given the ease of implementing MI in standard statistical software, the results of MI and CCA should be compared in any multi-variate analysis where missing data are a problem. © 2011 Cancer Research UK. All rights reserved

Crossref

PubMed Central

Archivio della Ricerca - Università di Pisa

University of Melbourne Institutional Repository

Imputation of continuous variables missing at random using the method of simulated scores

Author: C Gourieroux
D Fadden Mc
DB Rubin
DB Rubin
DB Rubin
DB Rubin
G Calzolari
JL Schafer
NJ Horton
RA Thisted
RJA Little
TE Raghunathan
V Hajivassiliou
WH Greene
Publication venue
Publication date: 01/01/2002
Field of study

For multivariate datasets with missing values, we present a procedure of statistical inference and state its "optimal" properties. Two main assumptions are needed: (1) data are missing at random (MAR); (2) the data generating process is a multivariate normal linear regression. Disentangling the problem of convergence of the iterative estimation/imputation procedure, we show that the estimator is a "method of simulated scores" (a particular case of McFadden's "method of simulated moments"); thus the estimator is equivalent to maximum likelihood if the number of replications is conveniently large, and the whole procedure can be considered an optimal parametric technique for imputation of missing data

Munich RePEc Personal Archive

Crossref

Quantum states made to measure

Author: BL Higgins
C-Y Lu
CM Caves
F Dell'Anno
G Gilbert
H Vahlbruch
Ian A. Walmsley
J Appel
J Estève
J Ye
JG Rarity
JJ Bollinger
JL O'Brien
JP Dowling
Konrad Banaszek
M Bourennane
MA Rubin
MW Mitchell
NJ Cerf
P Walther
PAM Dirac
PJ Mosley
RA Fisher
Rafał Demkowicz-Dobrzański
S Boixo
T Nagata
U Dorner
V Giovannetti
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Recent progress in manipulating quantum states of light and matter brings quantum-enhanced measurements closer to prospective applications. The current challenge is to make quantum metrologic strategies robust against imperfections.Comment: 4 pages, 3 figures, Commentary for Nature Photonic

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive

Multiple Imputation Ensembles (MIE) for dealing with missing data

Author: A Farhangfar
AM Sefidian
B Schölkopf
C Cortes
CT Tran
DA Newman
DB Rubin
DB Rubin
DH Wolpert
EL Silva-Ramírez
GE Batista
GJ van der Heijden
H Gao
IH Witten
J Demšar
J Honaker
J Honaker
J Scheffer
JA Sterne
JL Schafer
JL Schafer
JR Quinlan
K Abayomi
KM Ting
L Breiman
L Breiman
L Rokach
M Fichman
M Khalilia
M Spratt
MA Klebanoff
MJ Azur
NJ Horton
PJ García-Laencina
PJ Kelly
PN Tan
RJ Little
S García
S Van Buuren
S Van Buuren
SS Chae
SS Choi
U Garciarena
V Vapnik
X Chen
Y Dong
Y Freund
Y He
Z Che
Z Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/05/2020
Field of study

Missing data is a significant issue in many real-world datasets, yet there are no robust methods for dealing with it appropriately. In this paper, we propose a robust approach to dealing with missing data in classification problems: Multiple Imputation Ensembles (MIE). Our method integrates two approaches: multiple imputation and ensemble methods and compares two types of ensembles: bagging and stacking. We also propose a robust experimental set-up using 20 benchmark datasets from the UCI machine learning repository. For each dataset, we introduce increasing amounts of data Missing Completely at Random. Firstly, we use a number of single/multiple imputation methods to recover the missing values and then ensemble a number of different classifiers built on the imputed data. We assess the quality of the imputation by using dissimilarity measures. We also evaluate the MIE performance by comparing classification accuracy on the complete and imputed data. Furthermore, we use the accuracy of simple imputation as a benchmark for comparison. We find that our proposed approach combining multiple imputation with ensemble techniques outperform others, particularly as missing data increases

Crossref

University of East Anglia digital repository

Neonatal-onset multisystem inflammatory disease responsive to interleukin-1 beta inhibition

Author: Adams BS
Aksentijevich I
Beitz LO
Brewer C
Butman JA
Canna SW
Cole JL
Dailey NJ
Fuhlbrigge RC
Gardner G
Gelabert A
Goldbach-Mansky R
Hannan WP
Haverkamp MH
Hawkins PN
Hill S
Hoffmann SC
Holland SM
Horn W
Jarvis JN
Jones J
Karp BI
Kastner DL
Kim HJ
Moore TL
O'Neil K
Paul SM
Penzak SR
Pham TH
Pucino F
Rubin BI
Shaham B
Snyder C
Stein L
Turner ML
Vehe RK
Warren RW
Wesley RA
Wiggs E
Zalewski C
Publication venue: 'Massachusetts Medical Society'
Publication date: 10/08/2006
Field of study

BACKGROUND:Neonatal-onset multisystem inflammatory disease is characterized by fever, urticarial rash, aseptic meningitis, deforming arthropathy, hearing loss, and mental retardation. Many patients have mutations in the cold-induced autoinflammatory syndrome 1 (CIAS1) gene, encoding cryopyrin, a protein that regulates inflammation.METHODS:We selected 18 patients with neonatal-onset multisystem inflammatory disease (12 with identifiable CIAS1 mutations) to receive anakinra, an interleukin-1-receptor antagonist (1 to 2 mg per kilogram of body weight per day subcutaneously). In 11 patients, anakinra was withdrawn at three months until a flare occurred. The primary end points included changes in scores in a daily diary of symptoms, serum levels of amyloid A and C-reactive protein, and the erythrocyte sedimentation rate from baseline to month 3 and from month 3 until a disease flare.RESULTS:All 18 patients had a rapid response to anakinra, with disappearance of rash. Diary scores improved (P<0.001) and serum amyloid A (from a median of 174 mg to 8 mg per liter), C-reactive protein (from a median of 5.29 mg to 0.34 mg per deciliter), and the erythrocyte sedimentation rate decreased at month 3 (all P<0.001), and remained low at month 6. Magnetic resonance imaging showed improvement in cochlear and leptomeningeal lesions as compared with baseline. Withdrawal of anakinra uniformly resulted in relapse within days; retreatment led to rapid improvement. There were no drug-related serious adverse events.CONCLUSIONS:Daily injections of anakinra markedly improved clinical and laboratory manifestations in patients with neonatal-onset multisystem inflammatory disease, with or without CIAS1 mutations

UCL Discovery

Is gender encoded in the smile? A computational framework for the analysis of the smile driven dynamic face for gender recognition

Author: A Dantcheva
Ahmad Al-dahoud
AM Bukar
DP Lale
E Cashdan
EL Abel
FM Deutsch
G Antipov
Hassan Ugail
HC Lian
LR Brody
LR Rubin
MF Liébart
NJ Briton
NS Altman
P Ekman
P Viola
RW Simon
S Kalam
U Dimberg
U Dimberg
U Hess
V Surakka
Y LeCun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/03/2018
Field of study

YesAutomatic gender classification has become a topic of great interest to the visual computing research community in recent times. This is due to the fact that computer-based automatic gender recognition has multiple applications including, but not limited to, face perception, age, ethnicity, identity analysis, video surveillance and smart human computer interaction. In this paper, we discuss a machine learning approach for efficient identification of gender purely from the dynamics of a person’s smile. Thus, we show that the complex dynamics of a smile on someone’s face bear much relation to the person’s gender. To do this, we first formulate a computational framework that captures the dynamic characteristics of a smile. Our dynamic framework measures changes in the face during a smile using a set of spatial features on the overall face, the area of the mouth, the geometric flow around prominent parts of the face and a set of intrinsic features based on the dynamic geometry of the face. This enables us to extract 210 distinct dynamic smile parameters which form as the contributing features for machine learning. For machine classification, we have utilised both the Support Vector Machine and the k-Nearest Neighbour algorithms. To verify the accuracy of our approach, we have tested our algorithms on two databases, namely the CK+ and the MUG, consisting of a total of 109 subjects. As a result, using the k-NN algorithm, along with tenfold cross validation, for example, we achieve an accurate gender classification rate of over 85%. Hence, through the methodology we present here, we establish proof of the existence of strong indicators of gender dimorphism, purely in the dynamics of a person’s smile

Crossref

Bradford Scholars

Statistical power considerations in genotype-based recall randomized controlled trials

Author: A Kurbasic
A Maitournam
AP Morris
BF Voight
C Moore
C Sudlow
E Zeggini
F Renstrom
GM Peloso
GS Yeo
JA Nettleton
JL Perez-Gracia
KA Jablonski
LT Guey
M Garcia-Closas
MJ Go
MY Wong
NJ Schork
PW Franks
R Bender
R Simon
RA Scott
RR Rubin
SF Grant
TI Pollin
W Johnson
WC Knowler
Y Hu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Randomized controlled trials (RCT) are often underpowered for validating gene-treatment interactions. Using published data from the Diabetes Prevention Program (DPP), we examined power in conventional and genotype-based recall (GBR) trials. We calculated sample size and statistical power for genemetformin interactions (vs. placebo) using incidence rates, gene-drug interaction effect estimates and allele frequencies reported in the DPP for the rs8065082 SLC47A1 variant, a metformin transported encoding locus. We then calculated statistical power for interactions between genetic risk scores (GRS), metformin treatment and intensive lifestyle intervention (ILI) given a range of sampling frames, clinical trial sample sizes, interaction effect estimates, and allele frequencies; outcomes were type 2 diabetes incidence (time-to-event) and change in small LDL particles (continuous outcome). Thereafter, we compared two recruitment frameworks: GBR (participants recruited from the extremes of a GRS distribution) and conventional sampling (participants recruited without explicit emphasis on genetic characteristics). We further examined the influence of outcome measurement error on statistical power. Under most simulated scenarios, GBR trials have substantially higher power to observe gene-drug and gene-lifestyle interactions than same-sized conventional RCTs. GBR trials are becoming popular for validation of gene-treatment interactions; our analyses illustrate the strengths and weaknesses of this design

Lund University Publications

Crossref

Publikationer från Umeå universitet

Harvard University - DASH

Publikationer från Uppsala Universitet

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

University of Dundee Online Publications