Search CORE

3,497 research outputs found

Climatology and dynamics of the link between dry intrusions and cold fronts during winter. Part I: global climatology

Author: Catto JL
Raveh-Rubin S
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/04/2019
Field of study

This is the final version. Available on open access from Springer via the DOI in this recordData availability: ERA-Interim data are available online (http://apps.ecmwf.int/datasets/).Cold fronts are a primary feature of the day-to-day variability of weather in the midlatitudes, and feature in conceptual extratropical cyclone models alongside the dry intrusion airstream. Here the climatological frequency and spatial distribution of the co-occurrence of these two features are quantified, and the differences in cold front characteristics (intensity, size, and precipitation) when a dry intrusion is present or not are calculated. Fronts are objectively identified in the ECMWF ERA-Interim dataset for the winter seasons in each hemisphere and split into 3 sub-types: central fronts (within a cyclone area); trailing fronts (outwith the cyclone area but connected to a central front); and isolated fronts (not connected to a cyclone). These are then associated with dry intrusions identified using Lagrangian trajectory analysis. Trailing fronts are most likely to be associated with a DI in both hemispheres, and this occurs more frequently in the western parts of the major storm track regions. Isolated fronts are linked to DIs more frequently on the eastern ends of the storm tracks, and in the subtropics. All front types, when co-occurring with a DI, are stronger in terms of their temperature gradient, are much larger in area, and typically have higher average precipitation. Therefore, climatologically the link with DIs increases the impact of cold fronts. There are some differences in the statistics of the precipitation for trailing and isolated fronts that are further investigated in Part II of this study.Australian Research CouncilSwiss National Science FoundationBenoziyo Endowment Fund for the Advancement of Scienc

Open Research Exeter

Climatology and dynamics of the link between dry intrusions and cold fronts during winter, Part II: Front-centred perspective

Author: Catto JL
Raveh-Rubin S
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/06/2019
Field of study

This is the author accepted manuscript. The final version is available from Springer via the DOI in this record.The conceptual picture of an extratropical cyclone typically includes a cold front and a dry intrusion (DI) behind it. By objectively identifying fronts and DIs in ECMWF ERA-Interim data for 1979–2014, Part I quantified the climatological relationship between cold fronts and DIs. Driven by the finding that front intensity and frontal precipitation are enhanced in the presence of DIs, here we employ a front-centred perspective to focus on the dynamical and thermodynamical environment of cold fronts with and without DIs in the Northern Hemisphere winter. Distinguishing between trailing fronts (that connect to a parent cyclone) and isolated fronts, examples of DIs behind each type illustrate the baroclinic environment of the trailing front, and the lack of strong temperature gradients across the isolated front. Composite analyses of North Atlantic and North Pacific fronts outline the major differences in the presence of DIs, compared to similar fronts but without DIs in their vicinity. The magnitude and spatial structure of the modification by DIs depends on the front intensity. Yet, generally with DIs, trailing fronts occur with stronger SLP dipole, deeper upper-tropospheric trough, stronger 10-m wind gusts, enhanced ocean sensible and latent heat fluxes in the cyclone cold sector and heavier precipitation. Isolated weak fronts exhibit similar behaviour, with different spatial structure. This study highlights the central role of DIs for shaping the variability of fronts and their associated environment and impact.Australian Research Council DECR

Open Research Exeter

Comparison of techniques for handling missing covariate data within prognostic modelling studies: a simulation study

Author: A Burton
A Burton
A Marshall
AH Herring
AM Wood
Andrea Marshall
DB Rubin
DB Rubin
DB Rubin
Douglas G Altman
F Barzi
FE Harrell
FE Harrell
FH Kong
HY Chen
I White
J Schafer
J Scheffer
JL Schafer
JL Schafer
JL Schafer
JL Schafer
JL Schafer
KH Li
LM Collins
LQ Tang
M Hu
N Schenker
NJ Horton
P Royston
Patrick Royston
PD Faris
R Bender
R Development Core Team
R Oostenbrink
RJA Little
Roger L Holder
S Demissie
S Greenland
S van Buuren
S van Buuren
SR Lipsitz
SR Lipsitz
TG Clark
W Sauerbrei
W Vach
XL Meng
XL Meng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Background: There is no consensus on the most appropriate approach to handle missing covariate data within prognostic modelling studies. Therefore a simulation study was performed to assess the effects of different missing data techniques on the performance of a prognostic model. Methods: Datasets were generated to resemble the skewed distributions seen in a motivating breast cancer example. Multivariate missing data were imposed on four covariates using four different mechanisms; missing completely at random (MCAR), missing at random (MAR), missing not at random (MNAR) and a combination of all three mechanisms. Five amounts of incomplete cases from 5% to 75% were considered. Complete case analysis (CC), single imputation (SI) and five multiple imputation (MI) techniques available within the R statistical software were investigated: a) data augmentation (DA) approach assuming a multivariate normal distribution, b) DA assuming a general location model, c) regression switching imputation, d) regression switching with predictive mean matching (MICE-PMM) and e) flexible additive imputation models. A Cox proportional hazards model was fitted and appropriate estimates for the regression coefficients and model performance measures were obtained. Results: Performing a CC analysis produced unbiased regression estimates, but inflated standard errors, which affected the significance of the covariates in the model with 25% or more missingness. Using SI, underestimated the variability; resulting in poor coverage even with 10% missingness. Of the MI approaches, applying MICE-PMM produced, in general, the least biased estimates and better coverage for the incomplete covariates and better model performance for all mechanisms. However, this MI approach still produced biased regression coefficient estimates for the incomplete skewed continuous covariates when 50% or more cases had missing data imposed with a MCAR, MAR or combined mechanism. When the missingness depended on the incomplete covariates, i.e. MNAR, estimates were biased with more than 10% incomplete cases for all MI approaches. Conclusion: The results from this simulation study suggest that performing MICE-PMM may be the preferred MI approach provided that less than 50% of the cases have missing data and the missing data are not MNAR

Crossref

Springer - Publisher Connector

University of Birmingham Research Portal

PubMed Central

UCL Discovery

Warwick Research Archives Portal Repository

Oxford University Research Archive

Comparison of methods for handling missing data on immunohistochemical markers in survival analysis of breast cancer

Author: A Marshall
AR Donders
DB Rubin
DB Rubin
DB Rubin
G Ambler
G Van der Heijden
IR White
JA Sterne
JL Schafer
JL Schafer
JL Schafer
JM Engels
JO Kim
KG Moons
NJ Horton
RA Little
RA Little
S Greenland
S van Buuren
SJ Dawson
TE Bodner
W Vach
Publication venue: Nature Publishing Group
Publication date: 01/01/2011
Field of study

Background:Tissue micro-arrays (TMAs) are increasingly used to generate data of the molecular phenotype of tumours in clinical epidemiology studies, such as studies of disease prognosis. However, TMA data are particularly prone to missingness. A variety of methods to deal with missing data are available. However, the validity of the various approaches is dependent on the structure of the missing data and there are few empirical studies dealing with missing data from molecular pathology. The purpose of this study was to investigate the results of four commonly used approaches to handling missing data from a large, multi-centre study of the molecular pathological determinants of prognosis in breast cancer.Patients and Methods:We pooled data from over 11 000 cases of invasive breast cancer from five studies that collected information on seven prognostic indicators together with survival time data. We compared the results of a multi-variate Cox regression using four approaches to handling missing data-complete case analysis (CCA), mean substitution (MS) and multiple imputation without inclusion of the outcome (MI) and multiple imputation with inclusion of the outcome (MI). We also performed an analysis in which missing data were simulated under different assumptions and the results of the four methods were compared.Results:Over half the cases had missing data on at least one of the seven variables and 11 percent had missing data on 4 or more. The multi-variate hazard ratio estimates based on multiple imputation models were very similar to those derived after using MS, with similar standard errors. Hazard ratio estimates based on the CCA were only slightly different, but the estimates were less precise as the standard errors were large. However, in data simulated to be missing completely at random (MCAR) or missing at random (MAR), estimates for MI were least biased and most accurate, whereas estimates for CCA were most biased and least accurate.Conclusion:In this study, empirical results from analyses using CCA, MS, MI and MI were similar, although results from CCA were less precise. The results from simulations suggest that in general MI is likely to be the best. Given the ease of implementing MI in standard statistical software, the results of MI and CCA should be compared in any multi-variate analysis where missing data are a problem. © 2011 Cancer Research UK. All rights reserved

Crossref

PubMed Central

Archivio della Ricerca - Università di Pisa

University of Melbourne Institutional Repository

On the variational cohomology of g-invariant foliations

Author: Almeida R
Kumpera A
Rubin JL
Publication venue: 'AIP Publishing'
Publication date: 26/11/2015
Field of study

Let S be an integrable Pfaffian system. If it is invariant under a transversally free infinitesimal action of a finite dimensional real Lie algebra g, we show that the 'vertical' variational cohomology of S is equal to the Lie algebra cohomology of g with values in the space of the 'horizontal' cohomology in a maximum dimension. This result, besides giving an effective algorithm for the computation of the variational cohomology of an invariant Pfaffian system, provides a method for detecting obstructions to the existence of infinitesimal actions leaving a given system invariant. (C) 2003 American Institute of Physics.44104702471

Repositorio da Producao Cientifica e Intelectual da Unicamp

Stereotyping and the treatment of missing data for drug and alcohol clinical trials

Author: AJ Figueredo
DR Rubin
JL Schafer
R Little
RJA Little
S Fielding
S Hedden
Stephan Arndt
Publication venue: BioMed Central
Publication date: 01/02/2009
Field of study

Stigma and stereotyping of marginalized groups often is insidious and shows up in unlikely places, for instance in how clinical trials consider dropouts in treatment research. A surprising number of studies presume that people who do not complete the study protocol relapse and code their data as if they had been observed. There is no good statistical rationale for this treatment of missing data and numerous and more defensible alternative methods are available. We need to be mindful about our attitudes and preconceptions about the people we are intending to help. There is no good reason to continue to support science built on this scientifically indefensible stereotyping, however unintentional

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Recovery of information from multiple imputation: a simulation study

Author: A Marshall
D Rubin
DB Rubin
DB Rubin
F Barzi
H Demirtas
I White
JAC Sterne
JL Schafer
JL Schafer
John B Carlin
JR Carpenter
JR Carpenter
Katherine J Lee
KJ Lee
LM Collins
MA Klebanoff
P Royston
S VanBuuren
StataCorp
TE Raghunathan
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Combining estimates of interest in prognostic modelling studies after multiple imputation: current practice and guidelines

Author: A Burton
A Rouxel
AC Mertens
Andrea Marshall
BL Thomsen
C Serrat
D Collett
DB Rubin
DB Rubin
DB Rubin
DG Altman
Douglas G Altman
DW Hosmer
FE Harrell
FE Harrell
FR Hampel
G Ambler
G Vaughn
HC van Houwelingen
J O'Quigley
JA Hoeting
JC Wyatt
JL Schafer
JW Graham
KH Li
M Schemper
M Schemper
MG Kenward
MW Heymans
N Orsini
O Harel
P Peduzzi
P Royston
Patrick Royston
RA Fisher
Roger L Holder
S Gill
S Sinharay
S van Buuren
T Bärnighausen
TG Clark
TG Clark
WM Stadler
XL Meng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Background: Multiple imputation (MI) provides an effective approach to handle missing covariate data within prognostic modelling studies, as it can properly account for the missing data uncertainty. The multiply imputed datasets are each analysed using standard prognostic modelling techniques to obtain the estimates of interest. The estimates from each imputed dataset are then combined into one overall estimate and variance, incorporating both the within and between imputation variability. Rubin's rules for combining these multiply imputed estimates are based on asymptotic theory. The resulting combined estimates may be more accurate if the posterior distribution of the population parameter of interest is better approximated by the normal distribution. However, the normality assumption may not be appropriate for all the parameters of interest when analysing prognostic modelling studies, such as predicted survival probabilities and model performance measures. Methods: Guidelines for combining the estimates of interest when analysing prognostic modelling studies are provided. A literature review is performed to identify current practice for combining such estimates in prognostic modelling studies. Results: Methods for combining all reported estimates after MI were not well reported in the current literature. Rubin's rules without applying any transformations were the standard approach used, when any method was stated. Conclusion: The proposed simple guidelines for combining estimates after MI may lead to a wider and more appropriate use of MI in future prognostic modelling studies

Crossref

Springer - Publisher Connector

University of Birmingham Research Portal

Directory of Open Access Journals

PubMed Central

UCL Discovery

Warwick Research Archives Portal Repository

Oxford University Research Archive

Multiple Imputation Ensembles (MIE) for dealing with missing data

Author: A Farhangfar
AM Sefidian
B Schölkopf
C Cortes
CT Tran
DA Newman
DB Rubin
DB Rubin
DH Wolpert
EL Silva-Ramírez
GE Batista
GJ van der Heijden
H Gao
IH Witten
J Demšar
J Honaker
J Honaker
J Scheffer
JA Sterne
JL Schafer
JL Schafer
JR Quinlan
K Abayomi
KM Ting
L Breiman
L Breiman
L Rokach
M Fichman
M Khalilia
M Spratt
MA Klebanoff
MJ Azur
NJ Horton
PJ García-Laencina
PJ Kelly
PN Tan
RJ Little
S García
S Van Buuren
S Van Buuren
SS Chae
SS Choi
U Garciarena
V Vapnik
X Chen
Y Dong
Y Freund
Y He
Z Che
Z Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/05/2020
Field of study

Missing data is a significant issue in many real-world datasets, yet there are no robust methods for dealing with it appropriately. In this paper, we propose a robust approach to dealing with missing data in classification problems: Multiple Imputation Ensembles (MIE). Our method integrates two approaches: multiple imputation and ensemble methods and compares two types of ensembles: bagging and stacking. We also propose a robust experimental set-up using 20 benchmark datasets from the UCI machine learning repository. For each dataset, we introduce increasing amounts of data Missing Completely at Random. Firstly, we use a number of single/multiple imputation methods to recover the missing values and then ensemble a number of different classifiers built on the imputed data. We assess the quality of the imputation by using dissimilarity measures. We also evaluate the MIE performance by comparing classification accuracy on the complete and imputed data. Furthermore, we use the accuracy of simple imputation as a benchmark for comparison. We find that our proposed approach combining multiple imputation with ensemble techniques outperform others, particularly as missing data increases

Crossref

University of East Anglia digital repository

Methane storms as a driver of Titan's dune orientation

Author: A Lucas
Antoine Lucas
B Charnay
Benjamin Charnay
CA Griffith
Clément Narteau
DM Burr
DM Rubin
DM Rubin
EL Barth
EL Barth
EP Turtle
EP Turtle
Erika Barth
HB Niemann
J Radebaugh
JL Mitchell
KM Mahoney
M Fulchignoni
NT Bridges
RC Ewing
RD Lorenz
RD Lorenz
RD Lorenz
RD Lorenz
S Courrech du Pont
S Lebonnois
S Miller
S Rodriguez
S Rodriguez
Scot Rafkin
Sylvain Courrech du Pont
Sébastien Lebonnois
Sébastien Rodriguez
T Schneider
T Tokano
T Tokano
X Zhu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Titan's equatorial regions are covered by eastward propagating linear dunes. This direction is opposite to mean surface winds simulated by Global Climate Models (GCMs), which are oriented westward at these latitudes, similar to trade winds on Earth. Different hypotheses have been proposed to address this apparent contradiction, involving Saturn's gravitational tides, large scale topography or wind statistics, but none of them can explain a global eastward dune propagation in the equatorial band. Here we analyse the impact of equinoctial tropical methane storms developing in the superrotating atmosphere (i.e. the eastward winds at high altitude) on Titan's dune orientation. Using mesoscale simulations of convective methane clouds with a GCM wind profile featuring superrotation, we show that Titan's storms should produce fast eastward gust fronts above the surface. Such gusts dominate the aeolian transport, allowing dunes to extend eastward. This analysis therefore suggests a coupling between superrotation, tropical methane storms and dune formation on Titan. Furthermore, together with GCM predictions and analogies to some terrestrial dune fields, this work provides a general framework explaining several major features of Titan's dunes: linear shape, eastward propagation and poleward divergence, and implies an equatorial origin of Titan's dune sand.Comment: Published online on Nature Geoscience on 13 April 201

arXiv.org e-Print Archive

HAL-Ecole des Ponts ParisTech