974 research outputs found
An Assessment of Data Transfer Performance for Large-Scale Climate Data Analysis and Recommendations for the Data Infrastructure for CMIP6
We document the data transfer workflow, data transfer performance, and other
aspects of staging approximately 56 terabytes of climate model output data from
the distributed Coupled Model Intercomparison Project (CMIP5) archive to the
National Energy Research Supercomputing Center (NERSC) at the Lawrence Berkeley
National Laboratory required for tracking and characterizing extratropical
storms, a phenomena of importance in the mid-latitudes. We present this
analysis to illustrate the current challenges in assembling multi-model data
sets at major computing facilities for large-scale studies of CMIP5 data.
Because of the larger archive size of the upcoming CMIP6 phase of model
intercomparison, we expect such data transfers to become of increasing
importance, and perhaps of routine necessity. We find that data transfer rates
using the ESGF are often slower than what is typically available to US
residences and that there is significant room for improvement in the data
transfer capabilities of the ESGF portal and data centers both in terms of
workflow mechanics and in data transfer performance. We believe performance
improvements of at least an order of magnitude are within technical reach using
current best practices, as illustrated by the performance we achieved in
transferring the complete raw data set between two high performance computing
facilities. To achieve these performance improvements, we recommend: that
current best practices (such as the Science DMZ model) be applied to the data
servers and networks at ESGF data centers; that sufficient financial and human
resources be devoted at the ESGF data centers for systems and network
engineering tasks to support high performance data movement; and that
performance metrics for data transfer between ESGF data centers and major
computing facilities used for climate data analysis be established, regularly
tested, and published
Quantifying statistical uncertainty in the attribution of human influence on severe weather
Event attribution in the context of climate change seeks to understand the
role of anthropogenic greenhouse gas emissions on extreme weather events,
either specific events or classes of events. A common approach to event
attribution uses climate model output under factual (real-world) and
counterfactual (world that might have been without anthropogenic greenhouse gas
emissions) scenarios to estimate the probabilities of the event of interest
under the two scenarios. Event attribution is then quantified by the ratio of
the two probabilities. While this approach has been applied many times in the
last 15 years, the statistical techniques used to estimate the risk ratio based
on climate model ensembles have not drawn on the full set of methods available
in the statistical literature and have in some cases used and interpreted the
bootstrap method in non-standard ways. We present a precise frequentist
statistical framework for quantifying the effect of sampling uncertainty on
estimation of the risk ratio, propose the use of statistical methods that are
new to event attribution, and evaluate a variety of methods using statistical
simulations. We conclude that existing statistical methods not yet in use for
event attribution have several advantages over the widely-used bootstrap,
including better statistical performance in repeated samples and robustness to
small estimated probabilities. Software for using the methods is available
through the climextRemes package available for R or Python. While we focus on
frequentist statistical methods, Bayesian methods are likely to be particularly
useful when considering sources of uncertainty beyond sampling uncertainty.Comment: 41 pages, 11 figures, 1 tabl
Quantifying the effect of interannual ocean variability on the attribution of extreme climate events to human influence
In recent years, the climate change research community has become highly
interested in describing the anthropogenic influence on extreme weather events,
commonly termed "event attribution." Limitations in the observational record
and in computational resources motivate the use of uncoupled,
atmosphere/land-only climate models with prescribed ocean conditions run over a
short period, leading up to and including an event of interest. In this
approach, large ensembles of high-resolution simulations can be generated under
factual observed conditions and counterfactual conditions that might have been
observed in the absence of human interference; these can be used to estimate
the change in probability of the given event due to anthropogenic influence.
However, using a prescribed ocean state ignores the possibility that estimates
of attributable risk might be a function of the ocean state. Thus, the
uncertainty in attributable risk is likely underestimated, implying an
over-confidence in anthropogenic influence.
In this work, we estimate the year-to-year variability in calculations of the
anthropogenic contribution to extreme weather based on large ensembles of
atmospheric model simulations. Our results both quantify the magnitude of
year-to-year variability and categorize the degree to which conclusions of
attributable risk are qualitatively affected. The methodology is illustrated by
exploring extreme temperature and precipitation events for the northwest coast
of South America and northern-central Siberia; we also provides results for
regions around the globe. While it remains preferable to perform a full
multi-year analysis, the results presented here can serve as an indication of
where and when attribution researchers should be concerned about the use of
atmosphere-only simulations
Evaluation of NASA\u27s MERRA Precipitation Product in Reproducing the Observed Trend and Distribution of Extreme Precipitation Events in the United States
This study evaluates the performance of NASA’s Modern-Era Retrospective Analysis for Research and Applications (MERRA) precipitation product in reproducing the trend and distribution of extreme precipitation events. Utilizing the extreme value theory, time-invariant and time-variant extreme value distributions are developed to model the trends and changes in the patterns of extreme precipitation events over the contiguous United States during 1979–2010. The Climate Prediction Center (CPC) U.S. Unified gridded observation data are used as the observational dataset. The CPC analysis shows that the eastern and western parts of the United States are experiencing positive and negative trends in annual maxima, respectively. The continental-scale patterns of change found in MERRA seem to reasonably mirror the observed patterns of change found in CPC. This is not previously expected, given the difficulty in constraining precipitation in reanalysis products. MERRA tends to overestimate the frequency at which the 99th percentile of precipitation is exceeded because this threshold tends to be lower in MERRA, making it easier to be exceeded. This feature is dominant during the summer months. MERRA tends to reproduce spatial patterns of the scale and location parameters of the generalized extreme value and generalized Pareto distributions. However, MERRA underestimates these parameters, particularly over the Gulf Coast states, leading to lower magnitudes in extreme precipitation events. Two issues in MERRA are identified: 1) MERRA shows a spurious negative trend in Nebraska and Kansas, which is most likely related to the changes in the satellite observing system over time that has apparently affected the water cycle in the central United States, and 2) the patterns of positive trend over the Gulf Coast states and along the East Coast seem to be correlated with the tropical cyclones in these regions. The analysis of the trends in the seasonal precipitation extremes indicates that the hurricane and winter seasons are contributing the most to these trend patterns in the southeastern United States. In addition, the increasing annual trend simulated by MERRA in the Gulf Coast region is due to an incorrect trend in winter precipitation extremes
Detected changes in precipitation extremes at their native scales derived from in situ measurements
The gridding of daily accumulated precipitation -- especially extremes --
from ground-based station observations is problematic due to the fractal nature
of precipitation, and therefore estimates of long period return values and
their changes based on such gridded daily data sets are generally
underestimated. In this paper, we characterize high-resolution changes in
observed extreme precipitation from 1950 to 2017 for the contiguous United
States (CONUS) based on in situ measurements only. Our analysis utilizes
spatial statistical methods that allow us to derive gridded estimates that do
not smooth extreme daily measurements and are consistent with statistics from
the original station data while increasing the resulting signal to noise ratio.
Furthermore, we use a robust statistical technique to identify significant
pointwise changes in the climatology of extreme precipitation while carefully
controlling the rate of false positives. We present and discuss seasonal
changes in the statistics of extreme precipitation: the largest and most
spatially-coherent pointwise changes are in fall (SON), with approximately 33%
of CONUS exhibiting significant changes (in an absolute sense). Other seasons
display very few meaningful pointwise changes (in either a relative or absolute
sense), illustrating the difficulty in detecting pointwise changes in extreme
precipitation based on in situ measurements. While our main result involves
seasonal changes, we also present and discuss annual changes in the statistics
of extreme precipitation. In this paper we only seek to detect changes over
time and leave attribution of the underlying causes of these changes for future
work
Recommended from our members
Western North Pacific tropical cyclone model tracks in present and future climates
Western North Pacific tropical cyclone (TC) model tracks are analyzed in two large multimodel ensembles, spanning a large variety of models and multiple future climate scenarios. Two methodologies are used to synthesize the properties of TC tracks in this large data set: cluster analysis and mass moment ellipses. First, the models' TC tracks are compared to observed TC tracks' characteristics, and a subset of the models is chosen for analysis, based on the tracks' similarity to observations and sample size. Potential changes in track types in a warming climate are identified by comparing the kernel smoothed probability distributions of various track variables in historical and future scenarios using a Kolmogorov-Smirnov significance test. Two track changes are identified. The first is a statistically significant increase in the north-south expansion, which can also be viewed as a poleward shift, as TC tracks are prevented from expanding equatorward due to the weak Coriolis force near the equator. The second change is an eastward shift in the storm tracks that occur near the central Pacific in one of the multimodel ensembles, indicating a possible increase in the occurrence of storms near Hawaii in a warming climate. The dependence of the results on which model and future scenario are considered emphasizes the necessity of including multiple models and scenarios when considering future changes in TC characteristics
- …
