69 research outputs found
Generic Conditions for Forecast Dominance
Recent studies have analyzed whether one forecast method dominates another
under a class of consistent scoring functions. While the existing literature
focuses on empirical tests of forecast dominance, little is known about the
theoretical conditions under which one forecast dominates another. To address
this question, we derive a new characterization of dominance among forecasts of
the mean functional. We present various scenarios under which dominance occurs.
Unlike existing results, our results allow for the case that the forecasts'
underlying information sets are not nested, and allow for uncalibrated
forecasts that suffer, e.g., from model misspecification or parameter
estimation error. We illustrate the empirical relevance of our results via data
examples from finance and economics
Copula Calibration
We propose notions of calibration for probabilistic forecasts of general
multivariate quantities. Probabilistic copula calibration is a natural analogue
of probabilistic calibration in the univariate setting. It can be assessed
empirically by checking for the uniformity of the copula probability integral
transform (CopPIT), which is invariant under coordinate permutations and
coordinatewise strictly monotone transformations of the predictive distribution
and the outcome. The CopPIT histogram can be interpreted as a generalization
and variant of the multivariate rank histogram, which has been used to check
the calibration of ensemble forecasts. Climatological copula calibration is an
analogue of marginal calibration in the univariate setting. Methods and tools
are illustrated in a simulation study and applied to compare raw numerical
model and statistically postprocessed ensemble forecasts of bivariate wind
vectors
Higher order elicitability and Osband's principle
A statistical functional, such as the mean or the median, is called
elicitable if there is a scoring function or loss function such that the
correct forecast of the functional is the unique minimizer of the expected
score. Such scoring functions are called strictly consistent for the
functional. The elicitability of a functional opens the possibility to compare
competing forecasts and to rank them in terms of their realized scores. In this
paper, we explore the notion of elicitability for multi-dimensional functionals
and give both necessary and sufficient conditions for strictly consistent
scoring functions. We cover the case of functionals with elicitable components,
but we also show that one-dimensional functionals that are not elicitable can
be a component of a higher order elicitable functional. In the case of the
variance this is a known result. However, an important result of this paper is
that spectral risk measures with a spectral measure with finite support are
jointly elicitable if one adds the `correct' quantiles. A direct consequence of
applied interest is that the pair (Value at Risk, Expected Shortfall) is
jointly elicitable under mild conditions that are usually fulfilled in risk
management applications.Comment: 32 page
Supplement to "Erratum: Higher Order Elicitability and Osband's Principle"
This note corrects conditions in Proposition 3.4 and Theorem 5.2(ii) and
comments on imprecisions in Propositions 4.2 and 4.4 in Fissler and Ziegel
(2016).Comment: 12 pages, 1 figure, to appear as a supplement in the Annals of
Statistic
Elicitability and backtesting: Perspectives for banking regulation
Conditional forecasts of risk measures play an important role in internal
risk management of financial institutions as well as in regulatory capital
calculations. In order to assess forecasting performance of a risk measurement
procedure, risk measure forecasts are compared to the realized financial losses
over a period of time and a statistical test of correctness of the procedure is
conducted. This process is known as backtesting. Such traditional backtests are
concerned with assessing some optimality property of a set of risk measure
estimates. However, they are not suited to compare different risk estimation
procedures. We investigate the proposal of comparative backtests, which are
better suited for method comparisons on the basis of forecasting accuracy, but
necessitate an elicitable risk measure. We argue that supplementing traditional
backtests with comparative backtests will enhance the existing trading book
regulatory framework for banks by providing the correct incentive for accuracy
of risk measure forecasts. In addition, the comparative backtesting framework
could be used by banks internally as well as by researchers to guide selection
of forecasting methods. The discussion focuses on three risk measures,
Value-at-Risk, expected shortfall and expectiles, and is supported by a
simulation study and data analysis
Expected Shortfall is jointly elicitable with Value at Risk - Implications for backtesting
In this note, we comment on the relevance of elicitability for backtesting
risk measure estimates. In particular, we propose the use of Diebold-Mariano
tests, and show how they can be implemented for Expected Shortfall (ES), based
on the recent result of Fissler and Ziegel (2015) that ES is jointly elicitable
with Value at Risk
- …