807 research outputs found
Asymptotics for the Hirsch Index
The last decade methods for quantifying the research output of individual researchers have become quite popular in academic policy making. The h- index (Hirsch, 2005) constitutes an interesting quality measure that has attracted a lot of attention recently. It is now a standard measure available for instance on theWeb of Science. In this paper we establish the asymptotic normality of the empirical h-index. The rate of convergence is non-standard: ph=(1 + nf(h)), where f is the density of the citation distribution and n the number of publications of a researcher. In case that the citations follow a Pareto-type or a Weibull-type distribution as defined in extreme value theory, our general result nicely specializes to results that are useful for constructing confidence intervals for the h-index.Asymptotic normality;confidence interval;extreme value theory;research output;scientometrics;tail empirical process.
Second-order refined peaks-over-threshold modelling for heavy-tailed distributions
Modelling excesses over a high threshold using the Pareto or generalized
Pareto distribution (PD/GPD) is the most popular approach in extreme value
statistics. This method typically requires high thresholds in order for the
(G)PD to fit well and in such a case applies only to a small upper fraction of
the data. The extension of the (G)PD proposed in this paper is able to describe
the excess distribution for lower thresholds in case of heavy tailed
distributions. This yields a statistical model that can be fitted to a larger
portion of the data. Moreover, estimates of tail parameters display stability
for a larger range of thresholds. Our findings are supported by asymptotic
results, simulations and a case study.Comment: to appear in the Journal of Statistical Planning and Inferenc
Mandelbrot's Extremism
In the sixties Mandelbrot already showed that extreme price swings are more likely than some of us think or incorporate in our models.A modern toolbox for analyzing such rare events can be found in the field of extreme value theory.At the core of extreme value theory lies the modelling of maxima over large blocks of observations and of excesses over high thresholds.The general validity of these models makes them suitable for out-of-sample extrapolation.By way of illustration we assess the likeliness of the crash of the Dow Jones on October 19, 1987, a loss that was more than twice as large as on any other single day from 1954 until 2004.exceedances;extreme value theory;heavy tails;maxima
Unbiased Tail Estimation by an Extension of the Generalized Pareto Distribution
AMS classifications: 62G20; 62G32;bias;exchange rate;heavy tails;peaks-over-threshold;regular variation;tail index
Semiparametric Lower Bounds for Tail Index Estimation
indexation;semiparametric estimation
Modelling Censored Losses Using Splicing: a Global Fit Strategy With Mixed Erlang and Extreme Value Distributions
In risk analysis, a global fit that appropriately captures the body and the
tail of the distribution of losses is essential. Modelling the whole range of
the losses using a standard distribution is usually very hard and often
impossible due to the specific characteristics of the body and the tail of the
loss distribution. A possible solution is to combine two distributions in a
splicing model: a light-tailed distribution for the body which covers light and
moderate losses, and a heavy-tailed distribution for the tail to capture large
losses. We propose a splicing model with a mixed Erlang (ME) distribution for
the body and a Pareto distribution for the tail. This combines the flexibility
of the ME distribution with the ability of the Pareto distribution to model
extreme values. We extend our splicing approach for censored and/or truncated
data. Relevant examples of such data can be found in financial risk analysis.
We illustrate the flexibility of this splicing model using practical examples
from risk measurement
- …