807 research outputs found

    Asymptotics for the Hirsch Index

    Get PDF
    The last decade methods for quantifying the research output of individual researchers have become quite popular in academic policy making. The h- index (Hirsch, 2005) constitutes an interesting quality measure that has attracted a lot of attention recently. It is now a standard measure available for instance on theWeb of Science. In this paper we establish the asymptotic normality of the empirical h-index. The rate of convergence is non-standard: ph=(1 + nf(h)), where f is the density of the citation distribution and n the number of publications of a researcher. In case that the citations follow a Pareto-type or a Weibull-type distribution as defined in extreme value theory, our general result nicely specializes to results that are useful for constructing confidence intervals for the h-index.Asymptotic normality;confidence interval;extreme value theory;research output;scientometrics;tail empirical process.

    Second-order refined peaks-over-threshold modelling for heavy-tailed distributions

    Full text link
    Modelling excesses over a high threshold using the Pareto or generalized Pareto distribution (PD/GPD) is the most popular approach in extreme value statistics. This method typically requires high thresholds in order for the (G)PD to fit well and in such a case applies only to a small upper fraction of the data. The extension of the (G)PD proposed in this paper is able to describe the excess distribution for lower thresholds in case of heavy tailed distributions. This yields a statistical model that can be fitted to a larger portion of the data. Moreover, estimates of tail parameters display stability for a larger range of thresholds. Our findings are supported by asymptotic results, simulations and a case study.Comment: to appear in the Journal of Statistical Planning and Inferenc

    Mandelbrot's Extremism

    Get PDF
    In the sixties Mandelbrot already showed that extreme price swings are more likely than some of us think or incorporate in our models.A modern toolbox for analyzing such rare events can be found in the field of extreme value theory.At the core of extreme value theory lies the modelling of maxima over large blocks of observations and of excesses over high thresholds.The general validity of these models makes them suitable for out-of-sample extrapolation.By way of illustration we assess the likeliness of the crash of the Dow Jones on October 19, 1987, a loss that was more than twice as large as on any other single day from 1954 until 2004.exceedances;extreme value theory;heavy tails;maxima

    Unbiased Tail Estimation by an Extension of the Generalized Pareto Distribution

    Get PDF
    AMS classifications: 62G20; 62G32;bias;exchange rate;heavy tails;peaks-over-threshold;regular variation;tail index

    Semiparametric Lower Bounds for Tail Index Estimation

    Get PDF
    indexation;semiparametric estimation

    Modelling Censored Losses Using Splicing: a Global Fit Strategy With Mixed Erlang and Extreme Value Distributions

    Full text link
    In risk analysis, a global fit that appropriately captures the body and the tail of the distribution of losses is essential. Modelling the whole range of the losses using a standard distribution is usually very hard and often impossible due to the specific characteristics of the body and the tail of the loss distribution. A possible solution is to combine two distributions in a splicing model: a light-tailed distribution for the body which covers light and moderate losses, and a heavy-tailed distribution for the tail to capture large losses. We propose a splicing model with a mixed Erlang (ME) distribution for the body and a Pareto distribution for the tail. This combines the flexibility of the ME distribution with the ability of the Pareto distribution to model extreme values. We extend our splicing approach for censored and/or truncated data. Relevant examples of such data can be found in financial risk analysis. We illustrate the flexibility of this splicing model using practical examples from risk measurement
    corecore