10,080 research outputs found

    Sparse Structures for Multivariate Extremes

    Full text link
    Extreme value statistics provides accurate estimates for the small occurrence probabilities of rare events. While theory and statistical tools for univariate extremes are well-developed, methods for high-dimensional and complex data sets are still scarce. Appropriate notions of sparsity and connections to other fields such as machine learning, graphical models and high-dimensional statistics have only recently been established. This article reviews the new domain of research concerned with the detection and modeling of sparse patterns in rare events. We first describe the different forms of extremal dependence that can arise between the largest observations of a multivariate random vector. We then discuss the current research topics including clustering, principal component analysis and graphical modeling for extremes. Identification of groups of variables which can be concomitantly extreme is also addressed. The methods are illustrated with an application to flood risk assessment

    Using a priori knowledge to construct copulas

    Full text link
    Our purpose is to model the dependence between two random variables, taking into account a priori knowledge on these variables. For example, in many applications (oceanography, finance...), there exists an order relation between the two variables; when one takes high values, the other cannot take low values, but the contrary is possible. The dependence for the high values of the two variables is, therefore, not symmetric. However a minimal dependence also exists: low values of one variable are associated with low values of the other variable. The dependence can also be extreme for the maxima or the minima of the two variables. In this paper, we construct step by step asymmetric copulas with asymptotic minimal dependence, and with or without asymptotic maximal dependence, using mixture variables to get at first asymmetric dependence and then minimal dependence. We fit these models to a real dataset of sea states and compare them using Likelihood Ratio Tests when they are nested, and BIC- criterion (Bayesian Information criterion) otherwise

    Exploratory data analysis for moderate extreme values using non-parametric kernel methods

    Full text link
    In many settings it is critical to accurately model the extreme tail behaviour of a random process. Non-parametric density estimation methods are commonly implemented as exploratory data analysis techniques for this purpose as they possess excellent visualisation properties, and can naturally avoid the model specification biases implied by using parametric estimators. In particular, kernel-based estimators place minimal assumptions on the data, and provide improved visualisation over scatterplots and histograms. However kernel density estimators are known to perform poorly when estimating extreme tail behaviour, which is important when interest is in process behaviour above some large threshold, and they can over-emphasise bumps in the density for heavy tailed data. In this article we develop a transformation kernel density estimator, and demonstrate that its mean integrated squared error (MISE) efficiency is equivalent to that of standard, non-tail focused kernel density estimators. Estimator performance is illustrated in numerical studies, and in an expanded analysis of the ability of well known global climate models to reproduce observed temperature extremes in Sydney, Australia

    High-dimensional peaks-over-threshold inference

    Full text link
    Max-stable processes are increasingly widely used for modelling complex extreme events, but existing fitting methods are computationally demanding, limiting applications to a few dozen variables. rr-Pareto processes are mathematically simpler and have the potential advantage of incorporating all relevant extreme events, by generalizing the notion of a univariate exceedance. In this paper we investigate score matching for performing high-dimensional peaks over threshold inference, focusing on extreme value processes associated to log-Gaussian random functions and discuss the behaviour of the proposed estimators for regularly-varying distributions with normalized marginals. Their performance is assessed on grids with several hundred locations, simulating from both the true model and from its domain of attraction. We illustrate the potential and flexibility of our methods by modelling extreme rainfall on a grid with 36003600 locations, based on risks for exceedances over local quantiles and for large spatially accumulated rainfall, and briefly discuss diagnostics of model fit. The differences between the two fitted models highlight the importance of the choice of risk and its impact on the dependence structure

    Modeling asymptotically independent spatial extremes based on Laplace random fields

    Full text link
    We tackle the modeling of threshold exceedances in asymptotically independent stochastic processes by constructions based on Laplace random fields. These are defined as Gaussian random fields scaled with a stochastic variable following an exponential distribution. This framework yields useful asymptotic properties while remaining statistically convenient. Univariate distribution tails are of the half exponential type and are part of the limiting generalized Pareto distributions for threshold exceedances. After normalizing marginal tail distributions in data, a standard Laplace field can be used to capture spatial dependence among extremes. Asymptotic properties of Laplace fields are explored and compared to the classical framework of asymptotic dependence. Multivariate joint tail decay rates for Laplace fields are slower than for Gaussian fields with the same covariance structure; hence they provide more conservative estimates of very extreme joint risks while maintaining asymptotic independence. Statistical inference is illustrated on extreme wind gusts in the Netherlands where a comparison to the Gaussian dependence model shows a better goodness-of-fit in terms of Akaike's criterion. In this specific application we fit the well-adapted Weibull distribution as univariate tail model, such that the normalization of univariate tail distributions can be done through a simple power transformation of data

    Some variations of EM algorithms for Marshall-Olkin bivariate Pareto distribution with location and scale

    Full text link
    Recently Asimit et. al used an EM algorithm to estimate Marshall-Olkin bivariate Pareto distribution. The distribution has seven parameters. We describe few alternative approaches of EM algorithm. A numerical simulation is performed to verify the performance of different proposed algorithms. A real-life data analysis is also shown for illustrative purposes

    Factor Copula Models for Replicated Spatial Data

    Full text link
    We propose a new copula model that can be used with replicated spatial data. Unlike the multivariate normal copula, the proposed copula is based on the assumption that a common factor exists and affects the joint dependence of all measurements of the process. Moreover, the proposed copula can model tail dependence and tail asymmetry. The model is parameterized in terms of a covariance function that may be chosen from the many models proposed in the literature, such as the Matern model. For some choice of common factors, the joint copula density is given in closed form and therefore likelihood estimation is very fast. In the general case, one-dimensional numerical integration is needed to calculate the likelihood, but estimation is still reasonably fast even with large data sets. We use simulation studies to show the wide range of dependence structures that can be generated by the proposed model with different choices of common factors. We apply the proposed model to spatial temperature data and compare its performance with some popular geostatistics models.Comment: 34 pages, 3 tables and 5 figure

    Robust Estimation of Bivariate Tail Dependence Coefficient

    Full text link
    The problem of estimating the coefficient of bivariate tail dependence is considered here from the robustness point of view; it combines two apparently contradictory theories of robust statistics and extreme value statistics. The usual maximum likelihood based or the moment type estimators of tail dependence coefficient are highly sensitive to the presence of outlying observations in data. This paper proposes some alternative robust estimators obtained by minimizing the density power divergence with suitable model assumptions; their robustness properties are examined through the classical influence function analysis. The performance of the proposed estimators is illustrated through an extensive empirical study considering several important bivariate extreme value distributions.Comment: Pre-Print, 20 page

    A direct verification argument for the Hamilton-Jacobi equation continuum limit of nondominated sorting

    Full text link
    Nondominated sorting is a combinatorial algorithm that sorts points in Euclidean space into layers according to a partial order. It was recently shown that nondominated sorting of random points has a Hamilton-Jacobi equation continuum limit. The original proof relies on a continuum variational problem. In this paper, we give a new proof using a direct verification argument that completely avoids the variational interpretation. We believe this proof is new in the homogenization literature, and may be generalized to apply to other stochastic homogenization problems for which there is no obvious underlying variational principle

    Simple models for multivariate regular variations and the H\"usler-Reiss Pareto distribution

    Full text link
    We revisit multivariate extreme value theory modeling by emphasizing multivariate regular variations and the multivariate Breiman Lemma. This allows us to recover in a simple framework the most popular multivariate extreme value distributions, such as the logistic, negative logistic, Dirichlet, extremal-tt and H\"usler-Reiss models. In a second part of the paper, we focus on the H\"usler-Reiss Pareto model and its surprising exponential family property. After a thorough study of this exponential family structure, we focus on maximum likelihood estimation. We also consider the generalized H\"usler-Reiss Pareto model with different tail indices and a likelihood ratio test for discriminating constant tail index versus varying tail indices
    • …
    corecore