57 research outputs found

    On the informative value of the largest sample element of log-Gumbel distribution

    Get PDF
    Extremes of stream flow and precipitation are commonly modeled by heavytailed distributions. While scrutinizing annual flow maxima or the peaks over threshold, the largest sample elements are quite often suspected to be low quality data, outliers or values corresponding to much longer return periods than the observation period. Since the interest is primarily in the estimation of the right tail (in the case of floods or heavy rainfalls), sensitivity of upper quantiles to largest elements of a series constitutes a problem of special concern. This study investigated the sensitivity problem using the log-Gumbel distribution by generating samples of different sizes (n) and different values of the coefficient of variation by Monte Carlo experiments. Parameters of the log-Gumbel distribution were estimated by the probability weighted moments (PWMs) method, method of moments (MOMs) and maximum likelihood method (MLM), both for complete samples and the samples deprived of their largest elements. In the latter case, the distribution censored by the non-exceedance probability threshold, FT , was considered. Using FT instead of the censored threshold T creates possibility of controlling estimator property. The effect of the FT value on the performance of the quantile estimates was then examined. It is shown that right censoring of data need not reduce an accuracy of large quantile estimates if the method of PWMs or MOMs is employed. Moreover allowing bias of estimates one can get the gain in variance and in mean square error of large quantiles even if ML method is used

    Flood frequency analysis supported by the largest historical flood

    No full text
    The use of non-systematic flood data for statistical purposes depends on the reliability of the assessment of both flood magnitudes and their return period. The earliest known extreme flood year is usually the beginning of the historical record. Even if one properly assesses the magnitudes of historic floods, the problem of their return periods remains unsolved. The matter at hand is that only the largest flood (XM) is known during whole historical period and its occurrence marks the beginning of the historical period and defines its length (<i>L</i>). It is common practice to use the earliest known flood year as the beginning of the record. It means that the <i>L</i> value selected is an empirical estimate of the lower bound on the effective historical length <i>M</i>. The estimation of the return period of XM based on its occurrence (<i>L</i>), i.e. <span style="position:relative; margin-left:-0.45m; top:-0.3em;">^</span><span style="position:relative; margin-left:-1.0em; top:0.3em;"><i>M</i></span> = <i>L</i>, gives a severe upward bias. The problem arises that to estimate the time period (<i>M</i>) representative of the largest observed flood XM. <br><br> From the discrete uniform distribution with support 1, 2, ... , <i>M</i> of the probability of the <i>L</i> position of XM, one gets <span style="position: relative; margin-left: -0.45m; top: -0.3em;">^</span><span style="position:relative; margin-left:-1.0em; top:0.3em;"><i>L</i></span> = <i>M</i>/2. Therefore <span style="position: relative; margin-left: -0.45m; top: -0.3em;">^</span><span style="position:relative; margin-left:-1.0em; top:0.3em;"><i>M</i></span> = 2<i>L</i> has been taken as the return period of XM and as the effective historical record length as well this time. As in the systematic period (<i>N</i>) all its elements are smaller than XM, one can get <span style="position: relative; margin-left: -0.45m; top: -0.3em;">^</span><i><span style="position:relative; margin-left:-1.0em; top:0.3em;">M</i></span> = 2<i>t</i>( <i>L</i>+<i>N</i>). <br><br> The efficiency of using the largest historical flood (XM) for large quantile estimation (i.e. one with return period <i>T</i> = 100 years) has been assessed using the maximum likelihood (ML) method with various length of systematic record (<i>N</i>) and various estimates of the historical period length <span style="position: relative; margin-left: -0.45m; top: -0.3em;">^</span><span style="position:relative; margin-left:-1.0em; top:0.3em;"><i>M</i> </span> comparing accuracy with the case when systematic records alone (<i>N</i>) are used only. The simulation procedure used for the purpose incorporates <i>N</i> systematic record and the largest historic flood (XM<sub><i>i</i></sub>) in the period <i>M</i>, which appeared in the <i>L</i><sub><i>i</i></sub> year of the historical period. The simulation results for selected two-parameter distributions, values of their parameters, different <i>N</i> and <i>M</i> values are presented in terms of bias and root mean square error RMSEs of the quantile of interest are more widely discussed

    On accuracy of upper quantiles estimation

    No full text
    Flood frequency analysis (FFA) entails the estimation of the upper tail of a probability density function (PDF) of annual peak flows obtained from either the annual maximum series or partial duration series. In hydrological practice, the properties of various methods of upper quantiles estimation are identified with the case of known population distribution function. In reality, the assumed hypothetical model differs from the true one and one cannot assess the magnitude of error caused by model misspecification in respect to any estimated statistics. The opinion about the accuracy of the methods of upper quantiles estimation formed from the case of known population distribution function is upheld. The above-mentioned issue is the subject of the paper. The accuracy of large quantile assessments obtained from the four estimation methods is compared to two-parameter log-normal and log-Gumbel distributions and their three-parameter counterparts, i.e., three-parameter log-normal and GEV distributions. The cases of true and false hypothetical models are considered. The accuracy of flood quantile estimates depends on the sample size, the distribution type (both true and hypothetical), and strongly depends on the estimation method. In particular, the maximum likelihood method loses its advantageous properties in case of model misspecification

    Physics Of Flood Frequency Analysis. Part I. Linear Convective Diffusion Wave Model

    No full text
    It is hypothesized that the impulse response of a linearized convective diffusion wave (CD) model is a probability distribution suitable for flood frequency analysis. This flood frequency model has two parameters, which are derived using the methods of moments and maximum likelihood. Also derived are errors in quantiles for these methods of parameter estimation. The distribution shows an equivalency of the two estimation methods with respect to the mean value - an important property in the case of unknown true distribution function. As the coefficient of variation tends to zero (with the mean fixed), the distribution tends to a normal one, similar to the lognormal and gamma distributions

    Model Error In Flood Frequency Estimation

    No full text
    Asymptotic bias in large quantiles and moments for three parameter estimation methods, including the maximum likelihood method (MLM), moments method (MOM) and linear moments method (LMM), is derived when a probability distribution function (PDF) is falsely assumed. It is illustrated using an alternative set of PDFs consisting of five two-parameter PDFs that are lower-bounded at zero, i.e., Log-Gumbel (LG), Log-logistic (LL), Log-normal (LN), Linear Diffusion (LD) and Gamma (Ga) distribution functions. The stress is put on applicability of LG and LL in the real conditions, where the hypothetical distribution (H) differs from the true one (T). Therefore, the following cases are considered: H=LG; T=LL, LN, LD and Ga, and H=LL, LN, LD and Ga, T=LG. It is shown that for every pair (H;T) and for every method, the relative bias (RB) of moments and quantiles corresponding to the upper tail is an increasing function of the true value of the coefficient of variation (cᵧ), except that RB of moments for MOM is zero. The value of RB is smallest for MOM and the largest for MLM. The bias of LMM occupies an intermediate position. Since MLM used as the approximation method is irreversible, the asymptotic bias of the MLM-estimate of any statistical characteristic is not asymmetric as is for the MOM and LMM. MLM turns out to be the worst method if the assumed LG or LL distribution is not the true one. It produces a huge bias of upper quantiles, which is at least one order higher than that of the other two methods. However, the reverse case, i.e., acceptance of LN, LD or Ga as a hypothetical distribution while LG or LL as the true one, gives the MLM-bias of reasonable magnitude in upper quantiles. Therefore, one should be highly reluctant in choosing the LG and LL in flood frequency analysis, especially if MLM is to be applied
    • …
    corecore