Exploiting the information content of hydrological "outliers" for goodness-of-fit testing

10Citations
Citations of this article
23Readers
Mendeley users who have this article in their library.

Abstract

Validation of probabilistic models based on goodness-of-fit tests is an essential step for the frequency analysis of extreme events. The outcome of standard testing techniques, however, is mainly determined by the behavior of the hypothetical model, FΧ(x) in the central part of the distribution, while the behavior in the tails of the distribution, which is indeed very relevant in hydrological applications, is relatively unimportant for the results of the tests. The maximum-value test, originally proposed as a technique for outlier detection, is a suitable, but seldom applied, technique that addresses this problem. The test is specifically targeted to verify if the maximum (or minimum) values in the sample are consistent with the hypothesis that the distribution FΧ(x) is the real parent distribution. The application of this test is hindered by the fact that the critical values for the test should be numerically obtained when the parameters of F Χ(x) are estimated on the same sample used for verification, which is the standard situation in hydrological applications. We propose here a simple, analytically explicit, technique to suitably account for this effect, based on the application of censored L-moments estimators of the parameters. We demonstrate, with an application that uses artificially generated samples, the superiority of this modified maximum-value test with respect to the standard version of the test. We also show that the test has comparable or larger power with respect to other goodness-of-fit tests (e.g., chi-squared test, Anderson-Darling test, Fung and Paul test), in particular when dealing with small samples (sample size lower than 20-25) and when the parent distribution is similar to the distribution being tested. © 2010 Author(s).

Figures

  • Table 1. Probability models considered in this paper. 0[·] is the gamma function.
  • Table 2. Distribution dependent functions to be used for parameter estimation, as defined in Eqs. (6 and 10). 0[·] is the gamma function, 0[·;·] is the incomplete gamma function, and Norminv is the inverse of the standardized Gaussian cumulative distribution function.
  • Fig. 1. Power of the tests considered in this paper (MMV = modified maximum value; SMV = standard maximum value; AD = AndersonDarling; CHI = chi-squared; FP = Fung-Paul) as a function of the shape parameter θ3 of the parent GEV distribution. The hypothetical distribution is Gumbel, the sample size is n=20 in panel (A) and n=50 in panel (B).
  • Fig. 2. Comparison of a GEV(dashed line) and a Gumbel (solid line) distribution function sharing the first two L-moments. The design event xT is plotted as a function of return period T (in logarithmic scale). The parameter values for the GEV are θ1=0, θ2=1, θ3=0.15 in panel (A) and θ1=0, θ2=1, θ3=0.45 in panel (B). The parameters of the Gumbel distribution are found by matching the first two distribution L-moments to those of the GEV distribution.
  • Fig. 3. Power of the modified maximum-value (MMV) and Anderson-Darling (AD) tests for a TCEV parent distribution. In panel (A) θ4=10, in panel (B) θ4=5, and in panel (C) θ4=2. The sample size is n=20.

Cited by Powered by Scopus

145Citations
187Readers

This article is free to access.

Get full text
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Laio, F., Allamano, P., & Claps, P. (2010). Exploiting the information content of hydrological “outliers” for goodness-of-fit testing. Hydrology and Earth System Sciences, 14(10), 1909–1917. https://doi.org/10.5194/hess-14-1909-2010

Readers over time

‘10‘11‘12‘14‘15‘16‘17‘18‘19‘20‘21‘2402468

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 11

52%

Researcher 7

33%

Professor / Associate Prof. 3

14%

Readers' Discipline

Tooltip

Earth and Planetary Sciences 7

35%

Engineering 6

30%

Environmental Science 6

30%

Agricultural and Biological Sciences 1

5%

Save time finding and organizing research with Mendeley

Sign up for free
0