Extreme Value Theory

Interactive exploration of Extreme Value Theory (EVT) for modelling distribution tails and computing VaR and ES at high confidence levels

Extreme Value Theory (EVT) provides a rigorous mathematical framework for modelling the tails of loss distributions. Standard statistical methods that work well for the centre of a distribution may perform poorly in the tails, where data are sparse. EVT offers principled methods for smoothing the empirical tail distribution and extrapolating beyond observed data to higher confidence levels (Hull 2023, chap. 12; Christoffersen 2012, chap. 6; McNeil et al. 2015, chap. 5).

The key result (Gnedenko’s theorem) states that for a wide class of distributions, the excess distribution over a high threshold converges to a Generalized Pareto Distribution (GPD). This is analogous to how the central limit theorem shows that normalized sums converge to the normal distribution, but EVT applies to extremes rather than averages.

1. The Generalized Pareto Distribution

The GPD is the cornerstone of practical EVT applications. For losses exceeding a threshold \(u\), the excess distribution converges to:

\[ G_{\xi,\beta}(y) = \begin{cases} 1 - \left(1 + \xi \frac{y}{\beta}\right)^{-1/\xi}, & \text{if } \xi \neq 0 \\ 1 - \exp\left(-\frac{y}{\beta}\right), & \text{if } \xi = 0 \end{cases} \]

where \(x\) is the loss, \(u\) is the threshold, \(y = x - u \geq 0\) is the excess of the loss over the threshold, \(\xi\) is the shape parameter (determines tail heaviness), and \(\beta > 0\) is the scale parameter.

The shape parameter \(\xi\) determines the type of tail decay:

  • \(\xi > 0\): Heavy tails (Frechet class). Power-law decay. Higher moments may be infinite. This is the typical case for financial data, with \(\xi\) usually between 0.1 and 0.4.
  • \(\xi = 0\): Exponential tails (Gumbel class). The normal distribution belongs here.
  • \(\xi < 0\): Short tails (Weibull class). Finite right endpoint at \(y = -\beta/\xi\). Less relevant for financial losses.

The scale parameter \(\beta\) controls the spread of excesses over the threshold. Larger \(\beta\) means that excesses tend to be larger: the mean of the GPD is \(\beta/(1-\xi)\) (when \(\xi < 1\)), so \(\beta\) directly scales the expected overshoot above the threshold.

Tip

How to experiment

Increase \(\xi\) from 0 to 0.4 and watch the tail thicken in the PDF plot, the survival function decay more slowly (power law vs. exponential), and high-order moments become infinite. Compare positive \(\xi\) values to the \(\xi = 0\) exponential reference (dashed line).

2. Threshold selection and the mean excess plot

The choice of threshold \(u\) involves a bias-variance tradeoff. A threshold that is too low gives many exceedances (low variance) but the GPD approximation may not hold (high bias). A threshold that is too high gives few exceedances (high variance) but better GPD approximation (low bias).

The mean excess plot is a key diagnostic. Given \(n\) loss observations \(v_1, \ldots, v_n\), let \(n_u\) denote the number of observations exceeding the threshold \(u\). The sample mean excess function

\[ e_n(u) = \frac{1}{n_u} \sum_{i: v_i > u} (v_i - u) \]

plotted against \(u\), should become approximately linear above a good threshold. An upward linear trend indicates \(\xi > 0\) (heavy tails).

The Hill estimator provides a simple estimate of \(\xi\) (for \(\xi > 0\)), using the \(n_u\) observations \(v_i\) that exceed the threshold \(u\):

\[ \hat{\xi} = \frac{1}{n_u} \sum_{i=1}^{n_u} \ln\left(\frac{v_i}{u}\right) \]

Note

Simulated data. We simulate \(n\) i.i.d. losses from a Pareto distribution, whose tail is exactly a Generalized Pareto Distribution with known parameters \(\xi\) and \(\beta\). This means the GPD approximation holds exactly above any threshold, and we can compare estimates against the true values without the complication of slow asymptotic convergence that affects distributions like the Student-\(t\).

Tip

How to experiment

Increase \(\xi\) to make the tail heavier. In the mean excess plot, look for where the plot becomes approximately linear, and move the threshold slider there. In the Hill plot, look for a stable plateau around the true \(\xi\). Compare the Hill and GPD MLE estimates in the parameter table. Click “New sample” repeatedly to see sampling variability.

3. EVT VaR and ES across confidence levels

Given \(n\) total observations, of which \(n_u\) exceed the threshold \(u\), the EVT formulas for VaR and ES at confidence level \(q\) are:

\[ \text{VaR}_q = u + \frac{\beta}{\xi}\left\{\left[\frac{n}{n_u}(1-q)\right]^{-\xi} - 1\right\} \]

\[ \text{ES}_q = \frac{\text{VaR}_q + \beta - \xi u}{1-\xi} \]

A key result is the asymptotic ES/VaR ratio:

\[ \lim_{q \to 1} \frac{\text{ES}_q}{\text{VaR}_q} = \frac{1}{1-\xi} \]

For fat-tailed distributions (\(\xi > 0\)), this ratio exceeds the normal ratio of approximately 1.15 (at 99%), making ES particularly valuable as a risk measure.

The illustrations below use a threshold \(u = 200\) ($000s), scale parameter \(\beta = 80\) ($000s), and \(n_u = 5\%\) of \(n\) exceedances, following the example in Hull (2023, chap. 12). The key control is the shape parameter \(\xi\), which drives the divergence between EVT and Normal risk measures.

Tip

How to experiment

Increase \(\xi\) and watch the EVT VaR and ES diverge from the Normal at high confidence levels. In the ES/VaR ratio tab, observe convergence to \(1/(1-\xi)\). The “Extrapolation power” tab shows why EVT matters: at 99.9% confidence with 500 observations, standard historical simulation is impossible, but EVT gives a principled estimate.

4. Normal vs. EVT tail comparison

Even when two distributions have the same VaR, their tail behaviour beyond VaR can be dramatically different. This section calibrates a Normal distribution to have the same 1% VaR as the GPD, then compares the tail shapes (Christoffersen 2012, sec. 6.8).

Tip

How to experiment

Increase \(\xi\) and watch the EVT tail extend far beyond the Normal tail. Even though both distributions have the same 1% VaR by construction, the EVT ES is much larger. The probability table shows orders-of-magnitude differences in the probability of extreme losses.

5. Conditional EVT (GARCH + GPD)

For time series with volatility clustering, EVT is applied to the standardized residuals from a GARCH model rather than to raw returns. This is called conditional EVT (Christoffersen 2012, sec. 6.8; Hull 2023, chap. 12):

  1. Fit a GARCH model to the return series \(R_t\) to obtain the conditional volatility \(\hat{\sigma}_t\)
  2. Extract standardized residuals: \(\hat{z}_t = R_t / \hat{\sigma}_t\), which should be approximately i.i.d.
  3. Apply EVT to the tail of the \(\hat{z}_t\) distribution (fit GPD)
  4. Compute conditional risk measures, scaling by the volatility forecast \(\hat{\sigma}_{t+1}\):

\[ \text{VaR}_q^t = \hat{\sigma}_{t+1} \cdot \text{VaR}_q(z), \quad \text{ES}_q^t = \hat{\sigma}_{t+1} \cdot \text{ES}_q(z) \]

This combines GARCH’s ability to capture volatility dynamics with EVT’s rigorous tail modelling.

Note

Simulated data. A GARCH(1,1) process is simulated with standardized Student-\(t\) shocks. The GARCH parameters and shock distribution are known (since we generated the data). In practice, both the GARCH model and the EVT parameters would need to be estimated. The FHS method simulates 5,000 next-day returns by drawing standardized residuals with replacement and scaling by \(\hat{\sigma}_{t+1}\); VaR and ES are then computed from this simulated distribution.

Tip

How to experiment

Lower the degrees of freedom \(d\) to make the shocks heavier-tailed, and observe that GARCH+EVT captures this while GARCH-Normal underestimates tail risk. Increase the volatility forecast \(\hat{\sigma}_{t+1}\) to see all conditional measures scale proportionally. Compare how the four methods behave at different confidence levels in the “Confidence sweep” tab.

References

Christoffersen, Peter F. 2012. Elements of Financial Risk Management. 2nd ed. Academic Press.
Hull, John. 2023. Risk Management and Financial Institutions. 6th ed. John Wiley & Sons.
McNeil, Alexander J., Rüdiger Frey, and Paul Embrechts. 2015. Quantitative Risk Management: Concepts, Techniques and Tools. Revised. Princeton University Press.