Correcting for model changes in statistical postprocessing – an approach based on response theory

Demaeyer, Jonathan; Vannitsem, Stéphane

doi:https://doi.org/10.5194/npg-27-307-2020

Articles | Volume 27, issue 2

https://doi.org/10.5194/npg-27-307-2020

Special issue:

Advances in post-processing and blending of deterministic...

https://doi.org/10.5194/npg-27-307-2020

Articles | Volume 27, issue 2

Research article

27 May 2020

Research article |

| 27 May 2020

Correcting for model changes in statistical postprocessing – an approach based on response theory

Jonathan Demaeyer and Stéphane Vannitsem

Abstract

For most statistical postprocessing schemes used to correct weather forecasts, changes to the forecast model induce a considerable reforecasting effort. We present a new approach based on response theory to cope with slight model changes. In this framework, the model change is seen as a perturbation of the original forecast model. The response theory allows us then to evaluate the variation induced on the parameters involved in the statistical postprocessing, provided that the magnitude of this perturbation is not too large. This approach is studied in the context of a simple Ornstein–Uhlenbeck model and then on a more realistic, yet simple, quasi-geostrophic model. The analytical results for the former case help to pose the problem, while the application to the latter provides a proof of concept and assesses the potential performance of response theory in a chaotic system. In both cases, the parameters of the statistical postprocessing used – the Error-in-Variables Model Output Statistics (EVMOS) method – are appropriately corrected when facing a model change. The potential application in an operational environment is also discussed.

Download & links

Article (PDF, 5440 KB)

Supplement (2804 KB)

Download & links

How to cite.

Received: 14 Nov 2019 – Discussion started: 21 Nov 2019 – Revised: 17 Apr 2020 – Accepted: 22 Apr 2020 – Published: 27 May 2020

1 Introduction

A generic property of the atmospheric dynamics is its sensitivity to initial conditions. This implies that probabilistic forecasts is necessary to adequately describe this behaviour (Kalnay, 2003; Wilks, 2011). Indeed, these methods represent a way to go beyond the natural predictability barrier that the chaotic atmospheric models exhibit (Vannitsem, 2017). These forecasts are at the same time subject to the impact of the presence of structural uncertainties, also known as “model errors”. Such errors degrade the forecasts as well, and their impact needs to be mitigated.

Statistical postprocessing methods are used to correct the operational predictions of the atmospheric models. An important family of statistical techniques used to postprocess the forecasts are linear-regression techniques, possibly with multiple predictors (Glahn and Lowry, 1972; Vannitsem and Nicolis, 2008), also known as model output statistics (MOS). This rather simple but very efficient technique can be adapted to ensemble forecasts (e.g. Vannitsem, 2009; Johnson and Bowler, 2009; Glahn et al., 2009; Van Schaeybroeck and Vannitsem, 2015). One of the first approaches that was proposed is called the Error-in-Variable MOS (EVMOS) method because it takes into account the presence of errors in both the observations and model observables (Vannitsem, 2009).

Despite their simplicity, most postprocessing schemes depend on the availability of a database of past forecasts, which allows one to train the regression algorithm by comparison with the observations database. Operational models are however subject to frequent evolution cycles, which are needed to improve their representation of the atmospheric processes. Therefore, there is a continuous need to recompute forecasts starting from past initial conditions with the latest model version to avoid a degradation of the postprocessing schemes due to model change. Such a recomputation of the past forecasts are called “reforecasts” and typically requires a huge data storage and management framework, as well as many computational resources (Hamill, 2018). For instance, the European Centre for Medium-range Weather Forecast (ECMWF) and the National Weather Service (NWS) both produce hundreds of reforecasts every week (Hamill et al., 2013).

Recent research has investigated non-homogeneous regression with a time-adaptative training scheme, for which a trade-off between large training data sets for stable estimates and the benefit of a shorter training period for faster adaptation to data changes is considered (Lang et al., 2020). These results can help mitigate the impact of model change on postprocessing and may call into question the need for reforecast systems. These systems do however help to better represent rare events; they increase the size of the training data sets and greatly improve sub-seasonal forecasts (Scheuerer and Hamill, 2015; Hamill, 2018), which can justify their very high cost.

The present work investigates another research direction and considers a new technique to reduce the cost of adapting a postprocessing scheme to a model change. This method relies on the response theory for dynamical systems (Ruelle, 2009) and assumes that the model change can be written as analytical perturbations of the model tendencies. In this context, parameter modifications as well as new terms in the tendencies are potential model changes.

In Sect. 2, we start by introducing the Ruelle response theory that is used to adapt past postprocessing parameters to new model versions. A didactical example of such an adaptation is considered with a simple Ornstein–Uhlenbeck model in Sect. 3. It is used to describe the methodology and the concept involved. We show that obtaining a new postprocessing scheme after a model change requires the computation of the response of the average of the involved predictors, seen as observables of the system. In the simple case considered, exact analytical results for the response can be obtained up to any order. The correction of the model observables and the postprocessing parameters due to the model change only requires the response-theory corrections up to the second order.

In Sect. 4, a more complex case is considered with a toy model of atmospheric variability in the form of a two-layer quasi-geostrophic model with an orography. We compute the linear response of the parameters of the postprocessing scheme for two model change experiments involving a modification of the friction and the horizontal temperature gradient of the model. The response-theory approach provides an efficient correction of the postprocessing scheme up to a lead time of 4 d, which matches the lead-time window where the scheme's correction is efficient.

In the last section, we discuss the implications that this new method could have on operational forecast postprocessing systems, as well as new research avenues.

2 Response theory

The systems used to produce the weather forecasts are typically non-linear dynamical systems whose time evolution is governed by multi-dimensional ordinary differential equations:

\begin{matrix} (1) & \dot{y} = F (t, y) . \end{matrix}

The generic chaotic nature of these systems for some parameter values implies that they are sensitive to the initial data used to produce the forecasts. For such chaotic dynamical systems, one can assume that a well-defined time-invariant measure exists with which the averages are performed. However, the existence of such measures has been proved for systems that are uniformly hyperbolic, and they are called Sinai–Ruelle–Bowen (SRB) measures (Young, 2002), but rigorous proofs for other systems are rather difficult to obtain. A way to proceed is then to continue as if physical systems were uniformly hyperbolic. This assumption is called the Gallavotti–Cohen hypothesis (Gallavotti and Cohen, 1995 a, b). With this assumption, response theory has been successfully used in various weather and climate-related problems (Demaeyer and Vannitsem, 2018; Vissio and Lucarini, 2018; Lembo et al., 2019; Bódai et al., 2020). Indeed, the systems used to produce weather forecasts are typically not uniformly hyperbolic, but thanks to the aforementioned hypothesis, one can still use what will follow and compare with the results obtained with experiments.¹ It is the rationale behind the formal presentation of the linear response theory for general systems like Eq. (1) in Ruelle (1998 a). The main concepts that will be used in this article are now introduced.

2.1 Perturbations of dynamical systems

We shall assume for simplicity that the system defined by Eq. (1) is autonomous and given by

\begin{matrix} (2) & \dot{y} = F (y) . \end{matrix}

In the general setting considered, let us assume that any given probability measure converges to a unique invariant measure ρ under the time evolution given by the Liouville equation of Eq. (2). This measure is used to compute the average of an arbitrary observable A (a smooth function of the state y) of the system, which is given by

\begin{matrix} (3) & 〈 A 〉_{y} = \int ρ (d y) A (y), \end{matrix}

and assuming the ergodicity of the system, a time average of the observable A along a trajectory of the system on its attractor can be equivalently performed as

\begin{matrix} (4) & 〈 A 〉_{y} = lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} d τ A (y (τ)), \end{matrix}

where y(τ) is a solution of Eq. (2). If a perturbation Ψ of the dynamical system is introduced in the original system at time τ=0 as

\begin{matrix} (5) & \dot{y} = F (y) + Ψ (y), \end{matrix}

it induces a perturbation of the observable's average, which at the first order is given by²

\begin{matrix} (6) & \begin{aligned} δ 〈 A (τ) 〉_{y} & = \int ρ (d y_{0}) δ A (f^{τ} (y_{0})) \\ = \int ρ (d y_{0}) δ y (τ)^{T} \cdot \nabla_{f^{τ} (y_{0})} A, \end{aligned} \end{matrix}

where f^τ is the flow of the system defined by Eq. (2) mapping an initial condition y₀ to the system's state at time τ as y(τ)=f^τ(y₀), and the capital T represents the transposition. δy is the perturbation of the trajectory of the system induced by the perturbation Ψ. This formula gives the transient response to the perturbation, and the long-time average of the integrand of Eq. (6) gives the stationary response to the perturbation, i.e. the sensitivity δ〈A〉_y of the system observables to the perturbation (Eyink et al., 2004; Wang, 2013). The higher-order corrections δ^k〈A(τ)〉_y can in principle be computed as well but are quite complicated to obtain for chaotic dynamical systems; see for instance Lucarini (2009). We will show an analytically tractable case in Sect. 3.

2.2 The tangent linear model

The linear perturbation δy of the trajectories of Eq. (2) can be computed by introducing y+δy in Eq. (5) to get at the first order

\begin{matrix} (7) & \dot{δ y} = \nabla_{y} F \cdot δ y + Ψ (y), \end{matrix}

where y is the solution of Eq. (2) and ∇_yF is the Jacobian matrix evaluated along this solution. Therefore, both Eqs. (2) and (7) have to be integrated simultaneously. In the weather forecasting context, this latter linearised equation without the perturbation term Ψ is called the tangent linear model of Eq. (2) (Kalnay, 2003). Here, Eq. (7) is initialised with δy(0)=0 and provides the linear response of the trajectory y(τ) to the perturbation Ψ. It is thus assumed that there is no interference due to initial-condition errors in the perturbation problem. Note however that the effects on the trajectories of both the initial-condition perturbation and the Ψ perturbation can be investigated through this equation by setting δy(0)≠0, although we are not aware of any study of the response to both types of perturbations together.

The tangent model provides thus the tool through which we will evaluate the impact of the model change on the average used by statistical postprocessing schemes. In other words, the tangent model will allow us to take into account the information on the model change (viewed as a perturbation of the initial model) to modify the previous postprocessing scheme and adapt it to the new model version. The solution to Eq. (7) with δy(0)=0 is by

\begin{matrix} (8) & δ y (τ) = \int_{0}^{τ} d τ^{'} M (τ - τ^{'}, f^{τ^{'}} (y_{0})) \cdot Ψ (f^{τ^{'}} (y_{0})), \end{matrix}

where M is the fundamental matrix of Eq. (7) (Gaspard, 2005; Nicolis, 2016) defined as

\begin{matrix} (9) & M (τ, y) = \nabla_{y} f^{τ}, \end{matrix}

which is the solution of the homogeneous equation $\dot{M} = \nabla_{y} F \cdot M$ . Using the chain rule, the response defined by Eq. (6) is rewritten in terms of the perturbation alone:

\begin{matrix} (10) & \begin{aligned} δ 〈 A (τ) 〉_{y} & = \int_{0}^{τ} d τ^{'} \int ρ (d y_{0}) Ψ {(f^{τ^{'}} (y_{0}))}^{T} \\ \cdot \nabla_{f^{τ^{'}} (y_{0})} A (f^{τ} (y_{0})), \end{aligned} \end{matrix}

where the causality of the perturbation acting on the system and perturbing the averaged observable appears (Lucarini, 2008), since $τ^{'} < τ$ . We will also use this alternative expression throughout the article. Note that when the initial perturbation δy(0) is not equal to 0, additional terms to Eqs. (8) and (10) will appear. These will not be addressed here, but some references to this can be found in Nicolis et al. (2009) and Nicolis (2016).

2.3 Non-stationary response theory

Equation (6) gives the transient, non-stationary response to the perturbation, evaluated for averages computed with the invariant measure. However, in this work, we need to evaluate the response to perturbations for averages computed with non-stationary measures evolving in time. In that sense, it is a “non-stationary response theory”, which is performed with an arbitrary initial probability density. As such, all the formulas presented are valid if the measure being used is the measure at the time when the perturbation is introduced (τ=0), as shown in Appendix A. In this case, other usual formulas obtained through substitution, for instance to obtain an adjoint representation of Eq. (10), should be used with care, since the measure is no longer invariant and an extra Jacobian term appears in the integrand.

We will thus also assume that the measures ρ_τ being used are absolutely continuous with respect to the Lebesgue measure. In this case, we can write ρ_τ(dy)=ρ_τ(y) dy. We now present the problem of model change in the framework of postprocessing and show on a simple stochastic model³ how response theory allows us to tackle the issue.

3 A simple analytical example

In order to get a first impression of the impact of a model change on a postprocessing scheme, we consider two Ornstein–Uhlenbeck processes representing reality x(τ) and a model y(τ) of reality. These processes obey the following equations:

\begin{matrix} (11) & \dot{x} (τ) = - λ_{x} x (τ) + K_{x} + Q_{x} ξ_{x} (τ), \\ (12) & \dot{y} (τ) = - λ_{y} y (τ) + K_{y} + Q_{y} ξ_{y} (τ), \end{matrix}

where ξ_x and ξ_y are Gaussian white-noise processes such that

\begin{aligned} 〈 ξ_{x} (τ) 〉 = 〈 ξ_{y} (τ) 〉 & = 0 \\ 〈 ξ_{x} (τ) ξ_{x} (τ^{'}) 〉 & = δ (τ - τ^{'}) \\ 〈 ξ_{y} (τ) ξ_{y} (τ^{'}) 〉 & = δ (τ - τ^{'}) \\ 〈 ξ_{x} (τ) ξ_{y} (τ^{'}) 〉 & = 0 . \end{aligned}

These are therefore uncorrelated Ornstein–Uhlenbeck processes with noise amplitudes Q_x and Q_y.

We then consider a change Ψ_y of model y(τ), possibly improving or degrading the forecast performances as

\begin{matrix} (13) & \dot{\hat{y}} (τ) = - λ_{y} \hat{y} (τ) + K_{y} + Q_{y} ξ_{y} (τ) + Ψ_{y} (τ), \end{matrix}

where

\begin{matrix} (14) & Ψ_{y} (τ) = - κ (δ K + δ Q ξ_{y} (τ)) \end{matrix}

with $δ K = K_{y} - K_{x}$ and $δ Q = Q_{y} - Q_{x}$ . It can represent, for example, a better parameterisation of subgrid-scale processes or an increase of the model resolution. Note that the best correction is obtained if κ=1.

We have thus reality x(τ) and two different models of it: y(τ) and $\hat{y} (τ)$ . We now want to evaluate the difference between a postprocessing scheme constructed before the model change (with the past forecasts of model y(τ)) and one constructed after (with the past forecasts of model $\hat{y} (τ)$ ).

3.1 The postprocessing method

We now consider a forecast situation where model y is initialised at time τ=0 with a perfect observation of reality: $y (0) = x (0) = x_{0}$ . We use the Error-in-Variables Model Output Statistics (EVMOS) postprocessing scheme (Vannitsem, 2009) to correct the forecasts of model y based on these initial conditions. In this context, given N past forecasts y_n and observations x_n, the correction of the univariate EVMOS postprocessing of variable x from a new forecast y(τ) is provided by the linear regression

\begin{matrix} (15) & y_{C} (τ) = α (τ) + β (τ) y (τ) . \end{matrix}

The coefficients α and β are obtained by minimising the functional

\begin{matrix} (16) & J (τ) = \sum_{n = 1}^{N} \frac{{[{α (τ) + β (τ) y_{n} (τ)} - x_{n} (τ)]}^{2}}{σ_{x}^{2} (τ) + β^{2} (τ) σ_{y}^{2} (τ)} \end{matrix}

and are thus given by the following equations:

\begin{array}{l} (17) & α (τ) & = 〈 x (τ) 〉 - β (τ) 〈 y (τ) 〉, \\ (18) & β (τ) & = \sqrt{\frac{σ_{x}^{2} (τ)}{σ_{y}^{2} (τ)}}, \end{array}

where

\begin{array}{l} (19) & σ_{x}^{2} (τ) & = 〈(x (τ) - 〈 x (τ) 〉)^{2}〉, \\ (20) & σ_{y}^{2} (τ) & = 〈(y (τ) - 〈 y (τ) 〉)^{2}〉, \end{array}

The averages 〈⋅〉 are taken over an ensemble of past forecasts and observations. This approach has been developed to obtain a correct climatological forecast calibration. It constitutes a simple setting in which the impact of model changes can be evaluated and corrected. More sophisticated approaches can be evaluated in the future (other linear MOS schemes, ensemble MOS, etc.).

Since we are dealing with simple analytical models here, we can compute the theoretical values of the coefficient α and β with an infinite ensemble of past forecasts, and the averaged quantities involved in this computation are then given by the averages of an infinite number of realisations of the Ornstein–Uhlenbeck processes, as if we had an infinite ensemble of past forecasts.

3.2 Averaging the Ornstein–Uhlenbeck processes

For reality x and model y, we directly get the averages (Gardiner, 2009)

\begin{matrix} (21) & 〈 x (τ) 〉 = 〈 x_{0} 〉 e^{- λ_{x} τ} + \frac{K_{x}}{λ_{x}} (1 - e^{- λ_{x} τ}), \\ (22) & σ_{x}^{2} (τ) = σ_{x_{0}}^{2} e^{- 2 λ_{x} τ} + \frac{Q_{x}^{2}}{2 λ_{x}} (1 - e^{- 2 λ_{x} τ}) \end{matrix}

and

\begin{matrix} (23) & 〈 y (τ) 〉 = 〈 x_{0} 〉 e^{- λ_{y} τ} + \frac{K_{y}}{λ_{y}} (1 - e^{- λ_{y} τ}), \\ (24) & σ_{y}^{2} (τ) = σ_{x_{0}}^{2} e^{- 2 λ_{y} τ} + \frac{Q_{y}^{2}}{2 λ_{y}} (1 - e^{- 2 λ_{y} τ}), \end{matrix}

where we note that the model is initialised with the same initial conditions as reality:

\begin{matrix} (25) & 〈 y (0) 〉 = 〈 x (0) 〉 = 〈 x_{0} 〉, σ_{y}^{2} (0) = σ_{x}^{2} (0) = σ_{x_{0}}^{2} . \end{matrix}

We get the postprocessing coefficients before the model change α(τ) and β(τ) by inserting these expressions in Eqs. (17) and (18).

Similarly, we get the same kind of results for model $\hat{y}$ , after model change Ψ_y:

\begin{matrix} (26) & 〈 \hat{y} (τ) 〉 = 〈 x_{0} 〉 e^{- λ_{y} τ} + \frac{K_{y} - κ δ K}{λ_{y}} (1 - e^{- λ_{y} τ}), \\ (27) & σ_{\hat{y}}^{2} (τ) = σ_{x_{0}}^{2} e^{- 2 λ_{y} τ} + \frac{(Q_{y} - κ δ Q)^{2}}{2 λ_{y}} (1 - e^{- 2 λ_{y} τ}) . \end{matrix}

We also obtain the postprocessing coefficients after the model change $\hat{α} (τ)$ and $\hat{β} (τ)$ (see also the analysis in Vannitsem, 2011). We can also compute the variation of the bias α:

\begin{matrix} (28) & \hat{α} (τ) - α (τ) = δ α (τ) = β (τ) 〈 y (τ) 〉 - \hat{β} (τ) 〈 \hat{y} (τ) 〉 . \end{matrix}

The ratio between the parameters β is given by

\begin{matrix} (29) & \frac{\hat{β} (τ)}{β (τ)} = \sqrt{\frac{σ_{y}^{2} (τ)}{σ_{\hat{y}}^{2} (τ)}} . \end{matrix}

For $τ ≫ max (1 / λ_{x}, 1 / λ_{y})$ , we note that this ratio tends to

\begin{matrix} (30) & \frac{\hat{β} (τ)}{β (τ)} \approx \frac{1}{1 - κ δ Q / Q_{y}}, \end{matrix}

and the difference between the biases α of the two models is approximatively given by

\begin{matrix} (31) & δ α (τ) \approx - β (τ) \frac{K_{y}}{λ_{y}} [\frac{1 - κ δ K / K_{y}}{1 - κ δ Q / Q_{y}} - 1] . \end{matrix}

Let us now assume that model change Ψ_y can be considered as a perturbation of the initial model y. Using response theory, the averages $〈 \hat{y} 〉$ and $σ_{\hat{y}}^{2}$ can be estimated using the initial model y instead of the perturbed model $\hat{y}$ . In turn, these new estimated averages give us the new postprocessing scheme coefficients $\hat{α}$ and $\hat{β}$ . We now detail the results obtained by using this method.

3.3 Model change and response theory

After the model change, the forecasts are provided by model $\hat{y}$ , and their time evolution is given by Eq. (13). This model can be seen as a perturbation of model y by the term Ψ_y given by Eq. (14). In such a case, given an observable A, its average after the model change can then be related to its original average by

\begin{matrix} (32) & 〈 A (τ) 〉_{\hat{y}} = 〈 A (τ) 〉_{y} + δ 〈 A (τ) 〉_{y} + δ^{2} 〈 A (τ) 〉_{y} + \dots, \end{matrix}

where the averages on the right-hand side are taken over the forecasts of model y. Response theory allows us to obtain the average over the model $\hat{y}$ forecasts (the left-hand side) based solely on the average over the model y forecasts. The $\hat{y}$ model forecasts are therefore not required to estimate the new postprocessing scheme.

The observables depend on the lead time τ of the forecast, as do the parameters α and β which determine the postprocessing correction for every lead time. This reflects the fact that the postprocessing problem is typically a non-stationary initial-value problem, since the initial conditions of the model Eqs. (12) and (13) are typically not chosen on their respective model attractor but rather as observations⁴ of reality defined by Eq. (11). As a consequence, the model averages of Eq. (32) relax toward the stationary response in the long-time limit, and the stationary response theory (Ruelle, 2009; Wang, 2013) cannot provide us their short-time relaxation behaviour. Instead, the Ruelle time-dependent response theory should be used (Ruelle, 1998 a). It follows that, if the perturbation (14) is small, then the first order is given by (see Sect. 2)

\begin{matrix} (33) & δ 〈 A (τ) 〉_{y} = \int_{0}^{τ} d τ^{'} \int d x_{0} ρ_{0} (x_{0}) 〈Ψ_{y} (τ^{'}) \nabla_{f^{τ^{'}} (x_{0})} A (f^{τ} (x_{0}))〉, \end{matrix}

where ρ₀ is the distribution of the initial conditions (observations) used to initialise the models. ∇_x is the gradient evaluated at the point x, and here it is the simple derivative. As indicated by Eq. (25), in the postprocessing framework, ρ₀ is taken as the stationary distribution of reality. As shown in Appendix A, Eq. (33) can be obtained through a Kubo-type perturbative expansion (Lucarini, 2008). We remark that this example deals with stochastic models, due to which we have to perform an additional averaging over the realisations of the stochastic processes, denoted here as 〈⋅〉 (Lucarini, 2012). Finally the mapping f^τ which appears in Eq. (33) is the stochastic flow

\begin{matrix} (34) & f^{τ} (x_{0}) = x_{0} e^{- λ_{y} τ} + \int_{0}^{τ} d τ^{'} e^{- λ_{y} (τ - τ^{'})} [Q_{y} ξ_{y} (τ^{'}) + K_{y}] . \end{matrix}

This maps an initial condition x₀ of model y to the state f^τ(x₀) of a realisation of this model at the later lead time τ. The principle of causality is thus implicit in Eq. (33), which estimates the impact of the perturbation Ψ_y on the subsequent perturbed-model time evolution by developing around the unperturbed-model y trajectories.

Evaluating Eq. (33) and its stochastic integrals (Gardiner, 2009) gives us the variation of the averages 〈y(τ)〉 and 〈y(τ)²〉 to the perturbation Ψ_y:

\begin{matrix} (35) & δ 〈 y (τ) 〉_{y} = - κ \int_{0}^{τ} d τ^{'} δ K e^{- λ_{y} (τ - τ^{'})} = - \frac{κ}{λ_{y}} δ K (1 - e^{- λ_{y} τ}), \end{matrix}

\begin{matrix} (36) & \begin{aligned} δ 〈 y (τ)^{2} 〉_{y} & = - 2 κ δ K \int_{0}^{τ} d τ^{'} [〈 x_{0} 〉 e^{- λ_{y} (2 τ - τ^{'})} \\ + \frac{K_{y}}{λ_{y}} e^{- λ_{y} (τ - τ^{'})} (1 - e^{- λ_{y} τ})] \\ - 2 κ δ Q Q_{y} \int_{0}^{τ} d τ^{'} e^{- 2 λ_{y} (τ - τ^{'})} \\ = - 2 κ \frac{δ K}{λ_{y}} 〈y (τ)〉 (1 - e^{- λ_{y} τ}) \\ - \frac{κ}{λ_{y}} δ Q Q_{y} (1 - e^{- 2 λ_{y} τ}) . \end{aligned} \end{matrix}

Rearranging these two terms, we also get the following expression for the variation of the variance given by Eq. (24):

\begin{matrix} (37) & \begin{aligned} δ σ_{y}^{2} (τ) & = - \frac{κ}{λ_{y}} δ Q Q_{y} (1 - e^{- 2 λ_{y} τ}) \\ - \frac{κ^{2}}{λ_{y}^{2}} δ K^{2} {(1 - e^{- λ_{y} τ})}^{2} . \end{aligned} \end{matrix}

Note that the variation given by Eq. (35) corresponds exactly to the difference between the average of the two models $〈 \hat{y} (τ) 〉 - 〈 y (τ) 〉$ . On the other hand the variation given by Eq. (37) lacks the term of order κ² involving δQ that appears in the difference between $σ_{\hat{y}}^{2} (τ)$ and $σ_{y}^{2} (τ)$ given respectively by Eqs. (27) and (24). Instead, another term of order κ² and involving δK is present, indicating that higher-order terms of response theory need to be considered to correct it (Ruelle, 1998 b). The second-order term is given by the expression⁵ (Lucarini, 2012):

\begin{matrix} (38) & \begin{aligned} δ^{2} 〈 A (τ) 〉_{y} & = \int_{0}^{τ} d τ^{'} \int_{τ^{'}}^{τ} d τ^{''} \int d y ρ_{0} (x_{0}) 〈Ψ_{y} (τ^{'}) \nabla_{f^{τ^{'}} (x_{0})} \\ Ψ_{y} (τ^{''}) \nabla_{f^{τ^{''}} (x_{0})} A (f^{τ} (x_{0}))〉 . \end{aligned} \end{matrix}

Applying this to the first moment of the y models directly yields

\begin{matrix} (39) & δ^{2} 〈 y (τ) 〉_{y} = 0 . \end{matrix}

On the other hand, integrating the stochastic integrals present in this expression for the moment 〈y(τ)²〉 gives

\begin{matrix} (40) & \begin{aligned} δ^{2} 〈 y (τ)^{2} 〉_{y} & = \frac{κ^{2} δ K^{2}}{λ_{y}^{2}} {(1 - e^{- λ_{y} τ})}^{2} \\ + \frac{κ^{2} δ Q^{2}}{2 λ_{y}} (1 - e^{- 2 λ_{y} τ}), \end{aligned} \end{matrix}

which corrects the κ²δK² term in Eq. (37) and makes the response theory up to order 2 exactly match the difference between $σ_{\hat{y}}^{2} (τ)$ and $σ_{y}^{2} (τ)$ , for every lead time τ. In fact, the subsequent orders of the response vanish due to the linearity of the simple Ornstein–Uhlenbeck models, which enables us to truncate the response Kubo-like expansion to the second order. Finally, this shows that the (non-stationary) response theory can be used to estimate the postprocessing parameters after the model change based on the forecasts of the initial model. Indeed, instead of the averages $〈 \hat{y} (τ) 〉$ and $σ_{\hat{y}}^{2} (τ)$ , the approximate averages 〈y(τ)〉+δ〈y(τ)〉_y and $σ_{y}^{2} (τ) + δ σ_{y}^{2} (τ) + δ^{2} σ_{y}^{2} (τ)$ can be used to compute $\hat{α}$ and $\hat{β}$ . We emphasise that the second-order contribution had to be considered in order to obtain the exact result. Nevertheless, the difference between the first- and the second-order response is of the order κ², which implies that for a small perturbation (model change), the first order will generally be a sufficiently good approximation. A more detailed derivation of the results obtained in this section can be found in the Supplement.

In order to investigate this research avenue on a case closer to those encountered in reality, we will now consider the application of postprocessing and response theory to a low-order atmospheric model displaying chaos.

4 Application to a low-order atmospheric model

A two-layer quasi-geostrophic atmospheric system on a β plane with an orography is considered (Charney and Straus, 1980; Reinhold and Pierrehumbert, 1982). This spectral model possesses well-identified large-scale flow regimes, such as “zonal” and “blocked” regimes. The horizontal nondimensionalised coordinates are denoted as x and y, with the model's domain being defined by $(0 \leq x \leq \frac{2 π}{n}, 0 \leq y \leq π)$ , with $n = 2 L_{y} / L_{x}$ as the aspect ratio between its meridional and zonal extents L_y and L_x. The two main fields of this model are the 500 hPa pressure anomaly and temperature, which are proportional to the barotropic streamfunction ψ(x, y) and the baroclinic streamfunction θ(x, y), respectively. Both fields are defined in a zonally periodic channel with no-flux boundary conditions in the meridional direction ( $\partial \cdot / \partial x \equiv 0$ at $y = 0, π$ ). The fields are expanded in Fourier modes respecting these boundary conditions:

\begin{aligned} F_{1} (x, y) & = \sqrt{2} \cos (y), \\ F_{2} (x, y) & = 2 \cos (n x) \sin (y), \\ F_{3} (x, y) & = 2 \sin (n x) \sin (y), \\ F_{4} (x, y) & = \sqrt{2} \cos (2 y), \\ ⋮ \end{aligned}

such that

\begin{matrix} (41) & \nabla^{2} F_{i} (x, y) = - a_{i}^{2} F_{i} (x, y) \end{matrix}

with eigenvalues $a_{1}^{2} = 1, a_{2}^{2} = a_{3}^{2} = 1 + n^{2}, a_{4}^{2} = 4, \dots$ . We have thus the following decomposition

\begin{matrix} (42) & ψ (x, y) = \sum_{i = 1}^{n_{a}} ψ_{i} F_{i} (x, y), \\ (43) & θ (x, y) = \sum_{i = 1}^{n_{a}} θ_{i} F_{i} (x, y), \end{matrix}

where n_a is the number of modes of the spectral expansion. The partial differential equations controlling the time evolution of the fields ψ(x, y) and θ(x, y) can then be projected on the Fourier modes to finally give a set of ordinary differential equations for the coefficients ψ_i and θ_i

\begin{matrix} (44) & \dot{x} = F (x), x = (ψ_{1}, \dots, ψ_{n_{a}}, θ_{1}, \dots, θ_{n_{a}}) \end{matrix}

that can be solved with usual numerical integrators. All variables are nondimensionalised. The ordinary differential equations of the model are detailed in Appendix B.

In the version proposed by Reinhold and Pierrehumbert using the 10 first modes beyond a certain value of the zonal temperature gradient, the system displays chaos and makes transitions between the blocked and zonal flow regimes embedded in its global attractor. Here, we use their main nondimensionalised parameters values: the friction at the interface between the two layers k_d=0.1, the friction at the bottom surface $k_{d}^{'} = 0.01$ and the aspect ratio of the domain n=1.3. The β plane lies at midlatitudes (50^∘) and the Coriolis parameter f₀ is set accordingly.

In the present work, the parameter h_d, the Newtonian cooling coefficient is fixed to 0.3 instead of the value found in Reinhold and Pierrehumbert (which is h_d=0.045). Two additional fields have to be specified on the domain: $θ^{*} (x, y)$ , the radiative equilibrium temperature field, and h(x, y), the topographic height field. These fields can be decomposed by projecting them onto the eigenfunctions of the Laplacian as before. The corresponding coefficients $θ_{i}^{*}$ and h_i then allow for writing these fields as sums of weighted eigenfunctions:

\begin{matrix} (45) & θ^{*} (x, y) = \sum_{i = 1}^{n_{a}} θ_{i}^{*} F_{i} (x, y), \\ (46) & h (x, y) = \sum_{i = 1}^{n_{a}} h_{i} F_{i} (x, y) . \end{matrix}

In the present case, we consider that the only non-zero coefficients are $θ_{1}^{*} = 0.2$ and h₂=0.4, meaning that the radiative equilibrium profile is given by the zonally varying function $\sqrt{2} \cos (y)$ and the orography is made of a mountain and a valley shaped by the function 2 cos (nx) sin (y). Again, the value of the temperature gradient $θ_{1}^{*}$ is larger than the one chosen in Reinhold and Pierrehumbert (which is $θ_{1}^{*} = 0.1$ ) to increase the chaotic variability in the system. Trajectories of variables θ₁ and ψ₂ are depicted in Fig. 1, for the reference system (reality) and a model version (model 0) for which the friction coefficient has been slightly modified.

https://www.nonlin-processes-geophys.net/27/307/2020/npg-27-307-2020-f01

Figure 1Dynamics of the reference system and model 0 of the postprocessing experiment with a modification of the friction coefficient (see Table 1) for (a) time evolution of the variable θ₁ and (b) time evolution of the variable ψ₂.

Download

Table 1The main parameters used and modified in the experiments. Model 0 and model 1 are respectively the forecast model of reality before and after the model change.

Download Print Version

These parameter changes induce slight modifications of the dynamics. In particular the system possesses two distinct weather regimes, depicted in Fig. 2a and b: one characterised by a zonal circulation (see Fig. 2c) and another characterised by a blocking situation (see Fig. 2d). In the former case, the variables ψ₂ and ψ₃ characterising the strength of the meridional anomalies are small, while in the latter case they are large, indicating indeed a blocking situation. This is different from the situation considered in Reinhold and Pierrehumbert (1982), where two different blocking regimes coexist with the zonal regime.

https://www.nonlin-processes-geophys.net/27/307/2020/npg-27-307-2020-f02

Figure 2Attractors for the experiment with a modification of the friction coefficient: (a) two-dimensional isodensity of the attractors estimated with a Gaussian kernel density estimator for the variables ψ₂ and ψ₃ and (b) two-dimensional scatterplot of the attractors for the variables ψ₂ and ψ₃. The attractors of reality and model 0 are qualitatively similar, with two different parts which are indicated by ellipses. The blue and red crosses correspond respectively to equilibrium points of the reference model (reality) and of model 0, respectively. The dashed ellipse corresponds on average to a zonal circulation depicted in panel (c). The dash-dotted ellipse corresponds on average to a blocking situation depicted in panel (d). In both panels (c) and (d), the underlying colour map denotes the orography in the domain, and the contours denote the geopotential height anomaly at 500 hPa.

Download

4.1 Postprocessing experiments

The model described above with 10 modes (n_a=10) is used, and two different postprocessing experiments are performed, one involving the Newtonian cooling parameter h_d and another involving the friction parameter k_d between the two atmospheric layers. The parameter values detailed above correspond to the long-term reference (i.e. reality). A first model is defined (model 0) which is a copy of the two-layer quasi-geostrophic model defining reality, but the parameters h_d or k_d are slightly changed; i.e. the model error of the forecasting system lies in either the Newtonian cooling or the friction parameter. Then, as in Sect. 3, a model change is imposed, leading to another forecasting model (model 1) that can either improve or degrade the model error by a factor κ. The parameter variations involved in these experiments are detailed in Table 1. Without a loss of generality, we consider model changes that improve the representation of reality in the sense that the amplitude of the model error in model 1 is smaller than in model 0. The effect of the model change is depicted in Figs. 3 and 4 for the friction parameter experiment. These figures display the mean and the standard deviation of the model forecasts and observations coming from the reference forecasts, as a function of the lead time τ. We have used a set of 1 million trajectories of each system to compute these averages.

https://www.nonlin-processes-geophys.net/27/307/2020/npg-27-307-2020-f03

Figure 3Behaviour of the averages as a function of the lead time τ in reality and the forecast models before (a) and after (b) the model change, for the experiment with a modification of the friction coefficient (see Table 1). The variable considered is the temperature meridional gradient θ₁. The solid lines denote the mean, while the shaded areas denote the interval of 1 standard deviation.

Download

https://www.nonlin-processes-geophys.net/27/307/2020/npg-27-307-2020-f04

Figure 4Same as Fig. 3 but for the variable ψ₃ of the streamfunction ψ.

Download

In the framework of the EVMOS postprocessing scheme, the predictors and the predictands are the same nominal variable, and no other predictors are used. In both experiments considered, the postprocessing parameters α and β of the EVMOS for model 0, as well as $\hat{α}$ and $\hat{β}$ for model 1, are computed. The main objective here is then to estimate the difference between the former and the latter using Ruelle response theory. The approach in a multivariate setting is presented below.

4.2 Model change, response theory and the tangent linear model

Let us consider again the response theory described in Sect. 3.3 but in the general multivariate deterministic case described in Sect. 2. In the postprocessing framework, models 0 and 1 evolve in time from a set of initial conditions taken outside of their respective attractors. Response formulas found in Ruelle's work have to be adapted to take this into account. One therefore has to consider the density of initial conditions as the measure. For a system with a time-independent perturbation $Ψ (\hat{y})$ ,

\begin{matrix} (47) & \dot{\hat{y}} = F (\hat{y}) + Ψ (\hat{y}) = \hat{F} (\hat{y}), \end{matrix}

an observable A with average 〈A(τ)〉_y at the lead time τ for the system

\begin{matrix} (48) & \dot{y} = F (y) \end{matrix}

has a first-order response of

\begin{matrix} (49) & δ 〈 A (τ) 〉_{y} = \int d y_{0} ρ_{0} (y_{0}) δ y (τ)^{T} \cdot \nabla_{f^{τ} (y_{0})} A, \end{matrix}

where f^τ is the flow of the unperturbed system given by Eq. (48), ρ₀ is the distribution of initial conditions and δy(τ) is the solution of the equation $\dot{y} + \dot{δ y} = \hat{F} (y + δ y)$ , which can be approximated at the first order by the following linear inhomogeneous differential equation

\begin{matrix} (50) & \dot{δ y} = \nabla_{y} F \cdot δ y + Ψ (y), \end{matrix}

where y(τ) is the solution of the unperturbed Eq. (48) with the initial condition y(0)=y₀, and we see that the systems of Eqs. (48) and (50) have to be integrated simultaneously (Gaspard, 2005). The homogeneous part of Eq. (50) is the well-known tangent linear model of the system, and here it has to be solved with an additional boundary term which is the perturbation itself.

Equation (49) is derived in Appendix A and can be computed in the same way as the averages depicted in Figs. 3 and 4, by averaging over multiple initial conditions of the reference system. Since we initialise the unperturbed (model 0) and perturbed systems (model 1) with the same initial conditions, the initial state of the tangent model defined by Eq. (50) is δy(0)=0. Therefore we do not estimate the impact of the observation or assimilation errors but rather the direct impact of the model errors viewed as time-independent perturbations. The formulation of the problem and Eq. (50) can be adapted to take these errors into account, as described for instance by Nicolis (2016).

In what follows, we will numerically integrate Eq. (50) to evaluate the response on the average due to the perturbation induced by the model change. This will in turn, as in Sect. 3, enable us to compute the postprocessing parameters for the new model.

4.3 Main results

For each of the two experiments detailed in Table 1, we start by obtaining 1 million observations of reality that will be used to initialise the forecast models. For each observation, this is done by starting model x (the reference) with a random initial condition and running it for a very long time (100 000 nondimensionalised time units) to achieve convergence to its global attractor. Once the observations have been obtained, we run the reference model, model 0 and model 1 over 200 time units (corresponding to roughly 22 d) to obtain reality and the forecasts. The systems have been integrated using the fourth-order Runge–Kutta integration scheme with a time step of 0.1 time units corresponding to 16.15 min. The averaging over the 1 million trajectories of reality and of the forecasts at each lead time is used to compute the postprocessing coefficients α and β of the EVMOS by using Eqs. (17) and (18). For each predictand, the corresponding model variable is used as the unique predictor.

https://www.nonlin-processes-geophys.net/27/307/2020/npg-27-307-2020-f05

Figure 5Corrections of the moments of θ₁ from model 0 to model 1 using the response-theory formula of Eq. (49), for the experiment with a modification of the friction coefficient.

Download

https://www.nonlin-processes-geophys.net/27/307/2020/npg-27-307-2020-f06

Figure 6Corrections of the moments of θ₁ from model 0 to model 1 using the response-theory formula of Eq. (49), for the experiment with a modification of the Newtonian cooling coefficient.

Download

The response-theory approximations of the averages of model $\hat{y}$ (model 1) averages are obtained by integrating the linearised equations of model 0 along its trajectories with the perturbation Ψ as an inhomogeneous term. This is done by integrating Eq. (50) over a lead time of 200 time units with a zero initial condition, using the same integration scheme as before. It gives us the integrand of Eq. (49) for each trajectory, and the integral is then approximated as the average of this integrand over the whole set of trajectories. The result of this integration and averaging is shown in Figs. 5 and 6 for the first and second moment of the variable θ₁. The results for other variables are available in the Supplement. The black curve shows the moments of model 0 with the addition of their linear response δ〈θ₁〉 and $δ 〈 θ_{1}^{2} 〉$ to the perturbation Ψ. This curve agrees well with the green curves of the model 1 moments up to a lead time of 4–5 d, showing the efficiency of response theory. Note that in contrast to the calculation of the averages shown in Figs. 3 and 4 and computed with 1 million trajectories, we have here considered a limited subset of 10 000 trajectories of model 0 and its tangent to compute the corrections to these averages. The correction of the moments of model 1 are accurate until 4 d for both experiments. After this critical lead time, obtaining a good accuracy requires a huge increase in the number of forecasts and tangent model integrations to perform the averaging. This problem is well-known (Nicolis, 2003; Eyink et al., 2004) and is due to the appearance of fat tails in the distribution of the perturbations δy in the integrand of Eq. (49). As it can be seen in Fig. 7 for the perturbations on θ₁, the problem worsens with the increase of the lead time: initially the distributions are near-Gaussian, and fat tails appear progressively. Therefore, the number of samples of δy needed to converge to the correct mean up to a certain precision increases exponentially as the lead time increases. This problem has consequences on the method used to perform the average. Indeed, to avoid rare and unrealistic extreme responses of the system located far in the tails of the distributions, outliers above a certain threshold (set to three nondimensional units) have been removed from the averaging.

https://www.nonlin-processes-geophys.net/27/307/2020/npg-27-307-2020-f07

Figure 7Histograms of the solutions δθ₁(τ) of Eq. (50) for the perturbation δy(τ) (with θ₁ being the 11th component of y) along the trajectories of model 0, for different values of the lead time τ. The solid orange curves are fits of a Gaussian distribution function to the different histograms. The fat-tail phenomenon described in Eyink et al. (2004) is apparent and becomes more prominent as the lead time increases.

Download

https://www.nonlin-processes-geophys.net/27/307/2020/npg-27-307-2020-f08

Figure 8Coefficients α and β of the postprocessing schemes of variable θ₁ and their correction using the response theory, for the experiment with a modification of the Newtonian cooling coefficient.

Download

https://www.nonlin-processes-geophys.net/27/307/2020/npg-27-307-2020-f09

Figure 9Performance of the corrections on the variable θ₁ for the experiment with the modification of the friction coefficient. (a) Mean square error (MSE) evolution between the different forecasts and their correction and reality. (b) Mean of the different trajectories (reality, model 0 and model 1) and corrected forecasts. (c) Variance of the different trajectories and corrected forecasts.

Download

https://www.nonlin-processes-geophys.net/27/307/2020/npg-27-307-2020-f10

Figure 10Performance of the corrections on the variable θ₁ for the experiment with the modification of the Newtonian cooling coefficient. (a) Mean square error (MSE) evolution between the different forecasts and their correction and reality. (b) Mean of the different trajectories (reality, model 0 and model 1) and corrected forecasts. (c) Variance of the different trajectories and corrected forecasts.

Download

The moments obtained by the response-theory approach are used to compute new EVMOS postprocessing α and β coefficients, thanks to Eqs. (17) and (18). These corrected coefficients for variable θ₁ are shown in Fig. 8 for the experiment with a modification of the Newtonian cooling coefficient and in panels (c) and (d) of Fig. 11 for the experiment with a modification of the friction coefficient. In Figs. 9 and 10, we compare the performances of the four postprocessing schemes hence obtained: the postprocessing of model 0 (red curves) and model 1 (green curves) obtained by averaging over their trajectories (forecasts) and the postprocessing of model 1 obtained with the past model 0 forecasts (green + crosses) and with the response-theory approach (black × crosses). In panel (a) of Figs. 9 and 10, the mean square error (MSE) between the trajectories of the models and the reference trajectories is displayed by solid curves, while the MSE between both models correction and the reference is depicted by dash-dotted curves. The EVMOS postprocessing is able to partly correct the forecasts, reducing the MSE until a lead time of the order of a few times the model's Lyapunov time (the inverse of the leading Lyapunov exponent). After that, the MSE curves of the postprocessed and uncorrected forecasts converge toward a plateau corresponding to twice the variance of the reference solution (Vannitsem, 2009). Here, the statistical postprocessing corrections are indeed efficient until lead times of 4–5 d, with a skill of the corrections decreasing with the lead time. Thus the EVMOS schemes do not become better than the original models after roughly 4 d. Note also that even if the model change is small, the postprocessing using the past forecasts of model 0 (green + crosses) completely fails to correct model 1 forecasts, highlighting the need for an adaptation of the postprocessing to the model change. In contrast, the adaptation with the response-theory method (black × crosses) produces valid corrections up to 4 d later. In panels (b) and (c) of Figs. 9 and 10, the mean and variance of the corrected forecasts are compared with those of the original models. Again, the corrections obtained with response theory are efficient until 4 d for the postprocessing schemes.

In conclusion, the correction of model 1 using the response-theory EVMOS matches almost perfectly the score of the “exact” EVMOS obtained with the forecasts of model 1 (dash-dotted green curve), up to a 4 d lead time. After that lead time, the errors due to the fat tails in the response of the first moments of the statistics induce errors in the variance needed to compute the α and β coefficients (see Eqs. 17 and 18). These coefficients therefore degrade sharply after 4 d, as shown by the solid black curve in Fig. 8 and in Fig. 11c and d. This in turn induces a degradation of the response-theory postprocessing scheme. Nevertheless, this limitation of response theory is not a concern here, since after a lead time of 4 d, the EVMOS skill improvement vanishes anyway.

https://www.nonlin-processes-geophys.net/27/307/2020/npg-27-307-2020-f11

Figure 11Comparison of the efficiency of the response-theory correction for different numbers m of trajectories used to average Eq. (49), for the experiment of varying the friction coefficient: (a) mean square error with reality, (b) absolute difference between the response-theory correction and the correction based on the forecast of model 1, and (c, d) postprocessing coefficients α and β. In panels (b), (c) and (d), the higher (100 000) and the lower (20) numbers are depicted respectively by a solid black line and a dashed red line. The other cases in between are depicted by dotted lines.

Download

5 Discussion and conclusions

Statistical postprocessing techniques used to correct numerical weather predictions (NWP) require substantial past forecast and observation databases. In the case of a model change, which frequently occurs during the normal life cycle of an operational forecast model, one has to reforecast the entire database of past forecasts (Hagedorn et al., 2008; Hamill et al., 2008) to update the postprocessing coefficients and parameters. In the present work, we proposed a new methodology based on response theory to produce these new coefficients without having to reforecast. Instead, the database of past forecasts is reused to perform integrations in the tangent space of the model. It allows us to obtain the new postprocessing coefficients as modifications of the old ones. These new coefficients were shown to be accurate enough within the lead-time range for which the postprocessing corrections improve the forecast.

Figure 11 summarises the main results of this work, with the quasi-geostrophic system described in Sect. 4, using a different number m of trajectories of model 0 and its tangent model to compute the response-theory corrections. It shows that up to a lead time of 2 d, good postprocessing scheme coefficients are obtained even with a mere 20 integrations in the tangent space.

Note however that in the context of this conceptual model, good estimates of the postprocessing coefficients α and β can be obtained by simply using a small set of reforecasts. It is indeed enough to directly integrate the updated model 1, given by the non-linear Eq. (47), with only 20 trajectories. So the response-theory approach in the present case cannot really compete with the simple reforecasting method. How this can be improved in an operational context is an important question that should be addressed in the future. For instance, we can use a simplified tangent linear model to reduce the computational burden, as is often used in data assimilation (Bonavita et al., 2017). This approach could also be implemented for short-range forecasts, say from 1 to 3 d.

The response theory is efficient because the model changes are assumed to be small in comparison with the original parameterisation of the models. The method cannot improve a postprocessing scheme, but it can efficiently adapt it to a new model version. As such, the success of this method also depends on the quality of the past postprocessing scheme. There are situations where linear response theory is known to fail, but statistical tests which allow for the identification of its breakdown have been derived in Gottwald et al. (2016) and in Wormell and Gottwald (2018). In addition, the approach presented here applies only for models for which a tangent model is available. The model change itself has to be provided as an analytic function, which can in some circumstances limit the applicability of the approach.

To test this approach, we have focused on the EVMOS statistical postprocessing method, but other methods could be considered as well. The only requirement is that the outcome of the minimisation of the cost function uses averages of the systems being considered. For instance, member-by-member methods that correct both the mean square errors and the spread of the ensemble while preserving the spatial correlation (Van Schaeybroeck and Vannitsem, 2015) could be considered. These methods generally use the covariance between the model forecasts and the observations as an important piece. Response theory can also be applied here, since this covariance can be written as an average. This will be investigated in a future work, together with the applicability of the approach to parameters of probability distributions, as is often used in meteorology (Vannitsem et al., 2018).

The impact of initial-condition errors has not been addressed here, since the purpose was to demonstrate the applicability of the approach in a perfectly controlled environment. The main limiting issue of response theory in the present context is the presence of fat tails in the distribution of the perturbations δy in the tangent model. This implies that beyond a certain lead time, typically 2–3 d for the synoptic scale, the number of trajectories of the tangent model needed for the averages to converge increases exponentially. This renders the approach impractical at lead times beyond 2–3 d. This is a well-known problem, which is typically due to the trajectories passing close to the stable manifolds structuring the dynamics of chaotic systems (Eyink et al., 2004), generating an extreme response of the system to the perturbations Ψ. This is possibly due to the exacerbated sensitivity of these manifolds to the perturbation of the system. We see two possibilities to overcome this issue in the case where a long lead-time correction is needed.

First, as suggested by Eyink et al. (2004), the problem should be studied in other systems. It might be resolved by itself in other systems. Indeed, in very large atmospheric systems, the encounter of such manifolds might become rare. This could be related to the chaotic hypothesis (Gallavotti and Cohen, 1995 a, b) which states that large systems can be considered to behave like Axiom-A hyperbolic systems for the physical quantities of interest, and thus Ruelle response theory (Ruelle, 2009) might get better as the dimensionality of a system increases. This hypothesis would be interesting to test in current state-of-the-art NWP systems.
Secondly, another avenue would be to adapt the techniques based on the covariant Lyapunov vectors (CLVs) or on unstable periodic orbits (UPOs) to non-stationary dynamics. These techniques were recently introduced (Wang, 2013; Ni and Wang, 2017; Ni, 2019; Lasagna, 2019; Lasagna et al., 2019) to deal with stationary responses of chaotic systems, i.e. the response of a system that lies on its attractor.

The CLVs methods mentioned focus on finding an adjoint representation (Eyink et al., 2004) of the response, while in the present work the approach is based on forward integrations (direct method). The adjoint representation allows for easily changing the perturbation function Ψ for a fixed observable A, while the direct method enables the consideration of different observables while keeping the perturbation function fixed. The adjoint representation, however, requires one to integrate the tangent model backward in time. Therefore, its accuracy depends on the absolute value of the smallest Lyapunov exponent of the system, which might render its results less well than the direct forward representation.

In conclusion, the response-theory approach developed here is an effective method to deal with the problem of the impact of model change on the postprocessing scheme. Its main advantage is to be computed on the past model version and does not require reforecasts of the full model. Its operational implementation, however, is still an open question that should be addressed in the future.

Code availability

The quasi-geostrophic model used is called QGS and was obtained by adapting the Python code of the MAOOAM ocean–atmosphere model (De Cruz et al., 2016), following the model description in Cehelsky and Tung (1987). It was recently released on Zenodo (Demaeyer and De Cruz, 2020) and is also available at https://github.com/Climdyn/qgs (last access: 18 May 2020). The additional notebooks computing the response to model changes and generating the figures are also provided in the Supplement. They have been released on Zenodo as well (Demaeyer, 2020) and are available at https://github.com/jodemaey/Postprocessing_and_response_theory_notebooks (last access: 18 May 2020).

Appendix A: Non-stationary response theory

We consider a perturbed autonomous dynamical system

\begin{matrix} (A1) & \dot{\hat{y}} = F (\hat{y}) + Ψ (\hat{y}) = \hat{F} (\hat{y}) \end{matrix}

with a prescribed distribution of initial conditions ρ₀. For the unperturbed system

\begin{matrix} (A2) & \dot{y} = F (y), \end{matrix}

an observable A has the average at time τ

\begin{matrix} (A3) & \begin{aligned} 〈 A (τ) 〉_{y} & = \int d y_{0} ρ_{0} (y_{0}) A (f^{τ} (y_{0})) \\ = \int d y ρ_{τ} (y) A (y), \end{aligned} \end{matrix}

where f^τ is the flow of the unperturbed system given by Eq. (A2) and where ρ_τ is the distribution obtained by propagating the initial distribution ρ₀ with the Liouville equation (Gaspard, 2005). In this section, the variation of this average due to the presence of the perturbation is evaluated as

\begin{matrix} (A4) & 〈 A (τ) 〉_{\hat{y}} = 〈 A (τ) 〉_{y} + δ 〈 A (τ) 〉_{y} + δ^{2} 〈 A (τ) 〉_{y} + \dots . \end{matrix}

In other words, we compute the average of A in the system defined by Eq. (A1)

\begin{matrix} (A5) & 〈 A (τ) 〉_{\hat{y}} = \int d y_{0} ρ_{0} (y_{0}) A ({\hat{f}}^{τ} (y_{0})) \end{matrix}

as a perturbation of the average given by Eq. (A3) for the unperturbed system defined by Eq. (A2). Here, ${\hat{f}}^{τ}$ is the flow of the perturbed system defined by Eq. (A1). In the following, we will derive these corrections thanks to a Kubo-type perturbative expansion (Lucarini, 2008) that amounts to constructing a Dyson series in the interaction picture framework, where the perturbation is seen as an interaction Hamiltonian (Wouters and Lucarini, 2012). We start by considering the time evolution of the observable A in Eq. (A1):

\begin{matrix} (A6) & \frac{d}{d τ} A ({\hat{f}}^{τ} (y_{0})) = (L_{0} + L_{1}) A ({\hat{f}}^{τ} (y_{0})) \end{matrix}

with the operators

\begin{matrix} (A7) & \{\begin{array}{lcl} L_{0} A (y) & = & F (y)^{T} \cdot \nabla_{y} A \\ L_{1} A (y) & = & Ψ (y)^{T} \cdot \nabla_{y} A \end{array} \end{matrix}

and define an interaction observable as

\begin{matrix} (A8) & A_{I} (τ, y_{0}) = Π_{0} (- τ) A ({\hat{f}}^{τ} (y_{0})) \end{matrix}

with Π₀(τ)=exp (ℒ₀ τ). It is easy to show that the interaction observable satisfies the differential equation:

\begin{matrix} (A9) & \frac{d}{d τ} A_{I} (τ, y_{0}) = L_{I} (τ) A_{I} (τ, y_{0}) \end{matrix}

with the interaction operator $L_{I} (τ) = Π_{0} (- τ) L_{1} Π_{0} (τ)$ . The solution to this equation is

\begin{matrix} (A10) & \begin{aligned} A_{I} (τ, y_{0}) & = A_{I} (0, y_{0}) + \int_{0}^{τ} d s_{1} L_{I} (s_{1}) A_{I} (s_{1}, y_{0}) \\ = A (y_{0}) + \int_{0}^{τ} d s_{1} L_{I} (s_{1}) A_{I} (s_{1}, y_{0}), \end{aligned} \end{matrix}

which can be rewritten as

\begin{matrix} (A11) & \begin{aligned} A ({\hat{f}}^{τ} (y_{0})) & = Π_{0} (τ) A (y_{0}) \\ + \int_{0}^{τ} d s_{1} Π_{0} (τ - s_{1}) L_{1} Π_{0} (s_{1}) A_{I} (s_{1}, y_{0}) . \end{aligned} \end{matrix}

Iteratively replacing the interaction observable by Eq. (A10) finally leads to the Dyson series:

\begin{matrix} (A12) & \begin{aligned} A ({\hat{f}}^{τ} (y_{0})) & = Π_{0} (τ) A (y_{0}) \\ + \int_{0}^{τ} d s_{1} Π_{0} (τ - s_{1}) L_{1} Π_{0} (s_{1}) A (y_{0}) \\ + \int_{0}^{τ} d s_{1} \int_{0}^{s_{1}} d s_{2} Π_{0} (τ - s_{1}) L_{1} Π_{0} (s_{1} - s_{2}) \\ L_{1} Π_{0} (s_{2}) A (y_{0}) + \dots . \end{aligned} \end{matrix}

Using the definitions in Eqs. (A3) and (A5), as well as the fact that

\begin{matrix} (A13) & g (f^{τ} (y_{0})) = Π_{0} (τ) g (y_{0}) \end{matrix}

for any smooth function g, we get finally a formula for the perturbations in Eq. (A4):

\begin{matrix} (A14) & \begin{aligned} 〈 A (τ) 〉_{\hat{y}} & = 〈 A (τ) 〉_{y} + \int_{0}^{τ} d s_{1} \int d y_{0} ρ_{0} (y_{0}) Π_{0} (τ - s_{1}) \\ L_{1} Π_{0} (s_{1}) A (y_{0}) + \dots . \end{aligned} \end{matrix}

We will now focus on the first term of this expansion, but the subsequent orders of the response can be treated in the same way. We thus have

\begin{matrix} (A15) & \begin{aligned} δ 〈 A (τ) 〉_{y} & = \int_{0}^{τ} d s_{1} \int d y_{0} ρ_{0} (y_{0}) Ψ {(f^{τ - s_{1}} (y_{0}))}^{T} \\ \cdot \nabla_{f^{τ - s_{1}} (y_{0})} A (f^{τ} (y_{0})), \end{aligned} \end{matrix}

which with the change of variable $s_{1} \to t - τ^{'}$ can be rewritten as

\begin{matrix} (A16) & \begin{aligned} δ 〈 A (τ) 〉_{y} & = \int_{0}^{τ} d τ^{'} \int d y_{0} ρ_{0} (y_{0}) Ψ {(f^{τ^{'}} (y_{0}))}^{T} \\ \cdot \nabla_{f^{τ^{'}} (y_{0})} A (f^{τ} (y_{0})) \end{aligned} \end{matrix}

and then

\begin{matrix} (A17) & \begin{aligned} δ 〈 A (τ) 〉_{y} & = \int_{0}^{τ} d τ^{'} \int d y_{0} ρ_{0} (y_{0}) Ψ {(f^{τ^{'}} (y_{0}))}^{T} \\ \cdot {(\frac{\partial f^{τ} (y_{0})}{\partial f^{τ^{'}} (y_{0})})}^{T} \cdot \nabla_{f^{τ} (y_{0})} A \end{aligned} \\ (A18) & \begin{aligned} = \int_{0}^{τ} d τ^{'} \int d y_{0} ρ_{0} (y_{0}) Ψ {(f^{τ^{'}} (y_{0}))}^{T} \\ \cdot M {(τ - τ^{'}, f^{τ^{'}} (y_{0}))}^{T} \cdot \nabla_{f^{τ} (y_{0})} A . \end{aligned} \end{matrix}

M is the fundamental matrix (Gaspard, 2005; Nicolis, 2016) of the homogeneous part of the linear differential equation

\begin{matrix} (A19) & \dot{δ y} = \nabla_{y} F \cdot δ y + Ψ (y), \end{matrix}

where y is solution of Eq. (A2) with initial condition y₀, and we have the definition

\begin{matrix} (A20) & M (t, y) = \frac{\partial f^{t} (y)}{\partial y} . \end{matrix}

Equation (A19) is the linearised approximation of Eq. (A1):

\begin{matrix} (A21) & \dot{y} + \dot{δ y} = F (y + δ y) + Ψ (y + δ y), \end{matrix}

which provides a tool to estimate Eq. (A18). Indeed, since the solution of Eq. (A19) can be written as

\begin{matrix} (A22) & δ y (τ) = \int_{0}^{τ} d τ^{'} M (τ - τ^{'}, f^{τ^{'}} (y_{0})) \cdot Ψ (f^{τ^{'}} (y_{0})), \end{matrix}

we can write the first-order variation of the average of the observable A in terms of these solutions:

\begin{matrix} (A23) & δ 〈 A (τ) 〉_{y} = \int d y_{0} ρ_{0} (y_{0}) δ y (τ)^{T} \cdot \nabla_{f^{τ} (y_{0})} A . \end{matrix}

The interpretation of this equation is that a specific averaging of an observable over the trajectories of the linear approximation given by Eq. (A19) of the perturbed Eq. (A2) provides the first-order response of the observable. It is the main result used to compute the new postprocessing scheme in the present work. It is explained in detail in Sects. 3.3 and 4.2.

Appendix B: The quasi-geostrophic model equations

The ordinary differential equations of the model are given by

\begin{matrix} (B1) & \begin{aligned} {\dot{ψ}}_{i} & = - a_{i, i}^{- 1} \sum_{j, m = 1}^{n_{a}} b_{i, j, m} (ψ_{j} ψ_{m} + θ_{j} θ_{m}) \\ - \frac{a_{i, i}^{- 1}}{2} \sum_{j, m = 1}^{n_{a}} g_{i, j, m} h_{m} (ψ_{j} - θ_{j}) \\ - β a_{i, i}^{- 1} \sum_{j = 1}^{n_{a}} c_{i, j} ψ_{j} - \frac{k_{d}}{2} (ψ_{i} - θ_{i}), \end{aligned} \\ (B2) & \begin{aligned} {\dot{θ}}_{i} & = - a_{i, i}^{- 1} \sum_{j, m = 1}^{n_{a}} b_{i, j, m} (ψ_{j} θ_{m} + θ_{j} ψ_{m}) \\ + \frac{a_{i, i}^{- 1}}{2} \sum_{j, m = 1}^{n_{a}} g_{i, j, m} h_{m} (ψ_{j} - θ_{j}) \\ - β a_{i, i}^{- 1} \sum_{j = 1}^{n_{a}} c_{i, j} θ_{j} + \frac{k_{d}}{2} (ψ_{i} - θ_{i}) \\ - 2 k_{d}^{'} θ_{i} + a_{i, i}^{- 1} ω_{i}, \end{aligned} \\ (B3) & {\dot{θ}}_{i} = - \sum_{j, m = 1}^{n_{a}} g_{i, j, m} ψ_{j} θ_{m} + \frac{σ}{2} ω_{i} + h_{d} (θ_{i}^{*} - θ_{i}), \end{matrix}

where nondimensional parameters values and description can be found in Table 1 and Sect. 4. β is the meridional gradient of Coriolis parameter which has the nondimensional value of 0.21 at 50^∘ of latitude (Reinhold and Pierrehumbert, 1982; Cehelsky and Tung, 1987). The vertical velocity ω_i can be eliminated, leading to Eqs. (B2) and (B3) being reduced to a single equation for θ_i. The parameter σ is the nondimensional static stability of the atmosphere set typically to 0.2. The coefficients a_i, j, $g_{i, j, m}$ , $b_{i, j, m}$ and c_i, j are the inner products of the Fourier modes F_i defined in Sect. 4:

\begin{matrix} (B4) & \begin{aligned} a_{i, j} = \\ \frac{n}{2 π^{2}} \int_{0}^{π} \int_{0}^{2 π / n} F_{i} (x, y) \nabla^{2} F_{j} (x, y) d x d y = - δ_{i j} a_{i}^{2}, \end{aligned} \\ (B5) & \begin{aligned} g_{i, j, m} = \\ \frac{n}{2 π^{2}} \int_{0}^{π} \int_{0}^{2 π / n} F_{i} (x, y) J (F_{j} (x, y), F_{m} (x, y)) d x d y, \end{aligned} \\ (B6) & \begin{aligned} b_{i, j, m} = \\ \frac{n}{2 π^{2}} \int_{0}^{π} \int_{0}^{2 π / n} F_{i} (x, y) J (F_{j} (x, y), \nabla^{2} F_{m} (x, y)) d x d y, \end{aligned} \end{matrix}

\begin{matrix} (B7) & c_{i, j} = \frac{n}{2 π^{2}} \int_{0}^{π} \int_{0}^{2 π / n} F_{i} (x, y) \frac{\partial}{\partial x} F_{j} (x, y) d x d y, \end{matrix}

where the coefficients a_i are given by Eq. (41) and where J is the Jacobian present in the advection terms defined as

\begin{matrix} (B8) & J (S, G) = \frac{\partial S}{\partial x} \frac{\partial G}{\partial y} - \frac{\partial S}{\partial y} \frac{\partial G}{\partial x} . \end{matrix}

Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/npg-27-307-2020-supplement.

Author contributions

JD and SV developed the idea of using the response theory in the context of postprocessing, together with the overall experimental setup. JD made the analytical and numerical computations. Both authors contributed to the writing of the paper.

Competing interests

Stéphane Vannitsem is a member of the editorial board of the journal. Jonathan Demaeyer declares that he has no conflict of interest.

Special issue statement

This article is part of the special issue “Advances in post-processing and blending of deterministic and ensemble forecasts”. It is not associated with a conference.

Acknowledgements

The authors warmly thank Lesley De Cruz for her suggestions throughout the paper. They also thank Michaël Zamo and the anonymous reviewer for their comments and suggested improvements.

Financial support

This research has been supported by EUMETNET (Postprocessing module of the NWP Cooperation Programme).

Review statement

This paper was edited by Maxime Taillardat and reviewed by Michaël Zamo and one anonymous referee.

References

Bódai, T., Lucarini, V., and Lunkeit, F.: Can we use linear response theory to assess geoengineering strategies?, Chaos: An Interdisciplinary Journal of Nonlinear Science, 30, 023124, https://doi.org/10.1063/1.5122255, 2020. a

Bonavita, M., Trémolet, Y., Holm, E., Lang, S. T. K., Chrust, M., Janisková, M., Lopez, P., Laloyaux, P., De Rosnay, P., Fisher, M., Hamrud, M., and English, S.: A strategy for data assimilation, European Centre for Medium Range Weather Forecasts, available at: https://www.ecmwf.int/sites/default/files/elibrary/2017/17179-strategy-data-assimilation.pdf (last access: 20 May 2020), 2017. a

Cehelsky, P. and Tung, K. K.: Theories of multiple equilibria and weather regimes – A critical reexamination. Part II: Baroclinic two-layer models, J. Atmos. Sci., 44, 3282–3303, 1987. a, b

Charney, J. G. and Straus, D. M.: Form-drag instability, multiple equilibria and propagating planetary waves in baroclinic, orographically forced, planetary wave systems, J. Atmos. Sci., 37, 1157–1176, 1980. a

De Cruz, L., Demaeyer, J., and Vannitsem, S.: The Modular Arbitrary-Order Ocean-Atmosphere Model: MAOOAM v1.0, Geosci. Model Dev., 9, 2793–2808, https://doi.org/10.5194/gmd-9-2793-2016, 2016. a

Demaeyer, J.: Postprocessing and response theory notebooks: version 0.1.1 release, Zenodo, https://doi.org/10.5281/zenodo.3755313, 2020. a

Demaeyer, J. and De Cruz, L.: qgs: version 0.1.0 release, Zenodo, https://doi.org/10.5281/zenodo.3716322, 2020. a

Demaeyer, J. and Vannitsem, S.: Comparison of stochastic parameterizations in the framework of a coupled ocean–atmosphere model, Nonlin. Processes Geophys., 25, 605–631, https://doi.org/10.5194/npg-25-605-2018, 2018. a

Eyink, G., Haine, T., and Lea, D.: Ruelle's linear response formula, ensemble adjoint schemes and Lévy flights, Nonlinearity, 17, 1867, https://doi.org/10.1088/0951-7715/17/5/016, 2004. a, b, c, d, e, f

Gallavotti, G. and Cohen, E.: Dynamical ensembles in nonequilibrium statistical mechanics, Phys. Rev. Lett., 74, 2694, https://doi.org/10.1103/PhysRevLett.74.2694, 1995a. a, b

Gallavotti, G. and Cohen, E. G. D.: Dynamical ensembles in stationary states, J. Stat. Phys., 80, 931–970, 1995b. a, b

Gardiner, C. W.: Handbook of stochastic methods, fourth edn., Springer, Berlin, 2009. a, b

Gaspard, P.: Chaos, scattering and statistical mechanics, vol. 9, Cambridge University Press, Cambridge, UK, 2005. a, b, c, d

Glahn, B., Peroutka, M., Wiedenfeld, J., Wagner, J., Zylstra, G., Schuknecht, B., and Jackson, B.: MOS uncertainty estimates in an ensemble framework, Mon. Weather Rev., 137, 246–268, 2009. a

Glahn, H. R. and Lowry, D. A.: The use of model output statistics (MOS) in objective weather forecasting, J. Appl. Meteorol., 11, 1203–1211, 1972. a

Gottwald, G. A., Wormell, J., and Wouters, J.: On spurious detection of linear response and misuse of the fluctuation–dissipation theorem in finite time series, Physica D, 331, 89–101, 2016. a, b

Hagedorn, R., Hamill, T. M., and Whitaker, J. S.: Probabilistic forecast calibration using ECMWF and GFS ensemble reforecasts. Part I: Two-meter temperatures, Mon. Weather Rev., 136, 2608–2619, 2008. a

Hamill, T. M.: Practical aspects of statistical postprocessing, in: Statistical Postprocessing of Ensemble Forecasts, edited by: Vannitsem, S., Wilks, D. S., and Messner, J. W., 187–217, Elsevier, Amsterdam, the Netherlands, 2018. a, b

Hamill, T. M., Hagedorn, R., and Whitaker, J. S.: Probabilistic forecast calibration using ECMWF and GFS ensemble reforecasts. Part II: Precipitation, Mon. Weather Rev., 136, 2620–2632, 2008. a

Hamill, T. M., Bates, G. T., Whitaker, J. S., Murray, D. R., Fiorino, M., Galarneau Jr, T. J., Zhu, Y., and Lapenta, W.: NOAA's second-generation global medium-range ensemble reforecast dataset, B. Am. Meteorol. Soc., 94, 1553–1565, 2013. a

Johnson, C. and Bowler, N.: On the reliability and calibration of ensemble forecasts, Mon. Weather Rev., 137, 1717–1720, 2009. a

Kalnay, E.: Atmospheric modeling, data assimilation and predictability, Cambridge University Press, Cambridge, UK, 2003. a, b

Lang, M. N., Lerch, S., Mayr, G. J., Simon, T., Stauffer, R., and Zeileis, A.: Remember the past: a comparison of time-adaptive training schemes for non-homogeneous regression, Nonlin. Processes Geophys., 27, 23–34, https://doi.org/10.5194/npg-27-23-2020, 2020. a

Lasagna, D.: Sensitivity and stability of long periodic orbits of chaotic systems, ArXiv [Preprint], arXiv:1910.06706, 2019. a

Lasagna, D., Sharma, A., and Meyers, J.: Periodic shadowing sensitivity analysis of chaotic systems, J. Comput. Phys., 391, 119–141, 2019. a

Lembo, V., Lucarini, V., and Ragone, F.: Beyond Forcing Scenarios: Predicting Climate Change through Response Operators in a Coupled General Circulation Model, ArXiv [Preprint], arXiv:1912.03996, 2019. a

Lucarini, V.: Response theory for equilibrium and non-equilibrium statistical mechanics: Causality and generalized Kramers-Kronig relations, J. Stat. Phys., 131, 543–558, 2008. a, b, c

Lucarini, V.: Evidence of dispersion relations for the nonlinear response of the Lorenz 63 system, J. Stat. Phys., 134, 381–400, 2009. a, b

Lucarini, V.: Stochastic perturbations to dynamical systems: a response theory approach, J. Stat. Phys., 146, 774–786, 2012. a, b, c

Ni, A.: Hyperbolicity, shadowing directions and sensitivity analysis of a turbulent three-dimensional flow, J. Fluid Mech., 863, 644–669, 2019. a

Ni, A. and Wang, Q.: Sensitivity analysis on chaotic dynamical systems by Non-Intrusive Least Squares Shadowing (NILSS), J. Comput. Phys., 347, 56–77, 2017. a

Nicolis, C.: Dynamics of model error: Some generic features, J. Atmos. Sci., 60, 2208–2218, 2003. a

Nicolis, C.: Error dynamics in extended-range forecasts, Q. J. Roy. Meteor. Soc., 142, 1222–1231, 2016. a, b, c, d

Nicolis, C., Perdigao, R. A., and Vannitsem, S.: Dynamics of prediction errors under the combined effect of initial condition and model errors, J. Atmos. Sci., 66, 766–778, 2009. a

Reinhold, B. B. and Pierrehumbert, R. T.: Dynamics of weather regimes: Quasi-stationary waves and blocking, Mon. Weather Rev., 110, 1105–1145, 1982. a, b, c, d, e, f

Ruelle, D.: General linear response formula in statistical mechanics, and the fluctuation-dissipation theorem far from equilibrium, Pys. Lett. A, 245, 220–224, 1998a. a, b

Ruelle, D.: Nonequilibrium statistical mechanics near equilibrium: computing higher-order terms, Nonlinearity, 11, 5, https://doi.org/10.1088/0951-7715/11/1/002, 1998b. a

Ruelle, D.: A review of linear response theory for general differentiable dynamical systems, Nonlinearity, 22, 855, https://doi.org/10.1088/0951-7715/22/4/009, 2009. a, b, c

Scheuerer, M. and Hamill, T. M.: Statistical postprocessing of ensemble precipitation forecasts by fitting censored, shifted gamma distributions, Mon. Weather Rev., 143, 4578–4596, 2015. a

Van Schaeybroeck, B. and Vannitsem, S.: Ensemble post-processing using member-by-member approaches: theoretical aspects, Q. J. Roy. Meteor. Soc., 141, 807–818, 2015. a, b

Vannitsem, S.: A unified linear model output statistics scheme for both deterministic and ensemble forecasts, Q. J. Roy. Meteor. Soc., 135, 1801–1815, 2009. a, b, c, d

Vannitsem, S.: Bias correction and post-processing under climate change, Nonlin. Processes Geophys., 18, 911–924, https://doi.org/10.5194/npg-18-911-2011, 2011. a

Vannitsem, S.: Predictability of large-scale atmospheric motions: Lyapunov exponents and error dynamics, Chaos: An Interdisciplinary Journal of Nonlinear Science, 27, 032101, https://doi.org/10.1063/1.4979042, 2017. a

Vannitsem, S. and Nicolis, C.: Dynamical properties of model output statistics forecasts, Mon. Weather Rev., 136, 405–419, 2008. a

Vannitsem, S., Wilks, D. S., and Messner, J.: Statistical postprocessing of ensemble forecasts, Elsevier, Amsterdam, the Netherlands, 2018. a

Vissio, G. and Lucarini, V.: A proof of concept for scale-adaptive parametrizations: the case of the Lorenz'96 model, Q. J. Roy. Meteor. Soc., 144, 63–75, 2018. a

Wang, Q.: Forward and adjoint sensitivity computation of chaotic dynamical systems, J. Comput. Phys., 235, 1–13, 2013. a, b, c

Wilks, D. S.: Statistical methods in the atmospheric sciences, vol. 100, Academic press, Oxford, UK, 2011. a

Wormell, C. L. and Gottwald, G. A.: On the validity of linear response theory in high-dimensional deterministic dynamical systems, J. Stat. Phys., 172, 1479–1498, 2018. a, b

Wouters, J. and Lucarini, V.: Disentangling multi-level systems: averaging, correlations and memory, J. Stat. Mech.-Theory E., 2012, P03003, https://doi.org/10.1088/1742-5468/2012/03/P03003, 2012. a

Young, L.-S.: What are SRB measures, and which dynamical systems have them?, J. Stat. Phys., 108, 733–754, 2002. a

We point the reader to recent articles dealing with the validity of the response theory for weakly hyperbolic systems and time series (Gottwald et al., 2016; Wormell and Gottwald, 2018).

When taking the gradient of a function A, the notation ∇_yA means taking the gradient at the point y, i.e. evaluating ∇_yA(y).

Response theory is also valid for stochastic models with a well-defined stationary measure, as shown for instance in Lucarini (2009).

⁴

Here we consider that the observation are perfectly assimilated in the models and that there is no observation errors. However in operational setups, such errors are of course to be taken into account.

⁵

This expression is equivalent to the second term of Eq. (1) in Lucarini (2012) upon a time transformation. It can also be obtained by explicitly computing the second-order perturbation of the average in Eq. (A14) in Appendix A.

Articles

Short summary

Postprocessing schemes used to correct weather forecasts are no longer efficient when the model generating the forecasts changes. An approach based on response theory to take the change into account without having to recompute the parameters based on past forecasts is presented. It is tested on an analytical model and a simple model of atmospheric variability. We show that this approach is effective and discuss its potential application for an operational environment.