Fractional relaxation noises, motions and the fractional energy balance equation

We consider the statistical properties of solutions of the stochastic fractional relaxation equation and its fractionally integrated extensions that are models for the Earth’s energy balance. In these equations, the highest-order derivative term is fractional, and it models the energy storage processes that are scaling over a wide range. When driven stochastically, the system is a fractional Langevin equation (FLE) that has been considered in the context of random walks where it yields highly nonstationary behaviour. An important difference with the usual applications is that we instead consider the stationary solutions of the Weyl fractional relaxation equations whose domain is −∞ to t rather than 0 to t . An additional key difference is that, unlike the (usual) FLEs – where the highest-order term is of integer order and the fractional term represents a scaling damping – in the fractional relaxation equation, the fractional term is of the highest order. When its order is less than 1/2 (this is the main empirically relevant range), the solutions are noises (generalized functions) whose high-frequency limits are fractional Gaussian noises (fGn). In order to yield physical processes, they must be smoothed, and this is conveniently done by considering their integrals. Whereas the basic processes are (stationary) fractional relaxation noises (fRn), their integrals are (nonstationary) fractional relaxation motions (fRm) that generalize both fractional Brownian motion (fBm) as well as Ornstein–Uhlenbeck processes. Since these processes are Gaussian, their properties are determined by their second-order statistics; using Fourier and Laplace techniques, we analytically develop corresponding power series expansions for fRn and fRm and their fractionally integrated extensions needed to model energy storage processes. We show extensive analytic and numerical results on the autocorrelation functions, Haar fluctuations and spectra. We display sample realizations. Finally, we discuss the predictability of these processes which – due to long memories – is a past value problem, not an initial value problem (that is used for example in highly skillful monthly and seasonal temperature forecasts). We develop an analytic formula for the fRn forecast skills and compare it to fGn skill. The large-scale white noise and fGn limits are attained in a slow power law manner so that when the temporal resolution of the series is small compared to the relaxation time (of the order of a few years on the Earth), fRn and its extensions can mimic a long memory process with a range of exponents wider than possible with fGn or fBm. We discuss the implications for monthly, seasonal, and annual forecasts of the Earth’s temperature as well as for projecting the temperature to 2050 and 2100.


Introduction
Over the last decades, stochastic approaches have rapidly developed and have spread throughout the geosciences. From early beginnings in hydrology and turbulence, stochasticity has made inroads in many traditionally deterministic areas. This is notably illustrated by stochastic parameterizations of numerical weather prediction models, e.g. Buizza et al. (1999), and the "random" extensions of dynamical systems theory, e.g. Chekroun et al. (2010).
In parallel, pure stochastic approaches have developed primarily along two distinct lines. One is the classical (integerordered) stochastic differential equation approach based on the Itô or Stratonovich calculus that goes back to the 1950s (see the useful review by Dijkstra, 2013). The other is the scaling strand that encompasses both linear (monofractal, Mandelbrot, 1982) and nonlinear (multifractal) models (see the review by Lovejoy and Schertzer, 2013) that are based on phenomenological scaling models, notably cascade processes. These and other stochastic approaches have played important roles in nonlinear geoscience.
Up until now, the scaling and differential equation strands of stochasticity have had surprisingly little overlap. This is at least partly for technical reasons: integer-ordered stochastic differential equations have exponential Green functions that are incompatible with wide-range scaling. However, this shortcoming can -at least in principle -be easily overcome by introducing at least some derivatives of fractional order. Once the (typically) ad hoc restriction on integer orders is dropped, the Green functions are based on "generalized exponentials" that in turn are based on fractional powers (see the review by Podlubny, 1999). The integerordered stochastic equations that have received the most attention are thus the exceptional, non-scaling special cases. In physics they correspond to classical Langevin equations; in geophysics and climate modelling, they correspond to the linear inverse modelling (LIM) approach that goes back to Hasselmann (1976) and later elaborated notably by Penland and Magorian (1993), Penland (1996), Sardeshmukh et al. (2000), Sardeshmukh andSura (2009) andNewman (2013). Although LIM is not the only stochastic approach to climate, in two recent representative multi-author collections (Palmer and Williams, 2010;Franzke and O'Kane, 2017), all 32 papers shared the integer-ordered assumption (a single exception being Watkins, 2017; see also Watkins et al., 2020).
Under the title "Fractal operators", West et al. (2003) review and emphasize that, in order to yield scaling behaviours, it suffices that stochastic differential equations contain fractional derivatives. However, when it is the time derivatives of stochastic variables that are fractional -fractional Langevin equations (FLEs) -then the relevant processes are generally non-Markovian (Jumarie, 1993), so that there is no Fokker-Planck (FP) equation describing the corresponding probabilities. Even in the relatively few cases where the FLE has been studied, the fractional terms are generally models of viscous damping, so that the highest-order terms are still integerordered (an exception is Watkins et al., 2020, who mention "fractionally integrated FLE" of the type studied here but without investigating its properties). Integer-ordered terms have the convenient consequence of regularizing the solutions, so that they are at least root mean square continuous; in this paper the highest-order derivatives are fractional, so that when the highest-order terms are ≤ 1/2, the solutions are "noises", i.e. generalized functions that must be smoothed in order to represent physically meaningful quantities.
An additional obstacle is that -as with the simplest scaling stochastic model, fractional Brownian motion (fBm, Mandelbrot and Van Ness, 1968) -we expect that the solutions will not be semi-martingales and hence that the Itô calculus used for integer-ordered equations will not be applicable (see Biagini et al., 2008). This may explain the relative paucity of mathematical literature on stochastic fractional equations (see however Karczewska and Lizama, 2009). In statistical physics, starting with Mainardi and Pironi (1996), Metzler and Klafter (2000) and Lutz (2001) helped with numerics; the FLE (and a more general "Generalized Langevin Equation", Kou and Sunney Xie, 2004;Watkins et al., 2019) has received a little more attention as a model for (nonstationary) particle diffusion (see West et al., 2003, for an introduction, or Vojta et al., 2019, for a more recent example). These technical aspects may explain why the statistics of the resulting processes are not available in the literature.
Technical difficulties may also explain the apparent paradox of continuous-time random walks (CTRWs) and other approaches to anomalous diffusion that involve fractional equations. While CTRW probabilities are governed by the deterministic fractional-ordered generalized fractional diffusion equation (e.g. Hilfer, 2000;Coffey et al., 2012), the walks themselves are based on specific particle jump models rather than (stochastic) Langevin equations. Alternatively, a (spatially) fractional-ordered Fokker-Planck equation may be derived from an integer-ordered but nonlinear Langevin equation for a diffusing particle driven by an (infinite-variance) Levy motion (Schertzer et al., 2001).
In nonlinear geoscience, it is all too common for mathematical models and techniques developed primarily for mathematical reasons to be subsequently applied to the real world. This approach -effectively starting with a solution and then looking for a problem -occasionally succeeds, yet historically the converse has generally proved more fruitful. The proposal that an understanding of the Earth's energy balance requires the fractional energy balance equation (FEBE, Lovejoy et al., 2021, announced in Lovejoy, 2019a is an example of the latter. First, the scaling exponent of macroweather (monthly, seasonal, interannual) temperature stochastic variability was determined (H I ≈ −0.085 ± 0.02) and shown to permit skillful global temperature predictions (Lovejoy, 2015b;Del Rio Amador and Lovejoy, 2019), and then it was extended to regional temperatures (at 2 • × 2 • resolution) (Del Rio Amador and Lovejoy, 2019Lovejoy, , 2021a. The latter papers showed how the long-memory high-frequency approximation to the FEBE can not only make state-of-the-art multi-month temperature forecasts, but also how the corresponding simulations generate emergent properties such as realistic El Niño events. In parallel, the multidecadal deterministic response to external (anthropogenic, deterministic) forcing was shown to also obey a scaling law but with a different exponent (Hébert, 2017;Lovejoy et al., 2017;Procyk et al., 2020Procyk et al., , 2022Procyk, 2021), H F ≈ −0.5 ± 0.2. It was only then realized that the order h FEBE naturally accounts for both the high-and lowfrequency global temperature exponents with h = H I + 1/2 and H F = −h, with both empirical exponents recovered with a FEBE of order h ≈ 0.38 ± 0.03. The realization that the FEBE fit these basic empirical facts motivated the present research into its statistical properties, including its predictability.
In the EBE, energy storage is modelled by a uniform slab of material, implying that, when perturbed, the temperature exponentially relaxes to a new thermodynamic equilibrium. However, as reviewed in Lovejoy and Schertzer (2013), both conventional global circulation models and observations show that atmospheric, oceanic and surface (e.g. topographic) structures are spatially scaling. A consequence is that the temperature relaxes to equilibrium in a power law manner. This motivated earlier approaches (van Hateren, 2013;Rypdal, 2012;Hébert, 2017;Lovejoy et al., 2017) to postulate that the climate response function (CRF) itself is scaling. However, these models require either ad hoc truncations or imply infinite sensitivity to small perturbations (Rypdal, 2015;Hébert and Lovejoy, 2015).
The FEBE instead situates the scaling in the energy storage processes; this is the physical basis for the phenomenological derivation of the FEBE proposed in Lovejoy et al. (2021), and the zeroth-order term guarantees that equilibrium is reached after long enough times. The scaling of the basic physical quantities in both time and space motivates the study of the FEBE and its fractionally integrated extensions discussed below with temperature treated as a stochastic variable. The FEBE determines the Earth's global temperature when the energy storage processes are scaling and modelled by a fractional time-derivative term. Recently, analysis of the atmospheric radiation budget has shown that, at least over some regions, the internal component of the radiative forcing may itself be scaling: this justifies the consideration of the extensions to fGn forcing.
The FEBE differs from the classical energy balance equation (EBE) in several ways. Whereas the EBE is integerordered and describes the deterministic, exponential relaxation of the Earth's temperature to equilibrium, the FEBE is of fractional order, and because it is both deterministic and stochastic, it unites all the forcings and responses into a single model. Whereas the stochastic part represents the forcing and response to the unresolved degrees of freedom -the "internal variability" -and is treated as a zero mean Gaussian noise, the deterministic part represents the external (e.g. anthropogenic) forcing and the forced response modelled by the total external forcing. Complementary work (Procyk et al., 2020(Procyk et al., , 2022Procyk, 2021) uses the deterministic FEBE as the basic model for the response to external forcing, but its Bayesian parameter estimation uses the stochastic FEBE to characterize the likelihood function of the residuals assumed to be the responses to stochastic internal forcing and governed by the same equation. It thus avoids the ad hoc error models involved in conventional Bayesian parameter estimation. The result is a parsimonious, FEBE projection of the Earth's temperature to 2100 that has much lower uncertainty than the classical global circulation model alternative. This is the first time that classical general circulation model climate projections have been confirmed by an independent, qualitatively different, approach.
An important but subtle EBE-FEBE difference is that, whereas the former is an initial value problem whose initial condition is the Earth's temperature at t = 0, the FEBE is effectively a past value problem whose prediction skill improves with the amount of available past data, and -depending on the parameters -it can have an enormous memory (Del Rio Amador and Lovejoy, 2021b). To understand this, recall that an important aspect of fractional derivatives is that they are defined as convolutions over various domains. To date, the main one that has been applied to physical problems is the Riemann-Liouville (and the related Caputo) fractional derivative specialized to convolutions over the interval between an initial time = 0 and a later time t. With one or two exceptions, this is the domain considered in Podlubny's mathematical monograph on deterministic fractional differential equations (Podlubny, 1999) as well as in the stochastic fractional physics discussed in West et al. (2003), Herrmann (2011), Atanackovic et al. (2014), and most of the papers in Hilfer (2000) (with the partial exceptions of Schiessel et al., 2000, andNonnenmacher andMetzler, 2000). A key point of the FEBE is that it is instead based over semiinfinite domains -here from −∞ to t -often called Weyl fractional derivatives. This is the natural range to consider for the Earth's energy balance, and it is needed to obtain statistically stationary responses. Random walk problems involving fractional equations over the domain 0 to t can be dealt with using Laplace transform techniques. In comparison, the Earth's temperature balance involves statistically stationary stochastic forcings that are more conveniently dealt with using Fourier techniques.
We have mentioned that the FEBE can be derived phenomenologically where the fractional derivative of order h term represents the energy storage processes . In this approach order h is an empirically determined parameter with h = 1 corresponding to the classical (exponential) exception. Alternatively, it may derived from a more fundamental starting point, the classical heat equation -the same starting point as the classical Budyko-Sellers energy balance models (Budyko, 1969;Sellers, 1969). Recently it was shown with the help of Babenko's operator method that the special h = 1/2 FEBE -the half-ordered energy balance equation (HEBE) -could be derived analytically from the classical heat equation (Lovejoy, 2021a, b).
To obtain the HEBE, it is sufficient to follow the Budyko-Sellers approach but to avoid one of their key approximations. The Earth's atmosphere and ocean are driven by local imbalances in radiative fluxes. While Budyko-Sellers models simply redirect this flux away from the Equator, the HEBE improvement (Lovejoy, 2021a, b) is to instead use the mathematically correct radiative-conductive surface boundary conditions. When this is done in the classical energy transport equation, one obtains an important h = 1/2 special case of the FEBE, the half-order EBE or HEBE. The use of half-order derivatives in the heat equation is completely classical and goes back to at least Oldham (1973), Oldham and Spanier (1972), Babenko (1986), Magin et al. (2004), and Sierociuk et al. (2013). The extension to h = 1/2 can be obtained using the same mathematical techniques by starting with the fractional generalization of the classical heat equation, the fractional heat equation. Further generalizations are also possible and will be reported elsewhere.
The choice of a Gaussian white noise forcing was made not so much for its theoretical simplicity as for its physical realism. Using scaling to divide atmospheric dynamics into dynamical ranges (Lovejoy, 2013(Lovejoy, , 2015a(Lovejoy, , 2019b, the main ones are weather, macroweather and climate. While the temperature variability in both space and time is generally highly intermittent (multifractal), there is one exception: the temporal macroweather regime (starting at the lifetime of planetary structures -roughly 10 d -up until the climate regime at much longer scales). Macroweather is the regime over which the FEBE applies, and it has exceptionally low intermittency: temporal (but not spatial) temperature anomalies are not far from Gaussian (Lovejoy, 2018). Responses to multifractal or Levy process FEBE forcings may however be of interest elsewhere.
This paper is structured as follows. In Sect. 2 we present the fractional relaxation equation, forced by a Gaussian white noise as a natural generalization of classical fractional Brownian motion, fractional Gaussian noise and Ornstein-Uhlenbeck processes (Sect. 2.1 and 2.2). When forced by Gaussian white noises, the solutions define the corresponding fractional relaxation motions (fRm) and fractional relaxation noises (fRn). We consider further extensions to the case where the equation is forced by a scaling noise fGn (Sect. 2.3,Eqs. 21 and 22). This is equivalent to considering the fractionally integrated fractional relaxation equation with white noise forcing. In Sect. 2, we first solve the equations in terms of Green's functions and then introduce powerful Fourier techniques that yield integral representations of the second-order statistics, including autocorrelations, structure functions (Eqs. 33 and 35), Haar fluctuations and spectra (with many details in Appendix A and in Appendix B, we derive the properties of the HEBE special case). In Sect. 3, we develop both short-and long-time (asymptotic) series expansions for the statistics (Eqs. 49 and 51), and we display and discuss sample fRn and fRm processes. In Sect. 4 we discuss the problem of prediction -important for macroweather forecasting -and derive expressions for the optimum predictor (Eq. 63) and its theoretical prediction skill as a function of forecast lead time (Eq. 68). In Sect. 5 we conclude.
We could note that the paper is somewhat complex due to the necessity of developing several approaches: Fourier for the main integral representations (Sect. 2), Laplace for the asymptotic expansions (Sect. 3), and real space for the predictability results (Sect. 4).

fRn, fRm, fGn and fBm
In the introduction, we outlined physical arguments that the Earth's global energy balance could be well modelled by the fractional energy balance equation. Taking T as the globally averaged temperature, τ as the characteristic timescale for energy storage/relaxation processes, F as the (stochastic) forcing (energy flux; power per area), and s as the climate sensitivity (temperature increase per unit flux of forcing), the FEBE can be written in Langevin form as where the Riemann-Liouville fractional derivative symbol a D h t is defined as where is the standard gamma function. Derivatives of order ν > 1 can be obtained using ν = h+m, where m is the integer part of ν, and then applying this formula to the mth ordinary derivative. The main case studied in applications (e.g. random walks) is a = 0, so that Laplace transform techniques are often used (alternatively, the somewhat different Caputo fractional derivative is used). However, here we will be interested in a = −∞: the Weyl fractional derivative −∞ D h t , which is naturally handled by Fourier techniques (Sect. 2.4 and Appendices A and B), and, in this case, this distinction is unimportant.
Since Eq. (1) is linear, by taking ensemble averages, it can be decomposed into deterministic and random components with the former driven by the mean forcing external to system F and the latter by the fluctuating stochastic component F − F representing the internal forcing driving the internal variability. The deterministic part has been used to project the Earth's temperature throughout the 21st century (Procyk et al., 2020(Procyk et al., , 2022; in the following we consider the simplest purely stochastic model in which F = 0 and F = γ , where γ is a Gaussian "delta-correlated" and with unit amplitude white noise: In Hébert (2017), Lovejoy et al. (2017), and , it was argued on the basis of an empirical study of ocean-atmosphere coupling that τ r ≈ 2 years, while recent work indicates a value somewhat higher, ≈ 5 years (Procyk et al., 2022). At high frequencies,  and Del Rio Amador and Lovejoy (2019Lovejoy ( , 2021a showed that the value h ≈ 0.4 reproduced the Earth's temperature at scales < τ as well as for macroweather scales (longer than the weather regime scales of about 10 d) but still < τ . Procyk et al. (2020)  (1) with γ (t) replaced by a deterministic forcing is a fractional generalization of the usual (h = 1) relaxation equation; when 1 < h < 2, it is the "fractional oscillation equation", a generalization of the usual (h = 2) oscillation equation (Podlubny, 1999).
To simplify the development, we use the relaxation time τ to nondimensionalize time, i.e. to replace time by t/τ to obtain the canonical Weyl fractional relaxation equation: for the nondimensional process U h . The dimensional solution of Eq. (1) with nondimensional γ = sF is simply T (t) = τ −1 U h (t/τ ), so that in the nondimensional Eq. (4), the characteristic transition "relaxation" time between dominance by the high frequency (differential) and the low frequency (U h term) is t = 1. Although we give results for the full range 0 < h < 2 -i.e. both the "relaxation" and "oscillation" ranges -for simplicity, we refer to the solution U h (t) as "fractional relaxation noise" (fRn) and to Q h (t) as "fractional relaxation motion" (fRm). Note that fRn is only strictly a noise when h ≤ 1/2. In dealing with fRn and fRm, we must be careful of various small and large t divergences. For example, Eqs. (1) and (4) are the fractional Langevin equations corresponding to generalizations of integer-ordered stochastic diffusion equations: the classical h = 1 case is the Ohrenstein-Uhlenbeck process. Since γ (t) is a "generalized function" -a "noise"it does not converge at a mathematical instant in time, and it is only strictly meaningful under an integral sign. Therefore, a standard form of Eq. (4) is obtained by integrating both sides by order h (i.e. by differentiating by −h and assuming that differentiation and integration of order h commute): (see e.g. Karczewska and Lizama, 2009). The white noise forcing in the above is statistically stationary; the solution for U h (t) is also statistically stationary. It is tempting to obtain an equation for the motion Q h (t) by integrating Eq. (4) from −∞ to t to obtain the fractional Langevin equation where W is the Wiener process (a standard Brownian motion) satisfying dW = γ (t)dt. Unfortunately the Wiener process-integrated −∞ to t almost surely diverges, and hence we relate Q h to U h by an integral from 0 to t.
In the high-frequency limit, the derivative dominates, and we obtain the simpler fractional Langevin equation whose solution F h is the fractional Gaussian noise process (fGn, not to be confused with the forcing) and whose integral B h is fractional Brownian motion (fBm). We thus anticipate that F h and B h are the high-frequency limits of fRn and fRm.

Green's functions
Although it will turn out that Fourier techniques are very convenient for calculating the statistics, there are also advantages to classical (real-space) approaches, and in any case they are needed for studying the predictability properties (Sect. 4). We therefore start with a discussion of Green's functions that are the classical tools for solving inhomogeneous linear differential equations: where G (fGn) 0,h and G (fRn) 0,h are Green's functions for the differential operators corresponding respectively to −∞ D h t and −∞ D h t + 1. Note that, due to causality, all Green's functions used in this paper vanish for t < 0. G (fGn) 0,h and G (fRn) 0,h are the usual "impulse" (Dirac) response Green's functions (hence the subscript "0"). For the differential operator , they satisfy Integrating this equation, we find an equation for their integrals G 1,h , which are thus "step" (Heaviside, subscript "1") response Green functions satisfying where is the Heaviside (step) function (= 0 for t < 0, = 1 for t ≥ 0). The inhomogeneous equation

98
S. Lovejoy: Fractional relaxation noises, motions and the fractional energy balance equation has a solution in terms of either an impulse or a step Green function: the equivalence being established by integration by parts with the conditions F (−∞) = 0 and G 1,h (0) = 0. The use of the step rather than impulse response is standard in the energy balance equation literature since it gives direct information on energy balance and the approach to equilibrium (see e.g. Lovejoy et al., 2021). The step response for the noise is also the basic impulse response function for the motion. For fGn, Green's functions are simply the kernels of the fractional integrals obtained by integrating both sides of Eq. (6) by order h. We conclude that For fRn, we now recall some classical results useful in geophysical applications. First, these Green functions are often equivalently written in terms of Mittag-Leffler functions ("generalized exponentials"), E α,β .
To lighten the notation in Eq. 14 and in the following, we suppress the superscripts for fRn and fRm processes. A convenient feature of Mittag-Leffler functions is that they can easily be integrated by any positive order α: (Podlubny, 1999). As mentioned, the constraint t > 0 is due to causality, and physical Green functions vanish for negative arguments. In the following this will simply be assumed. With α = 1, we obtain the useful formulas Figure 1. The impulse (a, b) and step response functions (c, d) for the fractional relaxation range (a, c: 0 < h < 1, and red is h = 1, the exponential), the black curves, bottom to top, are for h = 1/10, 2/10, . . . 9/10 and the fractional oscillation range (b, d: 1 < h < 2, red is the integer values h = 1, c, d is the exponential, and a, b h = 2), and the sine function, the black curves, bottom to top are for h = 11/10, 12/10, . . . , 19/10.
With this, we see that G (fGn) 0,h and G (fGn) 1,h are simply the first terms in the power series expansions of the corresponding fRn and fRm Green functions. The solution to Eq. (4) with the white noise forcing γ (t) is therefore where for this "pure" fRn process, we have added the subscript "0" for reasons discussed below. We note that, at the origin, for 0 < h < 1, G 0,h is singular, whereas G 1,h is regular, so that it may be advantageous to use the latter (step) response function (for example in the numerical simulations in Sect. 4). These Green function responses are shown in Fig. 1. When 0 < h ≤ 1, the step response is monotonic; in an energy balance model, this would correspond to relaxation to equilibrium. When 1 < h < 2, we see that there are overshoot and oscillations around the long-term value; it is therefore (presumably) outside the physical range of an equilibrium process.
In order to understand the relaxation process -i.e. the approach to the asymptotic value 1 in Fig. 1 for the step response G 1,h -we need the asymptotic expansion For α = 0, 1 we obtain the special cases corresponding to impulse and step responses: (0 < h < 1, 1 < h < 2; note that the n = 0 terms are 0 and 1 for G 0,h and G 1,h respectively) (Podlubny, 1999), i.e. the asymptotic expansions are power laws in t −h rather than t h . According to this, the asymptotic approach to the step function response (bottom row in Fig. 1) is a slow, power law process. In the FEBE, this implies for example that the classical CO 2 doubling experiment would yield a power law rather than exponential approach to a new thermodynamic equilibrium. Comparing this to the EBE, i.e. the special case h = 1, we have so that when h = 1, the asymptotic step response is instead approached exponentially quickly. We see that when h = 1, the process is a classical Ornstein-Uhlenbeck process, so that fRn can be considered a generalization of the latter. There are also analytic formulae for fRn when h = 1/2 (the HEBE) is discussed in Appendix B, notably involving logarithmic corrections.

The α-order fractionally integrated fRn and fRm processes
Before proceeding to discuss the statistics of fRn and fRm processes, it is useful to make a generalization to the fractionally integrated processes: U α,h is the "α-order-integrated, fractional h relaxation noise". Combined with Green's function relation , we find that U α,h and G α,h are respectively the fractionally integrated relaxation noises and Green's functions of the fractionally integrated fractional relaxation equation: If the highest-order derivative is constrained to be an integer (i.e. α +h = 1 or 2), then the equation is a standard fractional Langevin equation; for example, U could be for the velocity of a particle with fractional damping and white noise forcing, although even here, the initial conditions are usually taken to be at t = 0 and not t = −∞. Equivalently, U α,h is the solution of the relaxation equation but with an fGn forcing: (the Weyl fractional derivatives commute). F α is the α-order fGn process, and the restriction α < 1/2 is needed to ensure low-frequency convergence (see below).
In the Earth's radiative balance, such fractionally integrated fRn processes arise in two physically interesting situations. The first is where the forcing itself has a long memory -e.g. it is an fGn process. Whereas the memory in a pure fRn process is purely from the high-frequency storage term, in this case, the forcing (the overall radiative imbalance) also contributes to the memory, and this has important consequences for the predictability (Sect. 4). Although the solutions U α,h are mathematically the same whether from the fractional relaxation equation with fGn forcing (Eq. 23) or the fractionally integrated fractional relaxation equation with white noise forcing (Eq. 22), only the former is directly relevant for the Earth's energy balance. This is because the energy balance involves the response from both stochastic (internal) and deterministic (external) forcing. For the latter, it is important that, following a step function forcing, at long times, the system will approach a new state of thermodynamic equilibrium. This implies that the term in the equation that dominates at low frequencies -the lowest-order termis of order zero, so that if F in Eq. (1) is a step function, the new equilibrium temperature (anomaly) is T = sF .
The second situation where fractionally integrated fRn processes arise is for the energy storage (even in the purely white noise forcing case). The storage process is the difference between the forcing and the response: so that Even when the forcing is pure white noise (α = 0), the storage is an h-ordered fractionally integrated process: S 0,h = U h,h ; this corresponds to the storage following an impulse forcing. The storage following a step forcing is obtained by integration order 1: U 1+h,h . Similarly, Green's function for the fRn storage following an impulse forcing is G h,h and, following a step forcing, G 1+h,h (Fig. 2). Since it turns out that most of the pure fRn (α = 0) results are readily generalized to 0 < α < 1/2, many fractionally integrated results are given below.

Statistics
In the above, we discussed fGn, fRn and their order 1 integrals fBm, fRm as well as fractional generalizations, presenting a classical (real-space) approach stressing the links with fGn and fBm. We now turn to their statistics. U α,h (t) is a mean zero stationary Gaussian process (i.e. U α,h (t) = 0, where " . " indicates ensemble or statistical averaging); therefore, its statistics are determined completely by its autocorrelation function R α,h (t), which is only a function of the . Black is for h = 1/10, 2/10, . . . 10/10 and red for 11/10, 12/10, . . . 19/10 (to identify the curves; use the fact that at large t, they are in order of increasing h -bottom to top). For small t, G h,h ∝ t 2h−1 (Eq. 15), so that for h ≤ 1/2, the impulse response is singular at the origin. For large t, G h,h ∝ t h−1 (Eq. 18), so that for h < 1, the total impulse response storage decreases following the impulse; for h = 1 (the EBE), it tends to unity, and for h > 1, it diverges.
lag t: The far-right equality follows from U α,h = G α,h * γ and γ (t)γ (t ) = δ(t − t ) ("*" indicates "convolution"). The process can only be normalized by R α,h (0) when there is no small-scale divergence, i.e. when When α + h ≤ 1/2, this diverges; in order to be normalized, the process must be averaged at a finite resolution (below). Although it is possible to follow Mandelbrot and Van Ness (1968) and derive many statistical properties in real space, a Fourier approach is not only more streamlined, but is also more powerful. The reason for the simplicity of the Fourier approach is that the Fourier transform (FT, indicated by the tilde) of the Weyl fractional derivative is symbolically (e.g. Podlubny, 1999). This is simply the extension of the usual rule for the FT of integer-ordered derivatives. Therefore, since U α,h and G α,h are respectively solutions and Green's functions of the fractionally integrated fractional relaxation equation (Eq. 22), we have so that We see that in the limit h → 0, U α,0 is an α-order fGn process (see e.g. Eq. 23). Now we can use the fact that the white noise γ has a flat spectrum: The modulus (vertical bars) intervenes since for any real function f (t) we have f (ω) = f * (−ω), where the superscript "*" indicates a complex conjugate. Application of Eq. (31) leads to i.e. the spectrum E U is the FT of the correlation function R α,h (t) (the Wiener-Khinchin theorem). Applying this to U α,h , we obtain .
This shows that R α,h (t) = R α,h (−t), so that below, we only consider t ≥ 0. Since R α,h (0) diverges for α + h < 1/2, we consider the integral Q α,h of the process (the "motion") from which we can easily compute the average. The corresponding variance In terms of U α,h (ω), We see that at low frequencies, when α ≥ 1/2, the integral diverges for all t. Also note that a series expansion for V α,h (t) in t will only have even-ordered integer power terms. Comparing Eqs. (33) and (35), we see that R and V are linked by the simple relation Therefore, by integrating Eq. (26) (twice), we can express V α,h in terms of G α,h : This can be verified by differentiation and using The basic behaviour can be understood in the Fourier domain. First, putting t = 0 in Eq. (32) (i.e. "Parseval's theorem"), we have so that when α + h < 1/2, R diverges at high frequencies (small t), and hence to represent a physical process (here, the Earth's temperature), the process must be averaged over a finite-resolution τ . When α + h > 1/2, R(0) is finite and can therefore be used to obtain a normalized autocorrelation function (Eq. 27). From Eq. (32), we may also easily obtain the asymptotic high-and low-frequency behaviours of the energy spectrum:

Finite-resolution processes
When α + h < 1/2, the process does not converge at any instant t: it is a noise, a generalized function. To represent the Earth's temperature, it must therefore be averaged at a finiteresolution τ : Applying Eqs. (34) and (40), we obtain the "resolution τ " autocorrelation: Alternatively, measuring time in units of the resolution λ = t/τ , R a,h,τ can be conveniently written in terms of centred finite differences: The finite-difference formula is valid for t ≥ τ . For finite τ , it allows us to obtain the correlation behaviour by replacing the second difference with a second derivative, an approximation that is very good except when t is close to τ . Taking the limit τ → 0 in Eq. (43), we obtain the second derivative formula Eq. (36).

102
S. Lovejoy: Fractional relaxation noises, motions and the fractional energy balance equation 3 Application to fBm, fGn, fRm, and fRn

fBm and fGn
The above derivations were for noises and motions derived from differential operators whose impulse and step Green functions had convergent V α,h (t). Before applying them to fRn and fRm, we illustrate this by applying them first to fBm and fGn.
The fBm results are obtained by using the fGn step Green function (Eq. 13) in Eq. (35) The standard normalization and parametrization are This normalization turns out to be convenient not only for fBm, but also for fRm, so that for the normalized process, where we have introduced the standard fBm parameter H = h + 1/2, so that and hence H is the fluctuation exponent for fBm. Note that fBm is usually defined as the Gaussian process with V H given by Eq. (46), i.e. with this normalization (e.g. Biagini et al., 2008).
We can now calculate the correlation function relevant for the fGn statistics. With the above normalization, the bottom approximations are valid for large-scale ratios λ. We note the difference in sign for H > 1/2 ("persistence") and for H < 1/2 ("anti-persistence"). When H = 1/2, the noise corresponds to standard Brownian motion, and it is uncorrelated.

fRm and fRn
Since fRm and fRn are Gaussian, their properties are determined by their second-order statistics, by V α,h (t) and R α,h (t). These statistics are second order in G α,h (t) and can most easily be determined using the Fourier representation of G α,h (t) (Sect. 2.4, Appendices A and B). The development is challenging because unlike the G α,h (t) functions that are entirely expressed in series of fractional powers of t, V α,h (t) and R α,h (t) involve mixed fractional and integer power expansions; the details are given in the Appendices, and here we summarize the main results. First, for the noises, we have At small t, the lowest-order terms dominate, and the normalized autocorrelations are thus (note that F 3 < 0 for 3/2 < h + α < 2; see Appendix A). We see that at small t, the behaviour of the normalized autocorrelations depends essentially on the sum h + α; in particular, when h + α < 1/2, the process is effectively an fGn process with an effective fluctuation exponent H = −1/2 + (h + α). This is to be expected since α + h is the highest-order term in the fractionally integrated fractional relaxation equation (Eq. 22).

V α,h (t)
Integrating twice α+h ), so that the fBm coefficient can be used for normalization using R α,h,τ (0) = τ −2 V α,h (τ ). When h + α > 1/2, this normalization becomes negative, so that it cannot be used; however, in this case, R α,h (0) = F 1 and may be used for normalization instead. For an analytic expression, convergence properties including numerical results and modified expansions converge more rapidly; see Appendix A and, for the special case h = 1/2, Appendix B.
For convenience, the leading terms of the normalized V α,h are
Integrating R α,h twice and doubling, we obtain (the full expansion is given in Appendix A; see Fig. 4 for plots). The constants of integration a α,h and b α,h are not determined since the expansion is not valid at t = 0; they can be determined numerically if needed. However, in the limit α → 0 (the pure fRn case), the leading term is exactly t (corresponding to ordinary Brownian motion), so that an extra a 0,h is not needed (Appendix A). When α > 0, the far-left (fGn) term from the forcing dominates; at large enough t, V α,h (t) ∝ t 2H with H = α + 1/2, and the corresponding motion is an fBm.

Haar fluctuations
A useful statistical characterization of the processes is by the statistics of their Haar fluctuations over an interval t. For an interval t, Haar fluctuations (based on Haar wavelets) are the differences between the averages of the first and second halves of an interval. For a process U , the Haar fluctuation is In terms of the process at resolution t/2 (i.e. averaged at this scale), U t/2 (t): Therefore, where V (t) is the variance of the integral of U over an interval t (Eq. 34). Using Eq. (60), we can determine the behaviour of the root mean square (rms) Haar fluctuations; terms like V α,h (t) ∝ t ξ contribute ∝ t ξ/2−1 to the rms Haar fluctuation U α,h ( t) 2 Haar 1/2 (the exception is when ξ = 2, which contributes nothing). Applying this equation to fGn parameter h, we obtain F h ( t) 2 Haar 1/2 ∝ t H with H = h − 1/2. Using the results above for V α,h , we therefore obtain the leading exponents: (61) Figure 5 shows that the theory agrees well with the numerics. For the range of α, h discussed here (0 ≤ α < 1/2, 0 ≤ h ≤ 2), H spans the range −1/2 (white noise) to 1. In comparison, fGn processes have H covering the range −1 < H < 0 and fBm processes have 0 < H < 1; therefore, depending on whether the process is observed at timescales below or above the relaxation timescale ( t = 1), fractionally integrated fRn processes can mimic fGn or fBm processes. If we consider the integrals -the motions -the value of H is increased by 1 (although for Haar fluctuations, it cannot exceed H = 1). Overall, from an empirical viewpoint, if over some range of scales (that may only be a factor of 100 or less), it may be quite hard to distinguish the various models, especially since the transition from low-to high-frequency scaling may be Figure 5. The rms Haar fluctuation plots for the pure (α = 0) fRn process for 0 < h < 1/2 (a), 1/2 < h < 1 (b), 1 < h < 3/2 (c), and 3/2 < h < 2 (d). The individual curves correspond to those of Figs. 3 and 4. The small t slopes follow the theoretical values h − 1/2 up to h = 3/2 (slope = 1); for larger h, the small t slopes all equal 1. Also, at large t due to dominant V ≈ t terms, in all cases we obtain slopes t −1/2 . very slow (see especially Appendix B for the h = 1/2 case). Recent work shows that the maximum likelihood method may be the optimum parameter estimation technique (Procyk, 2021).

Sample processes
It is instructive to view some samples of fRn and fRm processes (here we consider only α = 0). For simulations, both the small-and large-scale divergences must be considered. Starting with the approximate methods developed by Mandelbrot and Wallis (1969), it took some time for exact fBm and fGn simulation techniques to be developed (Hipel and McLeod, 1994;Palma, 2007). Fortunately, for fRm and fRn, the low-frequency situation is easier since the long-time memory is much smaller than for fBm and fGn. Therefore, as long as we are careful to always simulate series a few times longer than the relaxation time and then to throw away the earliest 2/3 or 3/4 of the simulation, the remainder will have accurate statistics. With this procedure to take care of lowfrequency issues, we can therefore use the solution for fRn in the form of a convolution and use standard numerical convolution algorithms.
We must nevertheless be careful about the high frequencies since the impulse response Green functions G 0,h are singular for h < 1. In order to avoid singularities, simulations of fRn are best made by first simulating the motions Q 0,h using Q 0,h ∝ G 1,h * γ and obtaining the resolution τ fRn, using U 0,h,τ (t) = (Q 0,h (t + τ ) − Q 0,h (t))/τ . Numerically, this allows us to use the smoother (nonsingular) G 1,h in the convolution rather than the singular G 0,h . The simulations shown in Figs. 6-9 follow this procedure, and the Haar fluctuation statistics were analysed, verifying the statistical accuracy of the simulations.
In order to clearly display the behaviours, recall that when t 1, we showed that all the fRn converge to Gaussian white noises and the fRm to Brownian motions (albeit in a slow power law manner). At the other extreme, for t 1, we obtain the fGn and fBm limits (when 0 < h < 1/2) and their generalizations for 1/2 < h < 2. Figure 6 shows three simulations, each of length 2 19 pixels, with each pixel corresponding to a temporal resolution of τ = 2 −10 , so that the unit (relaxation) scale is 2 10 elementary pixels. Each simulation uses the same random seed, but they have h's increasing from h = 1/10 (top set) to h = 5/10 (bottom set). The fRm on the right is from the running sum of the fRn on the left. Each series has been rescaled, so that the range (maximum-minimum) is the same for each. Starting at the top line of each group, we show 2 10 points of the original series degraded by a factor of 2 9 . The second line shows a blow-up by a factor of 8 of the part of the upper line to the right of the dashed vertical line. The line below is a further blow-up by a factor of 8 until the bottom line shows a 1/512 part of the full simulation but at full resolution. The unit scale indicating the transition from small to large is shown by the horizontal red line in the middle-right figure. At the top (degraded by a factor of 2 9 ), the unit (relaxation) scale is 2 pixels, so that the top line degraded view of the simulation is nearly a white noise (left) or (ordinary) Brownian motion (right). In contrast, the bottom series is exactly of length unity, so that it is close to the fGn limit with the standard exponent H = h + 1/2. Moving from bottom to top in Fig. 6, one effectively transitions from fGn to fRn (left column) and from fBm to fRm (right column).
If we take the empirical relaxation scale for the global temperature to be 2 7 months (≈ 10 years, Lovejoy et al., 2017) and we use monthly-resolution temperature anomaly data, then the nondimensional resolution is 2 −7 , corresponding to the second series from the top (which is thus 2 10 months ≈ 80 years long). Since h ≈ 0.38 ± 0.03 (Procyk et al., 2022), the second series from the top in the bottom set is the most realistic, and we can make out the low-frequency undulations that are mostly present at scales 1/8 of the series (or less). Figure 7 shows realizations constructed from the same random seed but for the extended range 1/2 < h < 2 (i.e. beyond fGn). Over this range, the top (large-scale, degradedresolution) series are close to white noises (left) and Brownian motions (right). For the bottom series, there is no equivalent fGn or fBm process and the curves become smoother, although the rescaling may hide this somewhat (see for example the h = 13/20 set, the blow-up of the farright 1/8 of the second series from the top shown in the third line). For 1 < h < 2, also note the oscillations with frequency 2π/ sin(π/ h) (Eqs. 53 and A3): this is the fractional oscillation range. There are three simulations, each of length 2 19 pixels, and each uses the same random seed with the unit scale equal to 2 10 pixels (i.e. a resolution of τ = 2 −10 ). The entire simulation therefore covers the range of scale 1/1024 to 512 units. The fRm on the right is from the running sum of the fRn on the left. Starting at the top line of each set, we show 2 10 points of the original series degraded in resolution by a factor 2 9 . Since the length is t = 2 9 units long, each pixel has resolution τ = 1/2. The second line of each set takes the segment of the upper line lying to the right of the dashed vertical line, 1/8 of its length. It therefore spans t = 0 to t = 2 9 /8 = 2 6 , but resolution was taken as τ = 2 −4 , and hence it is still 2 10 pixels long. Since each pixel has a resolution of 2 −4 , the unit scale is 2 4 pixels long: this is shown in red in the second series from the top (middle set). The process of taking 1/8 and blowing up by a factor of 8 continues to the third line (length t = 2 3 , resolution τ = 2 −7 ), unit scale = 2 7 pixels (shown by the red arrows in the third series), until the bottom series which spans the range t = 0 to t = 1, and resolution τ = 2 −10 with unit scale 2 10 pixels (the whole series displayed). Each series was rescaled in the vertical so that its range between maximum and minimum was the same. The unit relaxation scales indicated by the red arrows mark the transition from small to large scales. Since the top series in each set has a unit scale of 2 (degraded), it is nearly a white noise (a, c, e) or (ordinary) Brownian motion (b, d, f). In contrast, the bottom series is exactly of length t = 1, so that it is close to the fGn and fBm limits (left and right) with the standard exponent H = h + 1/2. As indicated in the text, the second series from the top in the bottom set is most realistic for monthly temperature anomalies. Figure 8 shows simulations similar to Fig. 5a (fRn on the left, fRm on the right), except that instead of making a large simulation and then degrading and zooming, all the simulations were of equal length (2 10 points), but the relaxation scale was changed from 2 15 pixels (bottom) to 2 10 , 2 5 and 1 pixel (top). Again, the top is white noise (left) and Brownian motion (right), and the bottom is (nearly) fGn (left) and fBm (right); Fig. 9 shows the extensions to 1/2 < h < 2.  Fig. 6 but for h = 7/10, 13/10 and 19/10 (top to bottom). Over this range, the top (large-scale, degradedresolution) series is close to a white noise (a, c, e) and Brownian motion (b, d, f). For the bottom series, there is no equivalent fGn or fBm process and the curves become smoother, although the rescaling may hide this somewhat (see for example the middle h = 13/20 set, the blow-up of the far-right 1/8 of the second series from the top shown in the third line). Also note for the bottom two sets with 1 < h < 2 the oscillations that have frequency 2π/ sin(π/ h): this is the fractional oscillation range.

Prediction
The initial value for Weyl fractional differential equations is effectively at t = −∞, so that for fRn, it is not directly relevant at finite times (although the ensemble mean is assumed = 0; for fRm, the initial condition Q α,h (0) = 0 is important). The prediction problem is thus to use past data (say, for t < 0) in order to make the most skillful prediction for t > 0. We are therefore dealing with a past value rather than usual initial value problem. The emphasis on past values is particularly appropriate since in the fGn limit, the memory is so large that values of the series in the distant past are important. Indeed, prediction of fGn with a finite length of past data involves placing strong (mathematically singular) weight on the most ancient data available (see Gripenberg and Norros, 1996;Lovejoy, 2019, 2021a, b). This is quite different from standard stochastic predictions that are based on short-memory (exponential) auto-regressive or moving-average-type processes that are not much different from initial value problems.
To deal with the small-scale divergences when 0 < h + α ≤ 1/2, it is necessary to predict the finite-resolution fRn: Figure 8. This set of simulations is similar to Fig. 6 ((a, c, e) fRn, (b, d, f) fRm) except that instead of making a large simulation and then degrading and zooming, all the simulations were of equal length (2 10 points) but resolutions τ = 2 −15 , 2 −10 , 2 −5 , and 1 (bottom to top). The simulations therefore spanned the ranges of scale 2 −15 to 2 −5 , 2 −10 to 1, 2 −5 to 2 5 , and 1 to 2 10 , and the same random seed was used in each so that we can see how the structures slowly change when the relaxation scale changes. The bottom fRn, h = 5/10 set is the closest to that observed for the Earth's temperature, and since the relaxation scale is of the order of a few years, the second series from the top of this set (with 1 pixel = 1 month) is close to that of monthly global temperature anomaly series. In that case the relaxation scale would be 32 months and the entire series would be 2 10 /12 ≈ 85 years long. The top series (of total length 2 10 relaxation times) is (nearly) a white noise (a, c, e) and Brownian motion (b, d, f), and the bottom is (nearly) an fGn (a, c, e) and fBm (b, d, f). The total range of scales covered here (2 10 × 2 15 ) is larger than in Fig. 5a and allows one to more clearly distinguish the high-and low-frequency regimes. U α,h,τ (t). Using Eq. (40) for U α,h,τ (t), we have Now define the predictor for t ≥ 0 (indicated by a circumflex): To show that it is indeed the optimal predictor, consider the predictor error E τ (t): S. Lovejoy: Fractional relaxation noises, motions and the fractional energy balance equation Equation (64) shows that the error depends only on γ (v) for v > 0, whereas the predictor (Eq. 63) only depends on γ (v) for v < 0, and hence they are orthogonal: This is a sufficient condition for U α,h,τ (t) to be the minimum square predictor, which is the optimal predictor for stationary Gaussian processes (e.g. Papoulis, 1965). The prediction error variance is or, with a change in variables, where we have used U 2 α,h,τ = τ −2 V α,h (τ ) (the unconditional variance).
There are numerous skill indicators, but the most popular and easy-to-interpret definition of forecast skill is the minimum square skill score or MSSS (S k,τ , see Del Rio Amador and Lovejoy, 2021a, for a discussion of this and other indicators). For this, we obtain When h < 1/2 and Figure 10. The prediction skill (S k ) for pure fGn processes for forecast horizons up to λ = 10 steps (10 times the resolution). This plot is nondimensional, and it is valid for time steps of any duration. From bottom to top, the curves correspond to h = 1/20, 3/10, . . . 9/20 (red, top, close to the empirical h).
we obtain the fGn result: , where λ is the forecast horizon (lead time) measured in the number of time steps in the future (due to the fGn scaling, it is independent of the resolution τ ). The MSSS gives the fraction of the variance explained by the optimum predictor; when skill = 1, the forecast is perfect.
To survey the implications, let us start by showing the τ independent results for fGn, shown in Fig. 10, which is a variant on a plot published in . We see that when h ≈ 1/2 (H ≈ 1), the skill is very high; indeed, in the limit h → 1/2, we have perfect skill for fGn forecasts (this would of course require an infinite amount of past data to attain). Now consider the fRn skill: we will start by considering the pure (α = 0) fRn case where the memory comes completely from the (high-frequency) storage, anticipating that the fGn forced case (α = 0) obtains its memory and skill from both storage and forcing. In comparison with fGn, fRn has an extra parameter, the resolution of the data, τ . Figure 11 shows curves corresponding to Fig. 10 for fRn with forecast horizon integer multiples (λ) of τ , i.e. for times t = λτ in the future but with separate curves, one for each of five τ values increasing from 10 −4 to 10 by factors of 10. When τ is small, the results should be close to those of fGn, i.e. with potentially high skill, and in all cases, the skill is expected to vanish quite rapidly for τ > 1 since in this limit, fRn becomes an (unpredictable) white noise (although there are scaling corrections to this). Figure 11. Panels (a, c, e) show the skill (S k ) of pure (α = 0) fRn forecasts (as in Fig. 10 for fGn) for fRn skill with h = 1/20, 5/20, and 9/20 (top to bottom sets); λ is the forecast horizon, the number of steps of resolution τ forecast into the future. Panels (b, d, f) show the ratio (r) of the fRn to the corresponding fGn skill. Here the result depends on τ ; each curve is for different values increasing from 10 −4 (top, black) to 10 (bottom, purple) and increasing by a factor of 10 (the red set in the bottom plots with τ = 10 −2 ; h = 9/20 are closest to the empirical values).
To better understand the fGn limit, it is helpful to plot the ratio of the fRn-to-fGn skill (Fig. 11, right column). We see even with quite small values τ = 10 −4 (top, black curves) that some skill has already been lost. Figure 12 shows this more clearly: it shows 1-time-step and 10-time-step skill ratios. To put this into perspective, it is helpful to compare this using some of the parameters relevant to macroweather forecasting. According to  and Del Rio Amador and Lovejoy (2019), the relevant empirical Haar exponent is ≈ −0.1 for the global temperature, so that h = 1/2 − 0.1 ≈ 0.4. Although direct empirical estimates of the relaxation time are difficult since the responses to anthropogenic forcing begin to dominate over the internal variability after ≈ 10 years, Procyk et al. (2022) have used the deterministic response to estimate a global relaxation time of ≈ 5 years (work in progress using maximum likelihood estimates shows that at scales of hundreds of kilometres, it is quite variable, ranging from months to decades; Procyk, 2021). For monthly-resolution forecasts, the nondimensional resolution is τ ≈ 1/100. With these values, we see (red curves) that we may have lost ≈ 30 % of the fGn skill for 1-month forecasts and ≈ 85 % for 10-month forecasts. Comparing this with Fig. 10, we see that this implies about 60 % and 10 % skill (see also the red curve in Fig. 11, bottom set). Figure 12. The ratio of (α = 0) fRn skill to fGn skill (a: 1-step horizon, b: 10-step forecast horizon) as a function of resolution τ for h increasing from (at left) bottom to top (h = 1/20, 2/20, 3/20 . . . 9/20); the h = 9/20 curve (close to the empirical value) is the curve that starts at the upper left of each plot. Figure 13. The 1-step (a) and 10-step (b) pure (α = 0) fRn forecast skill as a function of h for various resolutions (τ ) ranging from τ = 10 −4 (black, left of each set) through τ = 10 −3 (brown) 10 −2 (red), 0.1 (blue), 1 (orange), and 10 (purple). In the right set τ = 1 (orange), 10 (purple) lines are nearly on top of the S k = 0 line. Again, red (τ = 10 −2 ) is the more empirically relevant value for monthly data. Recall that the regime h < 1/2 (to the left of the vertical dashed lines) corresponds to the overlap with fGn.
Going beyond the 0 < h < 1/2 region that overlaps fGn, Figs. 12 and 13 clearly show that the skill continues to increase with h. We already saw (Fig. 4) that the range 1/2 < h < 3/2 has rms Haar fluctuations that for t < 0 mimic fBm, and these do indeed have higher skill, approaching unity for h near 1 corresponding to a Haar exponent ≈ 1/2, i.e. close to an fBm with H = 1/2, i.e. a regular Brownian motion. Recall that for Brownian motion, the increments are unpredictable but the process itself is predictable (persistence). In Fig. 12, we show the skill for various h's as a function of resolution τ . Figure 14 shows that for h < 3/2, the skill decreases rapidly for τ > 1. Figure 15 in the fractional oscillation equation regime shows that the skill oscillates.
We may now consider the skill of the fGn-forced process (α = 0) in Fig. 16. For small τ , short lags, λ (the upper left), the contours are fairly linear along lines of constant h + α, so that, as expected, the predictability is essentially that of an fGn process but with an effective exponent h + α. At the opposite extreme, (large τ , h) the lines are fairly horizontal, Figure 14. One-step pure (α = 0) fRn prediction skills as a function of resolution for hs increasing from 1/20 (bottom) to 29/20 (top) every 1/10. Note the rapid transition to low skill (white noise) for τ > 1. The curve for h = 9/20 is shown in red. indicating that the skill from the storage (i.e. from h) is negligible and that all the memory (and hence skill) comes from the forcing fGn, exponent α. The in-between resolutions and lags generally have in-between slopes. As expected, the skill from the storage drops off quickly for resolutions ≈> τ . For h ≥ 1, there is some waviness in the contours due to the oscillatory nature of the Green functions. Figure 16. Contour plots of the forecast skill, with h along the horizontal axis and α along the vertical axis. The plots are for increasing nondimensional resolutions: τ = 0.001, 0.01, 0.1, 1, and 10 (top to bottom), with forecasts for lags λ = 1, 3, and 10 (left to right) and with contour levels (legend) varying from nearly no skill (0.03) to nearly full skill (0.98).

Conclusions
Ever since Budyko (1969) and Sellers (1969), the energy balance between the Earth and outer space has been modelled by the energy balance equation (EBE) based on the continuum heat equation; see North and Kim (2017) for a recent review and see Ziegler and Rehfeld (2020) for a recent regional application. It is most commonly used as a model for the globally averaged temperature, where it is usually derived by applying Newton's law of cooling applied to a uniform slab of material, a "box". The resulting EBE is a first-order relaxation equation describing the exponential relaxation of the temperature to a new equilibrium after it has been perturbed by an external forcing. Its first-order (h = 1) derivative term accounts for energy storage.
The resulting model relaxes to equilibrium much too quickly, so that to increase realism, it is usual to introduce a few interacting slabs (representing for example the atmosphere and ocean mixed layer; the Intergovernmental Panel on Climate Change recommends two such components; IPCC, 2013). However, it turns out that these h = 1 box models do not use the correct surface radiativeconductive boundary conditions. If one assumes heat trans-port by the classical heat equation and radiative-conductive boundary conditions are used instead, one instead obtains the half-order EBE, the HEBE with h = 1/2 (Lovejoy, 2021a, b), which is already close to the global empirical value (h = 0.38 ± 0.03, Procyk et al., 2022;Del Rio Amador and Lovejoy, 2019; see also . However, this model is only valid in the macroweather regime -for timescales of weeks and longer and, due to the spatial scaling in the atmosphere, the fractional heat equation (FHE) may be a more appropriate model than the classical one. The use of the FHE can be justified by recognizing that a realistic energy transport model involves a continuous hierarchy of mechanisms. The extension to the FHE leads directly to a fractional relaxation equation that generalizes the EBE: the fractional energy balance equation (Lovejoy, 2021a, b) (FEBE). The FEBE can also be derived phenomenologically by assuming that energy storage processes are scaling (Lovejoy, 2019;Lovejoy et al., 2021).
When forced by a Gaussian white noise, the FEBE is also a generalization of fractional Gaussian noise (fGn), and its integral (fractional relaxation motion, fRm) generalizes fractional Brownian motion (fBm). More classically, it generalizes the Orenstein-Uhlenbeck process that corresponds to the h = 1 special case (i.e. the standard EBE with white noise forcing). Over the parameter range 0 < h < 1/2, the highfrequency FEBE limit (fGn) has been used as the basis of monthly and seasonal temperature forecasts Lovejoy, 2019, 2021a, b); at 1month lead times, these macroweather forecasts are similar in skill to conventional numerical models, whereas for bimonthly, seasonal and annual forecasts, they are more skillful (Del Rio Amador and Lovejoy, 2021a). For multidecadal timescales the low-frequency limit has been used as the basis of climate projections through to the year 2100 (Hébert, 2017;Lovejoy et al., 2017;Hébert et al., 2021), and more recently, the full FEBE has been used directly (Procyk et al., 2020(Procyk et al., , 2022Procyk, 2021).
It was the success of predictions and projections with different exponents but the same theoretically derived empirical underlying FEBE h ≈ 0.4 that, over recent years, motivated the development of the FEBE (announced in Lovejoy, 2019) and the work reported here. The statistical characterizations, correlations, structure functions, Haar fluctuations and spectra as well as the predictability properties are important for these and other FEBE applications and are derived in this paper.
While the deterministic fractional relaxation equation is classical, various technical difficulties arise when it is generalized to the stochastic case: in the physics literature, it is a fractional Langevin equation (FLE) that has almost exclusively been considered a model of diffusion of particles starting at an origin. This requires t = 0 initial conditions that imply that the solutions are strongly nonstationary. In comparison, the Earth's temperature fluctuations that are associated with its internal variability are statistically stationary. This can easily be modelled with initial conditions at t = −∞, i.e. by using Weyl fractional derivatives. In addition, in the usual FLE, the highest-order derivative is an integer, so that sample processes are rms differentiable of order at least 1 (Watkins et al., 2020, have called the FEBE a "Fractionally Integrated FLE"). In the FEBE and the fractionally integrated extensions, the highest-order derivative is readily of order < 1/2, so that sample processes are generalized functions ("noises") and must be smoothed/averaged for physical applications.
Although EBEs were originally developed to understand the deterministic temperature response to external forcing, the temperature also responds to stochastic "internal" forcing. While the Earth's system variability is generally highly non-Gaussian (multifractal, Lovejoy, 2018), the temporal macroweather regime modelled here is the quasi-Gaussian exception. This paper therefore explores the statistics of the temperature response when it is stochastically forced by Gaussian processes, both by white noise (α = 0) and by a (long-memory) fractional Gaussian noise (fGn) process. The white noise special case -"pure fRn and fRm" -is the α = 0 special case; the fGn-forced case extends the parameter range to 0 ≤ α < 1/2. According to work in progress using satellite and reanalysis radiances, both cases appear to be empirically relevant for modelling the Earth's energy balance.
A key novelty is therefore to consider the fractional relaxation equation (a FLE) forced by white and scaling noises starting from t = −∞, equivalent to Weyl's "fractionally integrated fractional relaxation equation". In addition, the highest-order terms in standard FLEs are integer-ordered: the fractional terms represent damping and are of lower order, guaranteeing that solutions are regular functions. However, the FEBE's highest-order term is fractional, and over the main empirically significant parameter range (α + h < 1/2) the processes are noises (generalized functions): in order to represent physical processes, they must be averaged. This is conveniently handled by introducing their integrals or "motions". We proceeded to derive their fundamental statistical properties, including series expansions about the origin and infinity. These expansions are nontrivial since they mix fractional and integer-ordered terms (Appendix A). Since the FEBE is used as the basis for macroweather predictions, the theoretical predictability skill is important in applications and was also derived.
With these stationary Gaussian forcings, the solutions are a new stationary process -fRn (α = 0) and its extensions to fractionally integrated fRn processes (α > 0). Over the range 0 < α + h < 1/2, we show that the small-scale limit is an fGn, and its integral -fRm -has stationary increments and generalizes fBm. Although at long enough times the fRn (α = 0) tends to a Gaussian white noise and fRm to a standard Brownian motion, this long time convergence is typically very slow (when α > 0, the long time behaviours are fGn and fBm processes, parameter α).
Much of the effort was in deducing the asymptotic smalland large-scale behaviours of the autocorrelation functions that determine the statistics and in verifying these with extensive numerical simulations. An interesting exception was the h = 1/2 special case, which for fGn corresponds to an exactly 1/f noise. Here, we give the exact mathematical expressions for the full correlation functions, showing that they had logarithmic dependencies at both small and large scales. The resulting HEBE has an exceptionally slow transition from small to large scales (a factor of a million or more is needed), and empirically it is quite close to the global temperature series over scales of months, decades and possibly longer.
Beyond improved monthly and seasonal temperature forecasts and multidecadal projections, the stochastic FEBE opens up several paths for future research. One of the more promising is to apply these techniques to the spatial FEBE and generalize it in various directions. This is a follow-up on the special value h = 1/2 that is very close to that found empirically and that can be analytically deduced from the classical Budyko-Sellers energy transport equation by improving the mathematical treatment of the radiative boundary conditions (Lovejoy, 2021a, b). In the latter case, one obtains a partial fractional differential equation for the horizontal space-time variability of temperature anomalies over the Earth's surface, allowing regional forecasts and projections. This has already allowed improved regional projections (Procyk, 2021) and promises better monthly and seasonal forecasts.
While the FEBE has already demonstrated its ability to project future climates, these improvements will allow for the modelling of the nonlinear albedo-temperature feedbacks needed for modelling of transitions between different past climates. Finally, FEBE-based projections have shown that, in spite of improved computer power and algorithms, conventional GCM approaches may be suffering from diminishing returns; the GCMs in the latest IPCC assessment (AR6, 2021) are even more uncertain: a range of 2-5.5 K/CO 2 doubling (90 % confidence) those in the previous assessment (AR5, 2013, 1.5-4.5 K per doubling) while also being somewhat warmer. The FEBE had the somewhat lower but much less uncertain range 1.6-2.4 K/CO 2 doubling (90 % confidence). Conventional GCM approaches attempt to explicitly model as many degrees of freedom as possible, and by the year 2030 they are expected to have kilometric-scale ("cloudresolving") resolutions that will model structures that live for only 15 min and then average them over decades. The FEBE (with regional and other future extensions) is, in contrast, a high-level stochastic model that accounts for the collective interactions of huge numbers of degrees of freedom (Lovejoy, 2019). It is thus a promising candidate for a new generation of climate models.
Appendix A: The small-and large-scale fRn and fRm statistics A1 R α,h (t) as a Laplace transform In Sect. 2.4, we derived general statistical formulae for the autocorrelation functions of motions and noises defined in terms of Green's functions of fractional operators. Since the processes are Gaussian, autocorrelations fully determine the statistics. While the autocorrelations of fBm and fGn are well known, those for fRm and fRn are new and are not so easy to deal with since they involve quadratic integrals of Mittag-Leffler functions. In this Appendix, we derive the basic power law expansions as well as large t (asymptotic) expansions, and we numerically investigate their accuracy.
It is simplest to start with the Fourier expression for the autocorrelation function for the unit white noise forcing (Eq. 33). First convert the inverse Fourier transform (Eq. 66) into a Laplace transform. For this, consider the integral over the contour C in the complex plane: Take C to be the closed contour obtained by integrating along the imaginary axis (this part gives R α,h (t), Eq. 33) and closing the contour along an (infinite) semicircle over the second and third quadrants. When 0 < h < 1, there are no poles in these quadrants, but we must integrate around a branch cut on the negative real axis. When 1 < h < 2, we must take into account two new branch cuts and two new poles in the negative real half-plane. In a polar representation z = re iθ , the additional branch cuts are along the rays z = re ±iπ/ h , r > 1, circling around the poles at z = e ±iπ/ h . The additional branch cuts give no net contribution, but the residues of the poles do make a contribution (P α,h = 0 below). We can express both cases with the formula "Im" indicates the imaginary part and P α,h,± (t) = 0, 0 < h < 1, While the integral term is monotonic, the P α,h term oscillates with frequency ω = 2π/ sin(π/ h). P α,h accounts for the oscillations visible in Figs. 3, 4, and 7, although since when 1 < h < 2, cos(π/ h) < 1, they decay exponentially. When h > 1, this pole contribution dominates R α,h (t) for a wide range of t values around t = 1, although as we see below, eventually at large t, power law terms come to the fore.

A2 Asymptotic expansions
An advantage of writing R α,h (t) as a Laplace transform is that we can use Watson's lemma to obtain an asymptotic expansion (e.g. Bender and Orszag, 1978). The idea is that an expansion of Eq. (A2) around x = 0 can be Laplacetransformed term by term to yield an asymptotic expansion for large t.
The first terms are explicitly We see that when α = 0, D 0 > 0, so that, as expected, the leading behaviour has no h dependence: it is only due to the long-range correlations in the forcing. We obtain the fGn result t 2α−1 . However, for the pure fRn case, α = 0 and D 0 = 0, so that we obtain i.e. the leading behaviour is t −(1+h) . Note that the leading n = 1 coefficient reduces to −1/ (−h) and that for 0 < h < 1, (−h) < 0. For the motions (fRm), we need the expansion of V α,h (t); this can be obtained by integrating R α,h twice (using Eq. 36): where P a,h− is from the poles when 1 < h < 2. Since the asymptotic expansion is not valid for t = 0, we used the indefinite integrals of R α,h , and hence there is a linear a α,h t + b α,h term from the constants of integration. However, when α > 0, the leading term is the t 2α+1 term from the fGn forcing, and in the pure fRn case (α = 0), we can take lim α→0 (−2D 0 (−1−2α)t 2α+1 ) = t so that the leading term n = 0 already gives the correct fRm behaviour: V α,h (t) ≈ t, so that a 0,h = 0 (b 0,h can be determined numerically).

A3 Power series expansions about the origin
For many applications, one is interested in the behaviour of R α,h (t) for scales of months, which is typically less than the relaxation time, i.e. t < 1. It is therefore important to understand the small t behaviour. We again consider the Laplace integral for the 0 < h < 1 case. In this case, we can divide the range of integration in Eq. (A2) into two parts for 0 < x < 1 and x > 1. For the former, we use the expansion in Eq. (A4) and, for the latter, S. Lovejoy: Fractional relaxation noises, motions and the fractional energy balance equation We can now integrate each term separately using where E β (t) = ∞ 1 e −xt x −β dx is the exponential integral. Adding the two integrals and summing over n, we obtain (we have interchanged the order of summations and used D n from Eq. A5 with n > 0). The series for the coefficient F j can now be summed analytically. Although the sum is a special case of the Lipchitz summation and Poisson summation formulae, the easiest method is to use the Sommerfeld-Watson transformation (e.g. Mathews and Walker, 1973) that converts an infinite sum into a contour integral that is then deformed. The Sommerfeld-Watson transformation states that for an analytic function f (z) that goes to zero at least as fast as |z| −1 , where z k is the location of the poles of f (z) and R k is the residue of the corresponding pole. In the above, take There is a single pole at z 1 = −a, and the residue is R 1 = e −iaπ h ; therefore, The second sum needed in F j can be obtained using h = 0 in the above, so that, overall, If j is even, then the term in the square bracket is pure real, hence F j vanishes. Otherwise Note that F 1 > 0 for h+α > 1/2 (with 0 ≤ α < 1/2, 0 ≤ h < 2), whereas for h + α < 1/2 it is quite complicated (see below).

Comments
1. These and the following formulae are for t > 0; in addition, only the even integer-ordered terms are non-zero (the sum over odd j ).
2. Each integer term of the expansion F j is itself obtained as an infinite sum, so that the overall result for R α,h (t) is effectively a doubly infinite sum. This procedure swaps the order of the summation and apparently explains the fact that, while the expansions were derived for the case 0 < h < 1, the final expansion is valid for 0 ≤ α < 1/2 and the full range 0 < h < 2: numerically, it accurately reproduces the oscillations when h > 1.
3. The fGn correlation function is given by the single n = 2 term: It is also proportional to the correlation function of the fGn-forced h = 0; fRn process: R (fGn) h (t) = 4R α=h,0 (t).
5. The F j terms diverge when (j − 2α)/ h is an integer. For example, if α = 0, the overall sum over all j thus diverges for all rational h. For irrational h, the convergence properties are not easy to establish, although due to the functions, these series apparently converge for all t ≥ 0, but the convergence is rather slow. Figure A1 shows some numerical results for α = 0 showing the convergence of the 10th-order fractional 10th-order integer power approximation (n max = j max = 10). Since the leading (fGn) term diverges for small t, when h ≤ 1/2, it is more useful to consider the convergence of the difference with respect to the fGn term, i.e. R (fGn) h (t) − R 0,h,a (t), where the approximation R 0,h,a (t) is from the sum from n = 3 to 10 and odd j ≤ 9. Figure A1 shows the logarithm of the ratio of the approximation with respect to the true value: r = log 10 1− R (to avoid exact rationals, 10 −4 was added to the h values). From the figure we see that the approximation is satisfactory except for small h. In the next section we return to this.
6. For α + h > 1/2, when t = 0, the only nonzero term is from the constant F 1 : R α,h (0) = F 1 . This gives the normalization constant. Comparing with Eq. (27), we therefore have Similarly, when α + h > 3/2, for the quadratic the squared integral of G α,h is finite, and it gives the coefficient of the t 2 term, so that 7. The expression for V α,h (t) can be obtained by integrating twice (Eq. 36).
8. In the special cases h = 1/m, with m a positive integer, F j is independent of j , and the integer-powered series can be summed, yielding a result proportional to cosh t. However, this large t divergence is cancelled out by the fractional term, and the result is finite (this partial cancellation is discussed in the next subsection). The special important case h = 1/2 is dealt with in Appendix B. Figure A1. This shows the logarithm of the relative error in the R (10,10) 0,h (t) approximation (i.e. with 10 fractional terms and 10 integer-ordered terms) with respect to the deviation from the fGn R 0,h (t) r = log 10 1 − R fGn The lines are for h = 2/10, 4/10, . . . , 16/10, 18/10 (excluding the exponential case h = 1), from left to right (note that convergence is only for irrational h, and therefore an extra 10 −4 was added to each h). For the low h values the convergence is particularly slow.

A4 A convenient approximation
The expansion for R α,h is the sum of a fractional-ordered and an integer-ordered series. Partial sums appear to converge (Fig. A1), albeit slowly. For simplicity, we consider the case of primary interest, a pure fRn process (α = 0). Examination of partial sums shows that the integer-ordered and fractionalordered terms tend to cancel, the difficulty being due to the coefficient of the integer-ordered terms j ≈ hn + 2α that comes from the exponential integral and that can be large when j ≈ hn + 2α. This suggests an alternative way of expressing the series: where D n is given by Eq. (A5) and the n sums start at n = 2 since D 1 = 0. C j can be expressed as where is the Hurwitz-Lerch phi function (z, s, a) = ∞ n=0 z n (n + a) −s .

116
S. Lovejoy: Fractional relaxation noises, motions and the fractional energy balance equation We can also expand the exponential integral: For the j max and n max partial sums, we have Now define the (j max , n max ) approximation by This has the effect of adding half the next-highest n term and is more accurate; overall, j max and n max may now be taken to be much smaller than in the previous approximation. For example, putting n max = 2, j max = 1, we get with the partial sum where , D 2 = sin(π h) π , D 3 = − sin(π h)(1 + 2 cos(π h)) π . (A27) To understand the behaviour, Fig. A2 shows the behaviour of the coefficient of the t −1+3h term D 3 2 (1 − 3h), the constant term F 1 and the coefficient of the next integer (linear in t) term .
Up until the end of the fGn region (h = 1/2), the t −1+3h and F 1 terms have opposite signs and tend to cancel. In addition, we see that for t ≈< 1 and h < 1, they dominate over the (omitted) linear term. Figure A3 shows that the R 0,h,2,1 approximation is surprisingly good for h < 1 and is still not so bad for 1 < h < 2. This approximation is thus useful for monthly-resolution macroweather temperature fields that have relaxation times of years or longer and where h is mostly over the range 0 < h < 1/2 but over some tropical ocean regions can increase to as much as h ≈ 1.2 (Del Rio Amador and Lovejoy, 2021a). Figure A3 shows that the (2, 1) approximation is reasonably accurate for t ≈< 1, especially for h < 1. Figure A2. The solid line is the constant term F 1 , the long dashes are the coefficients D 3 2 (1 − 3h) of the fractional power, and the short dashes are the coefficients of the linear term: F 2 = C 2 + D 2 2h−2 + D 3 2(3h−2) . We can see that the contribution of the linear term (used in the R 0,h,2,2 (t) approximation) for h < 1 and t < 1 is fairly small, whereas for 1 < h < 2, it is larger and the R 0,h,2,2 (t) approximation is significantly better than the R 0,h,2,1 (t) approximation (see Fig. A3). Figure A3. This shows the logarithm of the relative error in the (2, 1) approximation with respect to the deviation from the fGn R h (t) (r = log 10 1 − R fGn h (t) − R 0,h,2,1 (t) R fGn h (t) − R 0,h (t) ). For h < 1, t < 0 is of the order ≈ 30 %, whereas for h > 1, it of the order 100 %. The h = 1 (exponential) curve is not shown, although when t < 0, the error is of order 60 %.
Appendix B: The h = 1/2 special case When α = 0, h = 1/2, and the high-frequency fGn limit is an exact "1/f noise" (spectrum ω −1 ); it has both high-and low-frequency divergences. The high-frequency divergence can be tamed by averaging, but not the low-frequency divergence, so that fGn is only defined for h < 1/2. However, for fRn, the low frequencies are convergent over the whole range 0 < h < 2, and for h = 1/2 we find that the correlation function has a logarithmic dependence at both small and large scales. This is associated with particularly slow transitions from high-to low-frequency behaviours. The critical value h = 1/2 corresponds to the HEBE (Lovejoy, 2021a, b), where it was shown that the value h = 1/2 could be derived analytically from the classical Budyko-Sellers energy balance equation. Therefore, R α,1/2 (t) and V α,1/2 (t) characterize the statistics of the temperature response of the classical heat equation response to an fGn-order α forcing.
It is possible to obtain exact analytic expressions for R α,1/2 (t), V α,1/2 (t) and the Haar fluctuations; we develop these in this Appendix; for some early results, see Mainardi and Pironi (1996).
(B3) Figure B1 shows plots R α,1/2 (t) over 8 orders of magnitude in t, indicating the generally very slow convergence to the asymptotic behaviour (shown as straight lines on the right). Figure B1 also shows the singular small t behaviour of the pure fRn case (α = 0). In this limit both L 1 and L 2 are singular -they both yield logarithmic small-scale divergences. Pure fRn is of special interest and yields the somewhat sim- Figure B1. R α,1/2 for α increasing from 0 (pure fRn) to 8/10 in steps of 1/10 (on the right: bottom to top). The α = 0 curve has a logarithmic divergence at small t (far left). Recall from the section that at large t, R 0,1/2 ≈ t −3/2 and that for α > 0, R α,1/2 ≈ t 2α−1 ; for α = 0, 1/5, and 2/5 the theoretical asymptotes of the leading terms are indicated for reference. pler result: We can use these results to obtain small and large t expansions: where γ E is Euler's constant = 0.57. . . (the asymptotic formula can be obtained as a special case of Eq. A8 in Appendix A, but not the logarithmic small-scale divergence).
To obtain the corresponding results for V 0,1/2 , use V 0,1/2 (t) = 2 t 0   v 0 R 0,1/2 (u)du   dv . Figure B2 shows numerical results for α = 0 and h = 1/2. The transition between small and large t behaviour is extremely slow; the nine orders of magnitude depicted in the figure are barely enough. The extreme low (R 1/2 ) 1/2 (dashed) asymptotes on the left to a 0 slope (a square root logarithmic limit, Eq. B8) and to a −3/4 slope on the right. The rms Haar fluctuation (black) changes slope from H = 0 to −1/2 (left to right). Figure B2 also shows the logarithmic derivative of the rms Haar (black) compared to a regression estimate over 2 orders of magnitude in scale (dashed; a factor of 10 smaller and 10 larger than the indicated scale was used; this represents a possibly empirically accessible range). This figure underlines the gradualness of the transition from H = 0 to H = −1/2. If empirical data were available only over a factor of 100 in scale, depending on where this scale was with respect to the relaxation timescale (unity in the plot), the rms Haar fluctuations could have any slope in the range 0 to −1/2, with only small deviations. Figure B2. The logarithmic derivative of the rms Haar fluctuations of U 0,1/2 (solid) in Fig. B1 compared to a regression estimate over 2 orders of magnitude in scale (dashed; factors of 10 smaller and 10 larger than the indicated scale were used). This plot underlines the gradualness of the transition from slopes 0 to −0.5 corresponding to apparent H = 0 to H = −1/2 scaling. Over a range of 100 or so in scale there is approximate scaling but with exponents that depend on the range of scales covered by the data. If data were available only over a factor of 100 in scale, the rms Haar fluctuations could have any slope in the fGn range 0 to −1/2, with only small deviations.
Data availability. No data sets were used in this article.
Competing interests. The author is a member of the editorial board of Nonlinear Processes in Geophysics. The peer-review process was guided by an independent editor, and the author also has no other competing interests to declare.
Disclaimer. Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Special issue statement. This article is part of the special issue "Centennial issue on nonlinear geophysics: accomplishments of the past, challenges of the future". It is not associated with a conference. fer Rypdal. We thank anonymous referees for suggestions, including the fifth referee for encouraging comments on the Fourier approach. This work was unfunded, and there were no conflicts of interest.
Review statement. This paper was edited by Daniel Schertzer and reviewed by five anonymous referees.