<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0"><?xmltex \makeatother\@nolinetrue\makeatletter?>
  <front>
    <journal-meta><journal-id journal-id-type="publisher">NPG</journal-id><journal-title-group>
    <journal-title>Nonlinear Processes in Geophysics</journal-title>
    <abbrev-journal-title abbrev-type="publisher">NPG</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Nonlin. Processes Geophys.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1607-7946</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/npg-27-23-2020</article-id><title-group><article-title>Remember the past: a comparison of time-adaptive training schemes for non-homogeneous regression</article-title><alt-title>A comparison of time-adaptive training schemes</alt-title>
      </title-group><?xmltex \runningtitle{A comparison of time-adaptive training schemes}?><?xmltex \runningauthor{M. N. Lang et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1 aff2">
          <name><surname>Lang</surname><given-names>Moritz N.</given-names></name>
          <email>moritz.lang@uibk.ac.at</email>
        <ext-link>https://orcid.org/0000-0002-2533-9903</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Lerch</surname><given-names>Sebastian</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-3467-4375</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Mayr</surname><given-names>Georg J.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-6661-9453</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Simon</surname><given-names>Thorsten</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-3778-7738</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff4">
          <name><surname>Stauffer</surname><given-names>Reto</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-3798-5507</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Zeileis</surname><given-names>Achim</given-names></name>
          
        <ext-link>https://orcid.org/0000-0003-0918-3766</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Department of Statistics, Universität Innsbruck, Innsbruck, Austria</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Department of Atmospheric and Cryospheric Sciences, Universität Innsbruck, Innsbruck, Austria</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Institute for Stochastics, Karlsruher Institut für Technologie, Karlsruhe, Germany</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>Digital Science Center, Universität Innsbruck, Innsbruck, Austria</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Moritz N. Lang (moritz.lang@uibk.ac.at)</corresp></author-notes><pub-date><day>5</day><month>February</month><year>2020</year></pub-date>
      
      <volume>27</volume>
      <issue>1</issue>
      <fpage>23</fpage><lpage>34</lpage>
      <history>
        <date date-type="received"><day>27</day><month>September</month><year>2019</year></date>
           <date date-type="rev-request"><day>2</day><month>October</month><year>2019</year></date>
           <date date-type="rev-recd"><day>10</day><month>December</month><year>2019</year></date>
           <date date-type="accepted"><day>6</day><month>January</month><year>2020</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2020 Moritz N. Lang et al.</copyright-statement>
        <copyright-year>2020</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://npg.copernicus.org/articles/27/23/2020/npg-27-23-2020.html">This article is available from https://npg.copernicus.org/articles/27/23/2020/npg-27-23-2020.html</self-uri><self-uri xlink:href="https://npg.copernicus.org/articles/27/23/2020/npg-27-23-2020.pdf">The full text article is available as a PDF file from https://npg.copernicus.org/articles/27/23/2020/npg-27-23-2020.pdf</self-uri>
      <abstract><title>Abstract</title>
    <p id="d1e146">Non-homogeneous regression is a frequently used post-processing method for
increasing the predictive skill of probabilistic ensemble weather
forecasts. To adjust for seasonally varying error characteristics between
ensemble forecasts and corresponding observations, different time-adaptive
training schemes, including the classical sliding training window, have
been developed for non-homogeneous regression. This study compares three
such training approaches with the sliding-window approach for the
application of post-processing near-surface air temperature forecasts
across central Europe. The predictive performance is evaluated conditional
on three different groups of stations located in plains, in mountain
foreland, and within mountainous terrain, as well as on a specific change
in the ensemble forecast system of the European Centre for Medium-Range
Weather Forecasts (ECMWF) used as input for the post-processing.</p>
    <p id="d1e149">The results show that time-adaptive training schemes using data over multiple
years stabilize the temporal evolution of the coefficient estimates, yielding
an increased predictive performance for all station types tested compared to
the classical sliding-window approach based on the most recent days only. While
this may not be surprising under fully stable model conditions, it is shown
that “remembering the past” from multiple years of training data is typically
also superior to the classical sliding-window approach when the ensemble prediction
system is affected by certain model changes. Thus, reducing the variance of the
non-homogeneous regression estimates due to increased training data appears to
be more important than reducing its bias by adapting rapidly to the most
current training data only.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e161">The need for accurate probabilistic weather forecasts is steadily increasing,
because reliable information about the expected uncertainty is crucial for
optimal risk assessment in agriculture and industry or for personal planning
of outdoor activities. Therefore, most forecast centers nowadays issue
probabilistic forecasts based on ensemble prediction systems (EPSs). To
quantify the uncertainty of a specific forecast, an EPS provides a set of
numerical weather predictions using slightly perturbed initial conditions and
different model parameterizations <xref ref-type="bibr" rid="bib1.bibx21" id="paren.1"/>. However, due to various
constraints and required simplifications in the EPS, these forecasts often show
systematic biases and capture only parts of the expected uncertainty,
especially when EPS forecasts are directly compared to point measurements
<xref ref-type="bibr" rid="bib1.bibx5" id="paren.2"/>. In order to increase the predictive skill of
the forecasts for specific locations, statistical post-processing is often
applied to correct for these systematic errors in the forecasts' expectation
and uncertainty.</p>
      <p id="d1e170">One of the most frequently used parametric post-processing methods is “ensemble
model output statistics” (EMOS) introduced by <xref ref-type="bibr" rid="bib1.bibx7" id="text.3"/>. To
emphasize that not only the errors in the mean but also the errors in the
uncertainty are corrected, the method is often referred to as “non-homogeneous
regression” (NR). In the statistical literature, this type of model is also
known as distributional regression <xref ref-type="bibr" rid="bib1.bibx15" id="paren.4"/> since all parameters
of a specific response distribution are optimized simultaneously conditional on
respective sets of covariates.</p>
      <?pagebreak page24?><p id="d1e179">As the error characteristics between the covariates, typically provided by the
EPS, and the observations often show seasonal dependencies and might change
inter-annually over time, different time-adaptive training schemes have been
developed for NR models. <xref ref-type="bibr" rid="bib1.bibx7" id="text.5"/> proposed the so-called
“sliding training window” approach where the training data set consists of EPS
forecasts and observations of the most recent 30–60 d only. As soon as new
data become available, the training data set and the statistical model are
updated so that the estimated coefficients automatically evolve over time and
adjust to changing error characteristics. This makes it very handy for
operational use; however, little training data can sometimes yield unrealistic
jumps in the estimated coefficients over time, especially if events which show
a significantly different error characteristic enter the training data set.
Therefore, to stabilize the temporal variability of the coefficient estimates,
several approaches have been proposed in the literature. <xref ref-type="bibr" rid="bib1.bibx26" id="text.6"/>
regularizes the estimation by only allowing the optimizer to slightly adjust
the coefficient from day to day. In an alternative approach,
<xref ref-type="bibr" rid="bib1.bibx19" id="text.7"/> extend the training data by using not only the days
prior to estimation, but also the days centered around the same calendar day
over all previous years available. This idea of using a rolling centered
training data set over multiple years is similar to the concept of using annual
cyclic smooth functions to capture seasonality as employed by
<xref ref-type="bibr" rid="bib1.bibx16" id="text.8"/>. These smooth functions are also known as regression
splines <xref ref-type="bibr" rid="bib1.bibx32" id="paren.9"/>, where the estimate of each point in the function
only depends on data in its closer neighborhood; this allows for a smooth and
stable evolution of the coefficients over the year.</p>
      <p id="d1e197">Alternative time-adaptive models are based on historical analogs or
non-parametric approaches. For approaches employing analogs
<xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx2" id="paren.10"/>, training sets are selected to consist
of past forecast cases with atmospheric conditions similar to those on the day
of interest. Such methods may lead to models that are able to account for the
flow dependency of EPS errors <xref ref-type="bibr" rid="bib1.bibx22 bib1.bibx25" id="paren.11"/>.
However, the definition and computation of similarity measures are far from
straightforward, and substantial methodological developments may be required to
obtain suitably extensive training data sets for stable model estimation
<xref ref-type="bibr" rid="bib1.bibx9 bib1.bibx17" id="paren.12"/>. For non-parametric approaches
<xref ref-type="bibr" rid="bib1.bibx28 bib1.bibx12" id="paren.13"/> or semi-parametric approaches
<xref ref-type="bibr" rid="bib1.bibx23 bib1.bibx27" id="paren.14"/>, time-adaptive choices of the
training data are typically abandoned as well, as interactions between the day
of the year and other covariates can capture the potential time adaptiveness.
Therefore, analog-based and non-parametric approaches will not be pursued
further in the context of this work.</p>
      <p id="d1e216">In addition to the training scheme employed, an important data-specific aspect
which has to be considered in post-processing is that the EPS may change over
time <xref ref-type="bibr" rid="bib1.bibx8" id="paren.15"/>. This also motivates the recent study of
<xref ref-type="bibr" rid="bib1.bibx3" id="text.16"/>, which introduces the promising concept of a
post-processing method specifically dealing with model changes in a simplified
physical setup. However, as stated by the authors, more research would be
required to transfer their findings to real case scenarios. When using data of
an operational EPS, changes in the underlying numerical model, e.g., an
increased horizontal resolution, can typically lead to sudden transitions in
the predictive performance of the EPS and hence affect the error
characteristics of the data. If the training data set used to estimate the
statistical post-processing model contains data of a previous EPS version which
significantly differs from the current one, it can result in a loss of the
predictive performance.</p>
      <p id="d1e225">This paper presents a comparison of four widely used different time-adaptive
training schemes proposed in the literature that employ alternative strategies
to account for varying error characteristics in the data. To show a wide
spectrum of possible approaches in a unified setup – rather than finding the
universally best method – we consider typical basic applications of these
training schemes and refrain from more elaborate tuning or combinations. A
case study is shown for post-processed <inline-formula><mml:math id="M1" display="inline"><mml:mn mathvariant="normal">2</mml:mn></mml:math></inline-formula> m temperature forecasts for three
different groups of stations across central Europe at the midlatitudes, namely,
stations in the plain, in the foreland, and within mountainous terrain
(Fig. <xref ref-type="fig" rid="Ch1.F1"/>). The study highlights the advantages and drawbacks
of the different approaches in different topographical environments and
investigates the impact of a change in the horizontal resolution of the EPS,
which is expected to have a particularly pronounced effect on the predictive
performance.</p>
      <p id="d1e237">The structure of the paper is as follows: Sect. <xref ref-type="sec" rid="Ch1.S2"/> explains
the different methods and the comparison setup including the underlying data.
In Sect. <xref ref-type="sec" rid="Ch1.S3"/>, the different time-adaptive training schemes are
compared in terms of their coefficient paths and their predictive performance.
Finally, a summary and conclusion are given in Sect. <xref ref-type="sec" rid="Ch1.S4"/>.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methodology and comparison setup</title>
      <p id="d1e254">The different training schemes for NR models proposed in the literature
try to adapt to various kinds of error sources that can occur in post-processing,
both in space and time. In order to provide a unifying view and to fix
jargon, we first discuss these different error sources and then introduce
the training schemes considered along with the comparison setup employed.</p><?xmltex \hack{\newpage}?>
<?pagebreak page25?><sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Sources of errors in post-processing</title>
      <p id="d1e265">NR models aim to adjust for errors and biases in EPS forecasts but, of course,
the NR models can be affected by errors and misspecifications themselves.
Therefore, we try to carefully distinguish between the two different models involved
with their associated errors, i.e., the numerical weather prediction model
underlying the EPS vs. the statistical NR model employed for post-processing.</p>
      <p id="d1e268">The skill of the EPS can be quantified in EPS forecast biases and variances, which
(i) typically vary for different locations conditional on the surrounding terrain,
(ii) often show cyclic seasonal patterns, and
(iii) can experience non-seasonal temporal changes, e.g., due to changes in the EPS itself.</p>
      <p id="d1e271">In addition to the error sources in the employed EPS, the performance of the
statistical post-processing itself will typically also
(iv) differ at different measurement sites,
(v) strongly depend on the amount of training data used, and
(vi) whether it is affected by effects that are not accounted for in the NR specification.</p>
      <p id="d1e274">Clearly, larger training samples (v) will lead to more reliable predictions when
the NR specification (vi) – in terms of response distribution, covariates
and corresponding effects, link functions, estimation method, etc. – appropriately captures the
error characteristics in the relationship between EPS forecasts and actual
observations. However, when these error characteristics differ in space
(i and iv) and/or in time (ii and iii), it is not obvious what the best strategy
for training the NR is. Extending the training data (v) in space or time will
reduce the variance of the NR estimation but might also introduce bias if the
NR specification (vi) is not adapted. Thus, this is a classical bias-variance
trade-off problem, and we investigate which strategies for dealing with this are
most useful in a typical temperature forecasting situation.</p>
      <p id="d1e278">To fix jargon, we employ the terms “model” and “bias” without further qualifiers
when referring to the NR model in post-processing, whereas when referring to the
numerical weather prediction model we employ “EPS model“ and “EPS bias”.
Moreover, we refer to a statistical model whose estimates have small bias and
variance as stable.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Non-homogeneous regression with time-adaptive training schemes</title>
      <p id="d1e289">Non-homogeneous regression as originally introduced by
<xref ref-type="bibr" rid="bib1.bibx7" id="text.17"/> is a special case of distributional regression,
where a response variable <inline-formula><mml:math id="M2" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> is assumed to follow a specific probability
distribution <inline-formula><mml:math id="M3" display="inline"><mml:mi mathvariant="script">D</mml:mi></mml:math></inline-formula> with distribution parameters <inline-formula><mml:math id="M4" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>K</mml:mi></mml:mrow></mml:math></inline-formula>:
            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M5" display="block"><mml:mrow><mml:mi>y</mml:mi><mml:mo>∼</mml:mo><mml:mi mathvariant="script">D</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>K</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="script">D</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">η</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mi>K</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">η</mml:mi><mml:mi>K</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where each parameter of the distribution is linked to an additive predictor
<inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">η</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> via a link function <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> to ensure its appropriate
co-domain. In the case of post-processing air temperatures, the normal distribution
is typically employed <xref ref-type="bibr" rid="bib1.bibx5" id="paren.18"/>, and Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) can
be rewritten as
            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M8" display="block"><mml:mrow><mml:mi>y</mml:mi><mml:mo>∼</mml:mo><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          In the classical NR <xref ref-type="bibr" rid="bib1.bibx7" id="paren.19"/>, the two distribution parameters
location <inline-formula><mml:math id="M9" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> and scale <inline-formula><mml:math id="M10" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> are expressed by the ensemble mean <inline-formula><mml:math id="M11" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> and
ensemble variance or standard deviation <inline-formula><mml:math id="M12" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula>, respectively:

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M13" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E3"><mml:mtd><mml:mtext>3</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">η</mml:mi><mml:mi mathvariant="italic">μ</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>⋅</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E4"><mml:mtd><mml:mtext>4</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">η</mml:mi><mml:mi mathvariant="italic">σ</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>⋅</mml:mo><mml:mi>s</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            with <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mo>•</mml:mo></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>•</mml:mo></mml:msub></mml:mrow></mml:math></inline-formula> being the corresponding intercept and
slope coefficients. Here, we use the logarithm link to ensure positivity of the
scale parameter <inline-formula><mml:math id="M16" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>; however, a quadratic link with additional parameter
constraints for the coefficients as used by <xref ref-type="bibr" rid="bib1.bibx7" id="text.20"/> would
also be feasible. In this study, we regard the statistical model specifications
according to Eqs. (<xref ref-type="disp-formula" rid="Ch1.E2"/>)–(<xref ref-type="disp-formula" rid="Ch1.E4"/>), but all concepts of
time-adaptive training schemes could easily be transferred to other response
distributions <inline-formula><mml:math id="M17" display="inline"><mml:mi mathvariant="script">D</mml:mi></mml:math></inline-formula>, to alternative link functions <inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:mi>h</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>,
or to more complex additive predictors <inline-formula><mml:math id="M19" display="inline"><mml:mi mathvariant="italic">η</mml:mi></mml:math></inline-formula> with additional covariates.</p>
      <p id="d1e649">The regression coefficients <inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mo>•</mml:mo></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>•</mml:mo></mml:msub></mml:mrow></mml:math></inline-formula> are estimated
by minimizing a loss function over a training data set containing historical
pairs of observations and EPS forecasts. In this study, we employ maximum
likelihood estimation, which performs very similarly to minimizing the continuous
ranked probability score (CRPS, <xref ref-type="bibr" rid="bib1.bibx6" id="altparen.21"/>) as used by
<xref ref-type="bibr" rid="bib1.bibx7" id="text.22"/> when the response distribution is well specified
<xref ref-type="bibr" rid="bib1.bibx4" id="paren.23"/>. For a single observation <inline-formula><mml:math id="M22" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>, the
log-likelihood <inline-formula><mml:math id="M23" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula> of the normal distribution is given by
            <disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M24" display="block"><mml:mrow><mml:mi>L</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>|</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>log⁡</mml:mi><mml:mfenced close="}" open="{"><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mfrac></mml:mstyle><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mfenced close=")" open="("><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>y</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mfrac></mml:mstyle></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the probability density function of the normal
distribution. The coefficients <inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mo>•</mml:mo></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>•</mml:mo></mml:msub></mml:mrow></mml:math></inline-formula>,
specified in Eqs. (<xref ref-type="disp-formula" rid="Ch1.E3"/>) and (<xref ref-type="disp-formula" rid="Ch1.E4"/>), are derived by
minimizing the sum of negative log-likelihood contributions <inline-formula><mml:math id="M28" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula> over the
training data. The larger the training data, the more stable the estimation
in case the statistical model is well specified; however, if the covariate's
skill varies either seasonally or non-seasonally over time, this leads to the
bias-variance trade-off between preferable large training data sets for stable
estimation and the benefit of shorter training periods which allow one to adjust
more rapidly to changes in the data or, to be precise, in the error
characteristics of the data (see Sect. <xref ref-type="sec" rid="Ch1.S2.SS1"/>). In the
following, four approaches are discussed on how to gain informative time-adaptive
training data sets while ensuring a stable estimation.</p>
<sec id="Ch1.S2.SS2.SSS1">
  <label>2.2.1</label><title>Sliding-window</title>
      <p id="d1e808">The <italic>sliding-window</italic> approach originally introduced by
<xref ref-type="bibr" rid="bib1.bibx7" id="text.24"/> uses the most recent days prior to<?pagebreak page26?> the day of
interest as training data for estimation. For post-processing <inline-formula><mml:math id="M29" display="inline"><mml:mn mathvariant="normal">2</mml:mn></mml:math></inline-formula> m
temperature forecasts, <xref ref-type="bibr" rid="bib1.bibx7" id="text.25"/> found the best predictive
performance for training periods between 30 and 45 d with substantial gains
in increasing the training period beyond 30 d and slow but steady
performance losses for training lengths beyond 45 d. According to
<xref ref-type="bibr" rid="bib1.bibx7" id="text.26"/>, the latter is presumably a result of seasonally
varying EPS forecast biases.</p>
      <p id="d1e830">In this study, we use a period of 40 d for the <italic>sliding-window</italic>
approach, which is a frequently used compromise (e.g.,
<xref ref-type="bibr" rid="bib1.bibx1 bib1.bibx7 bib1.bibx31" id="altparen.27"/>). However, as
discussed in <xref ref-type="bibr" rid="bib1.bibx7" id="text.28"/>, different training periods might
perform better for distinct weather variables, locations, forecast steps, or
model specifications. Common choices in the literature include training
lengths between 15 and 100 d, for example, depending on whether the
estimation of regression coefficients is performed station-specifically or jointly
for multiple locations at once.</p>
</sec>
<sec id="Ch1.S2.SS2.SSS2">
  <label>2.2.2</label><title>Regularized sliding-window</title>
      <p id="d1e850">A regularized adaption of the classical <italic>sliding-window</italic> approach was
introduced by <xref ref-type="bibr" rid="bib1.bibx26" id="text.29"/> in order to stabilize the estimation based
on early stopping in statistical learning. The motivation is that
gradient-based optimizers adjust the starting values by iteratively taking
steps in the direction of the steepest descent of a distinct loss function
until some convergence condition is fulfilled. These steps are largest in the
first iteration and get smaller towards the optimum. Thus, the most
important adjustments are made during the first steps, while further
adjustments often improve the fit to unimportant or even random features in the
data, which can lead to wiggly coefficient paths over time and ultimately to an
overfitting <xref ref-type="bibr" rid="bib1.bibx26" id="paren.30"/>.</p>
      <p id="d1e862">Therefore, <xref ref-type="bibr" rid="bib1.bibx26" id="text.31"/> proposes to use the coefficients of the
previous day as starting values and to stop the optimizer after a single
iteration to stabilize the evolution of the coefficient estimates. A drawback
of his approach is that it implies that the estimation never converges and, in
the case of poor starting values or strong truly observed temporal changes in the
data, the obtained coefficients might be incorrect <xref ref-type="bibr" rid="bib1.bibx26" id="paren.32"/>. For
post-processing precipitation amounts employing a left-censored generalized
extreme value distribution, <xref ref-type="bibr" rid="bib1.bibx26" id="text.33"/> obtained better results with
regularized coefficients than without regularization.</p>
      <p id="d1e874">For the <italic>regularized sliding-window</italic> approach used in this study, we
employ the quasi-Newton Broyden–Fletcher–Goldfarb–Shanno (BFGS) algorithm as in
<xref ref-type="bibr" rid="bib1.bibx26" id="text.34"/> and stop the optimizer after one single iteration. For
the first time, we let the BFGS algorithm perform 10 iterations and use
<inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mo>⊤</mml:mo></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>=</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:msup><mml:mo>)</mml:mo><mml:mo>⊤</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> as starting values in the location
parameter <inline-formula><mml:math id="M31" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mo>⊤</mml:mo></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>=</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:msup><mml:mo>)</mml:mo><mml:mo>⊤</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> as starting
values in the scale parameter <inline-formula><mml:math id="M33" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>. According to <xref ref-type="bibr" rid="bib1.bibx26" id="text.35"/> a
single iteration might not always provide the optimum degree of regularization;
however, for the presented comparison study a single iteration yields a
regularized setup which is on the opposite side of the possible model spectrum
compared to the classical <italic>sliding-window</italic> approach
which runs until convergence.
In comparison to <xref ref-type="bibr" rid="bib1.bibx26" id="text.36"/>, we
perform maximum likelihood estimation instead of CRPS minimization.</p>
</sec>
<sec id="Ch1.S2.SS2.SSS3">
  <label>2.2.3</label><title>Sliding-window plus</title>
      <p id="d1e1002">As already pointed out by <xref ref-type="bibr" rid="bib1.bibx7" id="text.37"/>, training data from
previous years could additionally be included in the <italic>sliding-window</italic>
approach to address seasonal effects. This should reduce the variance in the
estimation of the regression coefficients, which stabilizes the evolution of
the coefficients similarly to the <italic>regularized sliding-window</italic> approach.</p>
      <p id="d1e1014">This idea has recently been pursued by
<xref ref-type="bibr" rid="bib1.bibx30" id="text.38"/> for the construction of climatological reference forecasts
and by
<xref ref-type="bibr" rid="bib1.bibx19" id="text.39"/> for a
post-processing approach based on D-vine copulas in which many more
coefficients than in classical NR need to be estimated,
making a more extensive training data set necessary.
Their so-called “refined training data
set” consists of the most 45 recent days prior to the day of interest plus
91 d centered around the same calendar day over all previous years
available. Including multiple years yields more stable estimates, while, on the
other hand, there is the trade-off of losing the ability to quickly adjust to
non-seasonal temporal changes in the EPS forecast biases.
The approach of <xref ref-type="bibr" rid="bib1.bibx19" id="text.40"/> can be seen as time-adaptive version of the seasonal training proposed by <xref ref-type="bibr" rid="bib1.bibx11" id="text.41"/>, who consider training data sets comprised of days from all previous years within the same season (winter/summer).</p>
      <p id="d1e1029">In this study, to be comparable to the <italic>sliding-window</italic> approach, we use
the most recent 40 d prior to estimation and a respective 81 d interval
centered around the day of interest over the previous years available in the
training data.</p>
</sec>
<sec id="Ch1.S2.SS2.SSS4">
  <label>2.2.4</label><?xmltex \opttitle{\text{Smooth model}}?><title>
            Smooth model
          </title>
      <p id="d1e1044">If we reformulate the <italic>sliding-window plus</italic> approach, it is very similar
to fitting an annual cyclic smooth function where the points of the function
only depend on data points in the closer neighborhood, specified by the sliding-window length.</p>
      <p id="d1e1050">Cyclic smooth functions belong to the broader model class of generalized
additive models <xref ref-type="bibr" rid="bib1.bibx10" id="paren.42"><named-content content-type="pre">GAMs,</named-content></xref>, which allow one to
include potentially nonlinear effects in the linear predictors <inline-formula><mml:math id="M34" display="inline"><mml:mi mathvariant="italic">η</mml:mi></mml:math></inline-formula>. Smooth
functions are also referred to as regression splines and are directly linked to
the model parameters as additive terms in <inline-formula><mml:math id="M35" display="inline"><mml:mi mathvariant="italic">η</mml:mi></mml:math></inline-formula>. Introductory material for
cyclic smooth functions conditional on the day of the year can be found in
<xref ref-type="bibr" rid="bib1.bibx16" id="text.43"/>, and a comprehensive summary of GAMs is given in
<xref ref-type="bibr" rid="bib1.bibx32" id="text.44"/>.</p>
      <p id="d1e1078">To account for seasonal variations we only need to fit one single model, here
called the <italic>smooth model</italic>, over a training<?pagebreak page27?> data set with several years of
data. The effects included allow the coefficients to smoothly evolve over the
year, which leads to the following adaptations in Eq. (<xref ref-type="disp-formula" rid="Ch1.E3"/>) and
(<xref ref-type="disp-formula" rid="Ch1.E4"/>) for the location <inline-formula><mml:math id="M36" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> and scale <inline-formula><mml:math id="M37" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>, respectively:

                  <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M38" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E6"><mml:mtd><mml:mtext>6</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">η</mml:mi><mml:mi mathvariant="italic">μ</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mtext>doy</mml:mtext><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mtext>doy</mml:mtext><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi>m</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E7"><mml:mtd><mml:mtext>7</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">η</mml:mi><mml:mi mathvariant="italic">σ</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:munder><mml:munder class="underbrace"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mtext>doy</mml:mtext><mml:mo>)</mml:mo></mml:mrow><mml:mo mathvariant="normal">︸</mml:mo></mml:munder><mml:mstyle scriptlevel="+1"><mml:mtable class="substack"><mml:mtr><mml:mtd><mml:mtext>seasonally varying</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>intercept</mml:mtext></mml:mtd></mml:mtr></mml:mtable></mml:mstyle></mml:munder><mml:mo>+</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:munder><mml:munder class="underbrace"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mtext>doy</mml:mtext><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow><mml:mo mathvariant="normal">︸</mml:mo></mml:munder><mml:mstyle scriptlevel="+1"><mml:mtable class="substack"><mml:mtr><mml:mtd><mml:mtext>seasonally varying</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>slope</mml:mtext></mml:mtd></mml:mtr></mml:mtable></mml:mstyle></mml:munder><mml:mo>⋅</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi>s</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

              with <inline-formula><mml:math id="M39" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M40" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula> being the ensemble mean and ensemble standard deviation,
respectively; <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mo>•</mml:mo></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M42" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>•</mml:mo></mml:msub></mml:mrow></mml:math></inline-formula> are regression coefficients,
and <inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mo>•</mml:mo></mml:msub><mml:mo>(</mml:mo><mml:mtext>doy</mml:mtext><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:msub><mml:mi>g</mml:mi><mml:mo>•</mml:mo></mml:msub><mml:mo>(</mml:mo><mml:mtext>doy</mml:mtext><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> employ cyclic
regression splines conditional on the day of the year <xref ref-type="bibr" rid="bib1.bibx32" id="paren.45"/>. The
regression coefficients <inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, as well as <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>,
are unconditional on the day of the year and can be interpreted as global
intercept or slope coefficients, respectively.</p>

<?xmltex \floatpos{t!}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e1407">Overview of time-adaptive training schemes, distinguished by
model specification/estimation and training data selection corresponding
to errors sources (vi) and (v), respectively. The basic model specification
refers to Eqs. (<xref ref-type="disp-formula" rid="Ch1.E3"/>)–(<xref ref-type="disp-formula" rid="Ch1.E4"/>), in contrast to the
extended Eqs. (<xref ref-type="disp-formula" rid="Ch1.E6"/>)–(<xref ref-type="disp-formula" rid="Ch1.E7"/>).</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left" colsep="1"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" namest="col2" nameend="col3" align="center" colsep="1">Model </oasis:entry>
         <oasis:entry rowsep="1" namest="col4" nameend="col5" align="center">Data </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Name</oasis:entry>
         <oasis:entry colname="col2">Specification</oasis:entry>
         <oasis:entry colname="col3">Estimation</oasis:entry>
         <oasis:entry colname="col4">Years</oasis:entry>
         <oasis:entry colname="col5">Seasons</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1"><italic>Sliding-window</italic></oasis:entry>
         <oasis:entry colname="col2">Basic</oasis:entry>
         <oasis:entry colname="col3">Maximum likelihood</oasis:entry>
         <oasis:entry colname="col4">Current</oasis:entry>
         <oasis:entry colname="col5">Current</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><italic>Regularized sliding-window</italic></oasis:entry>
         <oasis:entry colname="col2">Basic</oasis:entry>
         <oasis:entry colname="col3">Early stopping</oasis:entry>
         <oasis:entry colname="col4">Current</oasis:entry>
         <oasis:entry colname="col5">Current</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><italic>Sliding-window plus</italic></oasis:entry>
         <oasis:entry colname="col2">Basic</oasis:entry>
         <oasis:entry colname="col3">Maximum likelihood</oasis:entry>
         <oasis:entry colname="col4">Multiple</oasis:entry>
         <oasis:entry colname="col5">Current</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><italic>Smooth model</italic></oasis:entry>
         <oasis:entry colname="col2">Extended</oasis:entry>
         <oasis:entry colname="col3">Penalized</oasis:entry>
         <oasis:entry colname="col4">Multiple</oasis:entry>
         <oasis:entry colname="col5">All</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Comparison setup</title>
<sec id="Ch1.S2.SS3.SSS1">
  <label>2.3.1</label><title>NR training schemes</title>
      <p id="d1e1562">The NR training schemes presented in Sect. <xref ref-type="sec" rid="Ch1.S2.SS2"/> deal with
the potential temporal error sources from Sect. <xref ref-type="sec" rid="Ch1.S2.SS1"/> in different ways
(see Table <xref ref-type="table" rid="Ch1.T1"/> for an overview). The classic <italic>sliding-window</italic> employs
the basic NR model equations from Eqs. (<xref ref-type="disp-formula" rid="Ch1.E3"/>) to (<xref ref-type="disp-formula" rid="Ch1.E4"/>) and avoids
potential biases in the NR model estimation by using only very recent data from
the same year and season. Compared to this, the <italic>regularized sliding-window</italic> and <italic>sliding-window plus</italic> approaches both try to stabilize the coefficient estimates by reducing the variance – either
through regularized estimation (vi) or by considering multiple years (v). The <italic>smooth model</italic>
differs from all of these by modifying both the model (vi) and data (v) specification,
using the extended model specification from Eqs. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) to (<xref ref-type="disp-formula" rid="Ch1.E7"/>)
fitted by penalized estimation to a large data set comprising several years and all seasons.</p>
      <p id="d1e1592">Potential spatial differences (i) and (iv) are handled for all training schemes in the same
way: the NR models are estimated separately for each station and subsequently evaluated
in groups of terrain types (plain, foreland, alpine). The underlying EPS data – described
subsequently – are the same for all NR training schemes and are thus affected by the same
seasonal (ii) and non-seasonal changes (iii).</p>
</sec>
<sec id="Ch1.S2.SS3.SSS2">
  <label>2.3.2</label><title>Data sets</title>
      <p id="d1e1603">For validation of the training schemes, we consider 2 m temperature ensemble
forecasts and corresponding observations at <inline-formula><mml:math id="M49" display="inline"><mml:mn mathvariant="normal">15</mml:mn></mml:math></inline-formula> measurement sites located across
Austria, Germany, and Switzerland. The sites are chosen to investigate the impact of
potential error sources in space (i) and (iv), e.g., through varying discrepancies
between the real and EPS topography. The data comprises three groups of five
stations located either in plains, in mountain foreland, or within mountainous terrain
(see Fig. <xref ref-type="fig" rid="Ch1.F1"/>). The estimated statistical models for
stations Hamburg and Innsbruck, highlighted by symbols with white borders, are
discussed in more detail in Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/>.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1"><?xmltex \currentcnt{1}?><label>Figure 1</label><caption><p id="d1e1619">Overview of the study area with selected stations classified as
plain, foreland, and alpine station sites. The two highlighted and labeled
stations, Hamburg and Innsbruck, are discussed in detail in
Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/>. Elevation data are obtained from the SRTM-30 m
digital elevation model <xref ref-type="bibr" rid="bib1.bibx20" id="paren.46"/>.</p></caption>
            <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://npg.copernicus.org/articles/27/23/2020/npg-27-23-2020-f01.png"/>

          </fig>

      <p id="d1e1633">As covariates for Eqs. (<xref ref-type="disp-formula" rid="Ch1.E3"/>)–(<xref ref-type="disp-formula" rid="Ch1.E7"/>), we employ the ensemble mean <inline-formula><mml:math id="M50" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> and the
ensemble standard deviation <inline-formula><mml:math id="M51" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula> of bilinearly interpolated 2 m temperature
forecasts issued by the global 50-member EPS of the European Centre for
Medium-Range Weather Forecasts (ECMWF). We assess forecast steps from <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">12</mml:mn></mml:mrow></mml:math></inline-formula>
to <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">72</mml:mn></mml:mrow></mml:math></inline-formula> h ahead at a <inline-formula><mml:math id="M54" display="inline"><mml:mn mathvariant="normal">12</mml:mn></mml:math></inline-formula>-hourly temporal resolution for the EPS run
initialized at 00:00 UTC and use data from 8 March 2010 to 7 March 2019.</p>
      <p id="d1e1683">This period has been selected in order to investigate the impact of
non-seasonal long-term changes in the EPS model (iii) that is not reflected in
the NR model specifications; i.e., the horizontal resolution of the ECMWF
EPS changed from the previous version (cycle 36r1; 26 January 2010) to the
new version on 8 March 2016 (cycle 41r2). This specific model change was
chosen among various others as it modifies the height of the terrain and, thus,
likely introduces an EPS bias for temperature forecasts directly affecting the
coefficient estimates; other changes such as modified model parameterizations
or improvements in the analysis scheme are expected to have a minor impact on
the post-processing of 2 m temperatures.
It is of specific interest how the <italic>sliding-window plus</italic> and the <italic>smooth model</italic> are affected if
the training period comprises data from both the “old EPS version” before the
change in the horizontal resolution as well as the “new EPS version”. Thus, we construct three data sets
with different validation periods that are either (a) not affected by this EPS
model change at all, (b) start immediately after the model change, or (c) have
some time lag after change.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><?xmltex \currentcnt{2}?><label>Figure 2</label><caption><p id="d1e1694"><bold>(a)</bold> Illustrative example of how the training data sets are composed
for the four different time-adaptive training schemes. <bold>(b)</bold> Schematic overview
of the training and validation data sets employed in this study with regard to the
change in the horizontal resolution of the ECMWF EPS on 8 March 2016
(cycle 41r2). For training, up to 4 years of data are used in all data sets;
for validation, 2 years of data are used for data sets A and B, and 1 year
for data set C.</p></caption>
            <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://npg.copernicus.org/articles/27/23/2020/npg-27-23-2020-f02.png"/>

          </fig>

      <?pagebreak page28?><p id="d1e1708">To understand how this affects the different training schemes, we first illustrate
in Fig. <xref ref-type="fig" rid="Ch1.F2"/>a how training and validation periods are selected for
each scheme. For the three sliding-window approaches, the NR models are re-estimated
every day as the validation date rolls through the validation period (hatched area).
In contrast, the <italic>smooth model</italic> is estimated only once for the entire validation period
based on a fixed training data period of 4 years prior to the validation
period. For a fair comparison, the training data for the
<italic>sliding-window plus</italic> model are also restricted to 4 years prior
to each validation date.</p>
      <p id="d1e1719">Now Fig. <xref ref-type="fig" rid="Ch1.F2"/>b illustrates how the three data sets A, B, and C are
selected in relation to the EPS change on 8 March 2016.</p>
      <p id="d1e1724"><list list-type="bullet">
              <list-item>

      <p id="d1e1729">Data set A. All models are trained and evaluated without being affected by
the EPS change.</p>
              </list-item>
              <list-item>

      <p id="d1e1735">Data set B. All models start with a training period entirely before the
EPS change but a validation period entirely after the change. However, for the
<italic>sliding-window</italic> and <italic>regularized sliding-window</italic> approaches, the training period
quickly rolls across the change point, and after 40 d they are not affected
by it anymore. For <italic>sliding-window plus</italic> the training data also roll
into the new EPS version but still partially use data from the old EPS version.
Finally, as the <italic>smooth model</italic> is only estimated once, it cannot adapt at
all to the new EPS version.</p>
              </list-item>
              <list-item>

      <p id="d1e1753">Data set C. Effects from A and B are mixed so that the <italic>smooth model</italic> and the
<italic>sliding-window plus</italic> model use data from both the old and new EPS
versions, while the classical <italic>sliding-window</italic> and <italic>regularized sliding-window</italic> models already use only data from the new EPS version.</p>
              </list-item>
            </list></p>
      <p id="d1e1771">The validation period is 2 years for A and B and 1 year for C.  A total number
of 731/730/365 NR models has to be estimated for the three sliding-window
approaches, while only<?pagebreak page29?> 1/1/1 <italic>smooth model</italic> is required for data sets
A/B/C per station and forecast step. The computation time for the various
sliding-window approaches is in the order of seconds, whereas the estimation of
the <italic>smooth model</italic>, including full Markov chain Monte Carlo (MCMC) sampling, is in the order of minutes on a standard computer.</p>
</sec>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results</title>
      <p id="d1e1790">This section assesses the performance of the different time-adaptive training
schemes. First, the temporal evolutions of the estimated coefficients are shown
for two stations representative of one measurement site in the plains and one
in mountainous terrain. Afterwards, the predictive performance of the training
schemes is evaluated in terms of the CRPS conditional on the three data sets
with and without the change in the horizontal resolution of the EPS (Fig. <xref ref-type="fig" rid="Ch1.F2"/>) and grouped for stations
classified as topographically plain, mountain foreland, and alpine sites
(Fig. <xref ref-type="fig" rid="Ch1.F1"/>).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><label>Figure 3</label><caption><p id="d1e1799">Temporal evolution of regression coefficients for the validation
period in data set A for Innsbruck at forecast step <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">36</mml:mn></mml:mrow></mml:math></inline-formula> h (valid at
12:00 UTC). The coefficient paths are shown for the coefficients <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(a–c)</bold> and <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(d–f)</bold> in the location parameter <inline-formula><mml:math id="M58" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> and for the
coefficients <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(g–i)</bold> and <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(j–l)</bold> in the scale
parameter <inline-formula><mml:math id="M61" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> based on the <italic>sliding-window</italic>, <italic>regularized sliding-window</italic>, and <italic>sliding-window plus</italic> approaches (dashed, from left
to right) compared to the <italic>smooth model</italic> approach (solid line).  The
coefficient paths are plotted for the consecutive calendar years 2014, 2015,
and 2016 as dashed, dotted, and two-dashed lines, respectively.  The grey
shading represents the <inline-formula><mml:math id="M62" display="inline"><mml:mn mathvariant="normal">95</mml:mn></mml:math></inline-formula> % credible intervals of the coefficients in the
<italic>smooth model</italic> based on MCMC sampling.</p></caption>
        <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://npg.copernicus.org/articles/27/23/2020/npg-27-23-2020-f03.png"/>

      </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><label>Figure 4</label><caption><p id="d1e1914">As Fig. <xref ref-type="fig" rid="Ch1.F3"/> but for Hamburg at forecast step
<inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">36</mml:mn></mml:mrow></mml:math></inline-formula> h (valid at 12:00 UTC).</p></caption>
        <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://npg.copernicus.org/articles/27/23/2020/npg-27-23-2020-f04.png"/>

      </fig>

<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Coefficient paths</title>
      <p id="d1e1943">Figure <xref ref-type="fig" rid="Ch1.F3"/> shows the estimated coefficients for Innsbruck at
forecast step <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">36</mml:mn></mml:mrow></mml:math></inline-formula> h conditional on the day of the year. The coefficient
paths are plotted for the different time-adaptive training schemes for 2
years included in the validation period of data set A. The pronounced seasonal
evolution of the coefficients for all training schemes shows that the EPS'
forecast bias and skill varies seasonally, which makes a time-adaptive training
scheme mandatory to capture these characteristics in the post-processing.
During summer, a slope coefficient <inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> close to 1 in the
location parameter <inline-formula><mml:math id="M66" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> and a high slope coefficient <inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> in the scale
parameter <inline-formula><mml:math id="M68" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> indicate a better performance of the EPS compared to the cold season.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><label>Figure 5</label><caption><p id="d1e1997">CRPS skill scores clustered into groups of stations located in the
plain, in the mountain foreland near the Alps, and within mountainous terrain
and for the out-of-sample validation periods according to the different data
sets: data set A without the change in the horizontal resolution of the EPS,
data set B with the EPS change in
between the training and validation data sets, and data set C with the EPS
change within training data (Fig. <xref ref-type="fig" rid="Ch1.F2"/>).
Compared are the different time-adaptive training
schemes specified in Sect. <xref ref-type="sec" rid="Ch1.S2.SS2"/> with the classical
<italic>sliding-window</italic> approach as a reference; note that “sliding-window” is
abbreviated as SW in the figure. Each box-and-whisker contains aggregated skill
scores over the forecast steps from <inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">12</mml:mn></mml:mrow></mml:math></inline-formula> to <inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">72</mml:mn></mml:mrow></mml:math></inline-formula> h at a <inline-formula><mml:math id="M71" display="inline"><mml:mn mathvariant="normal">12</mml:mn></mml:math></inline-formula>-hourly
temporal resolution and over five respective weather stations
(Fig. <xref ref-type="fig" rid="Ch1.F1"/>). Skill scores are in percent and positive values
indicate improvements over the reference.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://npg.copernicus.org/articles/27/23/2020/npg-27-23-2020-f05.png"/>

        </fig>

      <p id="d1e2043">In comparison to the other time-adaptive training schemes, the classical
<italic>sliding-window</italic> approach (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a, d, g, j)
shows very strong outliers and an unstable temporal evolution for all
coefficients with distinct differences during the 2 subsequent validation
years; this is more pronounced for the scale parameter <inline-formula><mml:math id="M72" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> where the
estimates seem to be more volatile than for the location parameter <inline-formula><mml:math id="M73" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>. All
strategies extending the classical <italic>sliding-window</italic> approach smooth the
temporal evolution of the coefficients to a certain extent while maintaining
the overall seasonal cyclic pattern. For the <italic>regularized sliding-window</italic> approach (Fig. <xref ref-type="fig" rid="Ch1.F3"/>b, e, h, k), the
stabilization strongly differs for the individual coefficients, and some of the
estimated coefficients seem to need rather long to adapt during the transition
periods; the latter could indicate that a single iteration step might not be
sufficient in this study. The coefficient paths for the <italic>sliding-window plus</italic> approach (Fig. <xref ref-type="fig" rid="Ch1.F3"/>c, f, i, l) and for the
<italic>smooth model</italic> (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a–l; solid line) look very
similar with minor distortions during the cold season. Due to the definition of
the <italic>smooth model</italic>, its coefficient paths show the most stable evolution
but with the lowest ability to react to abrupt changes in the error
characteristics.</p>
      <p id="d1e2089">For Hamburg (Fig. <xref ref-type="fig" rid="Ch1.F4"/>) by contrast to Innsbruck, the
information content of the mean EPS temperature forecast is quite high
throughout the year. This yields a lower bias correction and an almost
one-to-one mapping of the ensemble mean to the location parameter <inline-formula><mml:math id="M74" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>
indicated by a coefficient <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> close to 1. Despite the different
post-processing characteristics, the temporal evolution of the coefficient
paths is similar to the one for Innsbruck, which confirms our previous findings:
for the extended sliding-window approaches the coefficients have indeed very
little seasonal variability, while for the classical <italic>sliding-window</italic>
approach the coefficients show unrealistically strong fluctuations over time
without a clear seasonal pattern (Fig. <xref ref-type="fig" rid="Ch1.F4"/>a, d, g, j). As
for Innsbruck, the <italic>regularized sliding-window</italic> approach has a rather
unrealistic stepwise evolution for some coefficients
(Fig. <xref ref-type="fig" rid="Ch1.F4"/>b, e, h, k). The coefficient paths for the
<italic>sliding-window plus</italic> approach (Fig. <xref ref-type="fig" rid="Ch1.F4"/>c, f, i, l)
and the <italic>smooth model</italic> (Fig. <xref ref-type="fig" rid="Ch1.F4"/>; solid line) look
comparable. These results support the bias-variance trade-off where regularizing
or smoothing stabilizes the coefficient paths while losing the ability to
rapidly react to temporal changes in the data.</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Predictive performance</title>
      <p id="d1e2141">After the illustrative evaluation of the coefficients' temporal evolution for
the different time-adaptive training schemes, Fig. <xref ref-type="fig" rid="Ch1.F5"/> shows
aggregated CRPS skill scores for groups of five respective stations classified
as topographically plain, mountain foreland, and alpine sites
(Fig. <xref ref-type="fig" rid="Ch1.F1"/>) regarding data sets A, B, and C
(Fig. <xref ref-type="fig" rid="Ch1.F2"/>). In all panels the <italic>regularized sliding-window</italic>
approach, the <italic>sliding-window plus</italic> approach, and the <italic>smooth model</italic> are compared to the classical <italic>sliding-window</italic> approach as a
reference.</p>
      <p id="d1e2163"><list list-type="bullet">
            <list-item>

      <p id="d1e2168">For data set A, the <italic>regularized sliding-window</italic> approach shows only
little improvements for the plain and foreland and an overall performance loss
for alpine stations. By contrast, the <italic>sliding-window plus</italic> and
<italic>smooth model</italic> approaches show distinct improvements over the
classical <italic>sliding-window</italic> approach, with the largest values for alpine
sites.</p>
            </list-item>
            <list-item>

      <p id="d1e2186">For data set B at stations in the plains and foreland, the mean predictive
skill behaves similarly to data set A, except that the <italic>smooth model</italic> shows a slightly larger variance. For alpine stations, the
<italic>regularized sliding-window</italic> approach performs slightly worse than in
data set A, while the two approaches using training data over multiple years no
longer outperform the reference.</p>
            </list-item>
            <list-item>

      <?pagebreak page30?><p id="d1e2198">For data set C at stations in the plains and foreland, the predictive skill
is again similar to data set A with slight performance losses. For alpine
stations, the <italic>regularized sliding-window</italic> approach shows even less
skill than in data set B, while the two other approaches again outperform the
<italic>sliding-window</italic> approach and are on a similar level to that in data set A.</p>
            </list-item>
          </list></p>
      <p id="d1e2209">The validation of the different time-adaptive training schemes shows that the
<italic>sliding-window plus</italic> approach and the <italic>smooth model</italic> perform
overall similarly and are clearly superior for all station types compared to the
classical <italic>sliding-window</italic> approach. However, the <italic>smooth model</italic>
shows the highest variance in the predictive performance in the case of a change in
the EPS, especially in mountainous terrain (data sets B and C); this is likely
due to its reduced ability to adapt to temporal changes in the data.
Furthermore, the validation shows that the <italic>regularized sliding-window</italic>
approach seems to have difficulties in mountainous terrain and yields only
minor improvements for plain and foreland sites.</p>
</sec>
</sec>
<sec id="Ch1.S4" sec-type="conclusions">
  <label>4</label><title>Summary and conclusion</title>
      <p id="d1e2236">Non-homogeneous regression (NR) is a widely used method to statistically
post-process ensemble weather forecasts. In its original version it was used
for temperature forecasts employing a Gaussian response distribution, but over
the last decade various statistical model extensions have been proposed for
other quantities employing different response distributions or to enhance its
predictive performance.  When estimating NR models there is always a trade-off
between large enough training data sets to get stable estimates and still
allowing the statistical model to adjust to temporal changes in the statistical
error characteristics of the data. Therefore, different training schemes with
specific advantages and drawbacks have been developed as presented in this
paper.  To show a wide spectrum of possible approaches in a unified<?pagebreak page31?> setup, we
consider typical basic applications of the training schemes and refrain from
more elaborate tuning or combinations.</p>
      <p id="d1e2239">The classical <italic>sliding-window</italic> approach has the advantage that no
extensive training data set is required, which allows the statistical model to
adjust itself rapidly to changing forecast biases, for example, in the case of
changes in the EPS. On the other hand, statistical models trained on a small
training data set have typically large variance in the estimation of the
regression coefficients, which can yield unstable wiggly coefficient paths.
Additional regularization allows one to stabilize the evolution of the
regression coefficients without losing the simplicity of the classical
<italic>sliding-window</italic> approach. However, inappropriate settings of the
optimizer, e.g., unrealistic starting values or insufficient update steps,
can quickly lead to incorrect coefficients. The alternative
<italic>sliding-window plus</italic> strategy foregoes regularization but stabilizes
the coefficients by using an extended training data set which includes data
from the same season over several years. Compared to the classical approach the
method requires historical data and partially loses its ability to rapidly
adjust to changes in the error characteristics. The last approach presented in
this paper can be seen as a generalization of the <italic>sliding-window plus</italic>
approach. Rather than using a training data set centered around the date of
interest, the <italic>smooth model</italic> makes use of all historical data in
combination with cyclic regression splines, which allows the coefficients to
smoothly evolve over the year.</p>
      <p id="d1e2257">The differences between the methods presented can be seen in the coefficient
paths shown in Figs. <xref ref-type="fig" rid="Ch1.F3"/> and <xref ref-type="fig" rid="Ch1.F4"/>. The
coefficients of the classical <italic>sliding-window</italic> approach show strong
fluctuations and pronounced peaks throughout the year. Regularization allows one to
stabilize the evolution; however, strong step-wise changes in the coefficient
paths still occur. The two methods using data from multiple years perform
comparably similarly and show stable coefficient paths over the year.
Figure <xref ref-type="fig" rid="Ch1.F5"/> confirms that more stable estimates have a positive
impact on the predictive performance. The <italic>sliding-window plus</italic> approach
and the <italic>smooth model</italic> show an overall improvement of about 3 %–5 % (in
median) over the classical <italic>sliding-window</italic> approach, while the
<italic>regularized sliding-window</italic> only partially outperforms the
<italic>sliding-window</italic> training scheme.  Even in the case of the model change
chosen to<?pagebreak page32?> demonstrate the effect of non-seasonal long-term changes on the
coefficient estimates, the training schemes using multiple years of data are
still superior to the ones using the most recent days only, even if they
technically allow adjustment to the EPS change more rapidly.</p>
      <p id="d1e2285">To conclude, all four training schemes shown in this paper have their
advantages in particular applications. If only short periods of training data
are available (<inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> year), the classical <italic>sliding-window</italic> approach may
already provide sufficiently good estimates. However, as soon as one has access
to longer historical data sets, the approaches using data from multiple years
become superior due to a more stable coefficient evolution over time, which
yields an overall improved performance. This even holds in the case of the EPS
change considered in this study, but may be different for other changes or
EPSs. While the <italic>sliding-window plus</italic> approach is a natural extension of the
classical <italic>sliding-window</italic> approach and, therefore, can be estimated by
the same software, the <italic>smooth model</italic> approach can be seen as a
generalization, and only a single model has to be estimated for all seasons
using all available data. The <italic>smooth model</italic> yields, by definition, the
smoothest and most stable coefficient paths but with the lowest ability to
adjust itself to a new error characteristic.</p>
</sec>

      
      </body>
    <back><notes notes-type="codeavailability"><title>Code availability</title>

      <p id="d1e2319">All computations are performed in R 3.6.1 <xref ref-type="bibr" rid="bib1.bibx24" id="paren.47"/> <uri>https://www.R-project.org/</uri> (last access: 10 December 2019). The
statistical models using a sliding-window approach are based on R package
<bold>crch</bold> <xref ref-type="bibr" rid="bib1.bibx18" id="paren.48"/> (<ext-link xlink:href="https://doi.org/10.32614/RJ-2016-012" ext-link-type="DOI">10.32614/RJ-2016-012</ext-link>) employing a frequentist maximum
likelihood approach. The statistical models using a time-adaptive training
scheme by fitting cyclic smooth functions are fitted with R package
<bold>bamlss</bold> <xref ref-type="bibr" rid="bib1.bibx29" id="paren.49"/> (<ext-link xlink:href="https://doi.org/10.1080/10618600.2017.1407325" ext-link-type="DOI">10.1080/10618600.2017.1407325</ext-link>). The package provides a flexible toolbox
for distribution regression models in a Bayesian framework; introductory
material can be found at <uri>http://BayesR.R-Forge.R-project.org/</uri> (last access: 10 December 2019). The
computation of the CRPS is based on R package <bold>scoringRules</bold>
<xref ref-type="bibr" rid="bib1.bibx13" id="paren.50"/> (<ext-link xlink:href="https://doi.org/10.18637/jss.v090.i12" ext-link-type="DOI">10.18637/jss.v090.i12</ext-link>).</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d1e2360">The supplement related to this article is available online at: <inline-supplementary-material xlink:href="https://doi.org/10.5194/npg-27-23-2020-supplement" xlink:title="pdf">https://doi.org/10.5194/npg-27-23-2020-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e2369">This study is based on the PhD work of MNL under supervision of GJM and AZ.
The majority of the work for this study was performed by MNL with the support
of RS. All the<?pagebreak page33?> authors worked closely together in discussing the results and
commenting on the manuscript.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e2375">Sebastian Lerch is one of the editors of the special issue on “Advances in post-processing and blending of deterministic and ensemble forecasts”. The remaining authors declare that they have no conflict of interest.</p>
  </notes><notes notes-type="sistatement"><title>Special issue statement</title>

      <p id="d1e2381">This article is part of the special issue “Advances in post-processing and blending of deterministic and ensemble forecasts”. It is not associated with a conference.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e2387">We
thank the Zentralanstalt für Meteorologie und Geodynamik (ZAMG) for providing
access to the data.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e2392">This project was partly funded by the Austrian Research Promotion Agency (FFG, grant no. 858537) and by the Austrian Science Fund (FWF, grant no. P31836). Sebastian Lerch gratefully acknowledges support by the Deutsche Forschungsgemeinschaft (DFG) through SFB/TRR 165 “Waves to Weather”.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e2398">This paper was edited by Maxime Taillardat and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><?xmltex \def\ref@label{{Baran and M{\"{o}}ller(2017)}}?><label>Baran and Möller(2017)</label><?label baran.moller:2017?><mixed-citation>Baran, S. and Möller, A.: Bivariate Ensemble Model Output Statistics
Approach for Joint Forecasting of Wind Speed and Temperature,
Meteorol. Atmos. Phys., 129, 99–112, <ext-link xlink:href="https://doi.org/10.1007/s00703-016-0467-8" ext-link-type="DOI">10.1007/s00703-016-0467-8</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Barnes et al.(2019)Barnes, Brierley, and Chandler</label><?label barnes.etal:2019?><mixed-citation>Barnes, C., Brierley, C. M., and Chandler, R. E.: New approaches to
postprocessing of multi-model ensemble forecasts,
Q. J. Roy. Meteor. Soc., 145, 3479–3498, <ext-link xlink:href="https://doi.org/10.1002/qj.3632" ext-link-type="DOI">10.1002/qj.3632</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Demaeyer and Vannitsem(2019)</label><?label demaeyer.etal:2019?><mixed-citation>Demaeyer, J. and Vannitsem, S.: Correcting for Model Changes in Statistical Post-Processing – An approach based on Response Theory, Nonlin. Processes Geophys. Discuss., <ext-link xlink:href="https://doi.org/10.5194/npg-2019-57" ext-link-type="DOI">10.5194/npg-2019-57</ext-link>, in review, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Gebetsberger et al.(2018)Gebetsberger, Messner, Mayr, and
Zeileis</label><?label gebetsberger.etal:2018?><mixed-citation>Gebetsberger, M., Messner, J. W., Mayr, G. J., and Zeileis, A.: Estimation
Methods for Nonhomogeneous Regression Models: Minimum Continuous Ranked
Probability Score versus Maximum Likelihood, Mon. Weather Rev., 146,
4323–4338, <ext-link xlink:href="https://doi.org/10.1175/MWR-D-17-0364.1" ext-link-type="DOI">10.1175/MWR-D-17-0364.1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Gneiting and Katzfuss(2014)</label><?label gneiting.katzfuss:2014?><mixed-citation>Gneiting, T. and Katzfuss, M.: Probabilistic Forecasting,
Annu. Rev. Stat. Appl., 1, 125–151,
<ext-link xlink:href="https://doi.org/10.1146/annurev-statistics-062713-085831" ext-link-type="DOI">10.1146/annurev-statistics-062713-085831</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Gneiting and Raftery(2007)</label><?label gneiting.raftery:2007?><mixed-citation>Gneiting, T. and Raftery, A. E.: Strictly Proper Scoring Rules, Prediction, and
Estimation, J. Am. Stat. Assoc., 102, 359–378,
<ext-link xlink:href="https://doi.org/10.1198/016214506000001437" ext-link-type="DOI">10.1198/016214506000001437</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Gneiting et al.(2005)Gneiting, Raftery, Westveld III, and
Goldman</label><?label gneiting.etal:2005?><mixed-citation>Gneiting, T., Raftery, A. E., Westveld III, A. H., and Goldman, T.: Calibrated
Probabilistic Forecasting Using Ensemble Model Output Statistics and Minimum
CRPS Estimation, Mon. Weather Rev., 133, 1098–1118,
<ext-link xlink:href="https://doi.org/10.1175/MWR2904.1" ext-link-type="DOI">10.1175/MWR2904.1</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Hamill(2018)</label><?label hamill:2018?><mixed-citation>Hamill, T. M.: Practical Aspects of Statistical Postprocessing, in: Statistical
Postprocessing of Ensemble Forecasts, edited by: Vannitsem, S., Wilks, D. S.,
and Messner, J. W.,  187–217, Elsevier, <ext-link xlink:href="https://doi.org/10.1016/C2016-0-03244-8" ext-link-type="DOI">10.1016/C2016-0-03244-8</ext-link>,
2018.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Hamill et al.(2008)Hamill, Hagedorn, and Whitaker</label><?label hamill.etal:2008?><mixed-citation>Hamill, T. M., Hagedorn, R., and Whitaker, J. S.: Probabilistic forecast
calibration using ECMWF and GFS ensemble reforecasts. Part II:
Precipitation, Mon. Weather Rev., 136, 2620–2632,
<ext-link xlink:href="https://doi.org/10.1175/2007MWR2411.1" ext-link-type="DOI">10.1175/2007MWR2411.1</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Hastie and Tibshirani(1986)</label><?label hastie.tibshirani:1986?><mixed-citation>
Hastie, T. and Tibshirani, R.: Generalized Additive Models, Stat.
Sci., 1, 297–310,
1986.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Hemri et al.(2016)Hemri, Haiden, and Pappenberger</label><?label hemri.etal:2016?><mixed-citation>Hemri, S., Haiden, T., and Pappenberger, F.: Discrete Postprocessing of Total
Cloud Cover Ensemble Forecasts, Mon. Weather Rev., 144, 2565–2577,
<ext-link xlink:href="https://doi.org/10.1175/mwr-d-15-0426.1" ext-link-type="DOI">10.1175/mwr-d-15-0426.1</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Henzi et al.(2019)Henzi, Ziegel, and Gneiting</label><?label henzi.etal:2019?><mixed-citation>Henzi, A., Ziegel, J. F., and Gneiting, T.: Isotonic Distributional Regression,
arXiv 1909.03725, arXiv.org E-Print Archive,
available at: <uri>http://arxiv.org/abs/1909.03725</uri>, last access: 10 December 2019.</mixed-citation></ref>
      <ref id="bib1.bibx13"><?xmltex \def\ref@label{{Jordan et~al.(2019)Jordan, Kr{\"{u}}ger, and Lerch}}?><label>Jordan et al.(2019)Jordan, Krüger, and Lerch</label><?label jordan.etal:2019?><mixed-citation>Jordan, A., Krüger, F., and Lerch, S.: Evaluating Probabilistic Forecasts
with scoringRules, J. Stat. Softw., 90, 1–37,
<ext-link xlink:href="https://doi.org/10.18637/jss.v090.i12" ext-link-type="DOI">10.18637/jss.v090.i12</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Junk et al.(2015)Junk, Monache, and Alessandrini</label><?label junk.etal:2015?><mixed-citation>Junk, C., Monache, L. D., and Alessandrini, S.: Analog-Based Ensemble Model
Output Statistics, Mon. Weather Rev., 143, 2909–2917,
<ext-link xlink:href="https://doi.org/10.1175/mwr-d-15-0095.1" ext-link-type="DOI">10.1175/mwr-d-15-0095.1</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Klein et al.(2014)Klein, Kneib, Klasen, and Lang</label><?label klein.etal:2014?><mixed-citation>Klein, N., Kneib, T., Klasen, S., and Lang, S.: Bayesian Structured Additive
Distributional Regression for Multivariate Responses,
J. R. Stat. Soc. C-Appl., 64, 569–591,
<ext-link xlink:href="https://doi.org/10.1111/rssc.12090" ext-link-type="DOI">10.1111/rssc.12090</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Lang et al.(2019)Lang, Mayr, Stauffer, and Zeileis</label><?label lang.etal:2019b?><mixed-citation>Lang, M. N., Mayr, G. J., Stauffer, R., and Zeileis, A.: Bivariate Gaussian models for wind vectors in a distributional regression framework, Adv. Stat. Clim. Meteorol. Oceanogr., 5, 115–132, <ext-link xlink:href="https://doi.org/10.5194/ascmo-5-115-2019" ext-link-type="DOI">10.5194/ascmo-5-115-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Lerch and Baran(2017)</label><?label lerch.baran:2017?><mixed-citation>Lerch, S. and Baran, S.: Similarity-based semilocal estimation of
post-processing models, J. R. Stat. Soc. C-Appl., 66, 29–51, <ext-link xlink:href="https://doi.org/10.1111/rssc.12153" ext-link-type="DOI">10.1111/rssc.12153</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Messner et al.(2016)Messner, Mayr, and Zeileis</label><?label messner.etal:2016a?><mixed-citation>Messner, J. W., Mayr, G. J., and Zeileis, A.: Heteroscedastic Censored and
Truncated Regression with crch, R J., 8, 173–181,
<ext-link xlink:href="https://doi.org/10.32614/RJ-2016-012" ext-link-type="DOI">10.32614/RJ-2016-012</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx19"><?xmltex \def\ref@label{{M{\"{o}}ller et~al.(2018)M{\"{o}}ller, Spazzini, Kraus, Nagler, and
Czado}}?><label>Möller et al.(2018)Möller, Spazzini, Kraus, Nagler, and
Czado</label><?label moller.etal:2018?><mixed-citation>Möller, A., Spazzini, L., Kraus, D., Nagler, T., and Czado, C.: Vine Copula
Based Post-Processing of Ensemble Forecasts for Temperature, arXiv
1811.02255, arXiv.org E-Print Archive,
available at: <uri>http://arxiv.org/abs/1811.02255</uri> (last access: 10 December 2019), 2018.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>NASA JPL(2013)</label><?label nasa-jpl:2013?><mixed-citation>NASA JPL: NASA Shuttle Radar Topography Mission Global 30 Arc Second [Data
Set], NASA EOSDIS Land Processes DAAC,
<ext-link xlink:href="https://doi.org/10.5067/MEaSUREs/SRTM/SRTMGL30.002" ext-link-type="DOI">10.5067/MEaSUREs/SRTM/SRTMGL30.002</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Palmer(2002)</label><?label palmer:2002?><mixed-citation>Palmer, T. N.: The Economic Value of Ensemble Forecasts as a Tool for Risk
Assessment: From Days to Decades, Q. J. Roy. Meteor. Soc., 128, 747–774, <ext-link xlink:href="https://doi.org/10.1256/0035900021643593" ext-link-type="DOI">10.1256/0035900021643593</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Pantillon et al.(2018)Pantillon, Lerch, Knippertz, and
Corsmeier</label><?label pantillon.etal:2018?><mixed-citation>Pantillon, F., Lerch, S., Knippertz, P., and Corsmeier, U.: Forecasting Wind
Gusts in Winter Storms Using a Calibrated Convection-Permitting Ensemble,
Q. J. Roy. Meteor. Soc., 144, 1864–1881,
<ext-link xlink:href="https://doi.org/10.1002/qj.3380" ext-link-type="DOI">10.1002/qj.3380</ext-link>, 2018.</mixed-citation></ref>
      <?pagebreak page34?><ref id="bib1.bibx23"><label>Rasp and Lerch(2018)</label><?label rasp.lerch:2018?><mixed-citation>Rasp, S. and Lerch, S.: Neural Networks for Postprocessing Ensemble Weather
Forecasts, Mon. Weather Rev., 146, 3885–3900,
<ext-link xlink:href="https://doi.org/10.1175/MWR-D-18-0187.1" ext-link-type="DOI">10.1175/MWR-D-18-0187.1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>R Core Team(2019)</label><?label rcoreteam:2019?><mixed-citation>R Core Team: R: A Language and Environment for Statistical Computing, R
Foundation for Statistical Computing, Vienna, Austria,
available at: <uri>https://www.R-project.org/</uri>, last access: 10 December 2019.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Rodwell et al.(2018)Rodwell, Richardson, Parsons, and
Wernli</label><?label rodwell.etal:2018?><mixed-citation>Rodwell, M. J., Richardson, D. S., Parsons, D. B., and Wernli, H.:
Flow-dependent reliability: A path to more skillful ensemble forecasts,
B. Am. Meteorol. Soc., 99, 1015–1026,
<ext-link xlink:href="https://doi.org/10.1175/BAMS-D-17-0027.1" ext-link-type="DOI">10.1175/BAMS-D-17-0027.1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Scheuerer(2014)</label><?label scheuerer:2014?><mixed-citation>Scheuerer, M.: Probabilistic Quantitative Precipitation Forecasting Using
Ensemble Model Output Statistics, Q. J. Roy. Meteor. Soc., 140, 1086–1096, <ext-link xlink:href="https://doi.org/10.1002/qj.2183" ext-link-type="DOI">10.1002/qj.2183</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Schlosser et al.(2019)Schlosser, Hothorn, Stauffer, and
Zeileis</label><?label schlosser.etal:2019?><mixed-citation>Schlosser, L., Hothorn, T., Stauffer, R., and Zeileis, A.: Distributional
Regression Forests for Probabilistic Precipitation Forecasting in Complex
Terrain, Ann. Appl. Stat., 13, 1564–1589,
<ext-link xlink:href="https://doi.org/10.1214/19-AOAS1247" ext-link-type="DOI">10.1214/19-AOAS1247</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Taillardat et al.(2016)Taillardat, Mestre, Zamo, and
Naveau</label><?label taillardat.etal:2016?><mixed-citation>Taillardat, M., Mestre, O., Zamo, M., and Naveau, P.: Calibrated Ensemble
Forecasts Using Quantile Regression Forests and Ensemble Model Output
Statistics, Mon. Weather Rev., 144, 2375–2393,
<ext-link xlink:href="https://doi.org/10.1175/mwr-d-15-0260.1" ext-link-type="DOI">10.1175/mwr-d-15-0260.1</ext-link>, 2016.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx29"><label>Umlauf et al.(2018)Umlauf, Klein, and Zeileis</label><?label umlauf.etal:2018?><mixed-citation>Umlauf, N., Klein, N., and Zeileis, A.: BAMLSS: Bayesian Additive Models
for Location, Scale, and Shape (and Beyond),
J. Comput. Graph. Stat., 27, 612–627, <ext-link xlink:href="https://doi.org/10.1080/10618600.2017.1407325" ext-link-type="DOI">10.1080/10618600.2017.1407325</ext-link>,
2018.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Vogel et al.(2018)Vogel, Knippertz, Fink, Schlueter, and
Gneiting</label><?label vogel.etal:2018?><mixed-citation>Vogel, P., Knippertz, P., Fink, A. H., Schlueter, A., and Gneiting, T.: Skill
of Global Raw and Postprocessed Ensemble Predictions of Rainfall over
Northern Tropical Africa, Weather Forecast., 33, 369–388,
<ext-link xlink:href="https://doi.org/10.1175/waf-d-17-0127.1" ext-link-type="DOI">10.1175/waf-d-17-0127.1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Wilson et al.(2007)Wilson, Beauregard, Raftery, and
Verret</label><?label wilson.etal:2007?><mixed-citation>Wilson, L. J., Beauregard, S., Raftery, A. E., and Verret, R.: Calibrated
Surface Temperature Forecasts from the Canadian Ensemble Prediction System
Using Bayesian Model Averaging, Mon. Weather Rev., 135, 1364–1385,
<ext-link xlink:href="https://doi.org/10.1175/MWR3347.1" ext-link-type="DOI">10.1175/MWR3347.1</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Wood(2017)</label><?label wood:2017?><mixed-citation>Wood, S. N.: Generalized Additive Models: An Introduction with R, Chapman and
Hall/CRC, <ext-link xlink:href="https://doi.org/10.1201/9781315370279" ext-link-type="DOI">10.1201/9781315370279</ext-link>, 2017.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Remember the past: a comparison of time-adaptive training schemes for non-homogeneous regression</article-title-html>
<abstract-html><p>Non-homogeneous regression is a frequently used post-processing method for
increasing the predictive skill of probabilistic ensemble weather
forecasts. To adjust for seasonally varying error characteristics between
ensemble forecasts and corresponding observations, different time-adaptive
training schemes, including the classical sliding training window, have
been developed for non-homogeneous regression. This study compares three
such training approaches with the sliding-window approach for the
application of post-processing near-surface air temperature forecasts
across central Europe. The predictive performance is evaluated conditional
on three different groups of stations located in plains, in mountain
foreland, and within mountainous terrain, as well as on a specific change
in the ensemble forecast system of the European Centre for Medium-Range
Weather Forecasts (ECMWF) used as input for the post-processing.</p><p>The results show that time-adaptive training schemes using data over multiple
years stabilize the temporal evolution of the coefficient estimates, yielding
an increased predictive performance for all station types tested compared to
the classical sliding-window approach based on the most recent days only. While
this may not be surprising under fully stable model conditions, it is shown
that <q>remembering the past</q> from multiple years of training data is typically
also superior to the classical sliding-window approach when the ensemble prediction
system is affected by certain model changes. Thus, reducing the variance of the
non-homogeneous regression estimates due to increased training data appears to
be more important than reducing its bias by adapting rapidly to the most
current training data only.</p></abstract-html>
<ref-html id="bib1.bib1"><label>Baran and Möller(2017)</label><mixed-citation>
Baran, S. and Möller, A.: Bivariate Ensemble Model Output Statistics
Approach for Joint Forecasting of Wind Speed and Temperature,
Meteorol. Atmos. Phys., 129, 99–112, <a href="https://doi.org/10.1007/s00703-016-0467-8" target="_blank">https://doi.org/10.1007/s00703-016-0467-8</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Barnes et al.(2019)Barnes, Brierley, and Chandler</label><mixed-citation>
Barnes, C., Brierley, C. M., and Chandler, R. E.: New approaches to
postprocessing of multi-model ensemble forecasts,
Q. J. Roy. Meteor. Soc., 145, 3479–3498, <a href="https://doi.org/10.1002/qj.3632" target="_blank">https://doi.org/10.1002/qj.3632</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Demaeyer and Vannitsem(2019)</label><mixed-citation>
Demaeyer, J. and Vannitsem, S.: Correcting for Model Changes in Statistical Post-Processing – An approach based on Response Theory, Nonlin. Processes Geophys. Discuss., <a href="https://doi.org/10.5194/npg-2019-57" target="_blank">https://doi.org/10.5194/npg-2019-57</a>, in review, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Gebetsberger et al.(2018)Gebetsberger, Messner, Mayr, and
Zeileis</label><mixed-citation>
Gebetsberger, M., Messner, J. W., Mayr, G. J., and Zeileis, A.: Estimation
Methods for Nonhomogeneous Regression Models: Minimum Continuous Ranked
Probability Score versus Maximum Likelihood, Mon. Weather Rev., 146,
4323–4338, <a href="https://doi.org/10.1175/MWR-D-17-0364.1" target="_blank">https://doi.org/10.1175/MWR-D-17-0364.1</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Gneiting and Katzfuss(2014)</label><mixed-citation>
Gneiting, T. and Katzfuss, M.: Probabilistic Forecasting,
Annu. Rev. Stat. Appl., 1, 125–151,
<a href="https://doi.org/10.1146/annurev-statistics-062713-085831" target="_blank">https://doi.org/10.1146/annurev-statistics-062713-085831</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Gneiting and Raftery(2007)</label><mixed-citation>
Gneiting, T. and Raftery, A. E.: Strictly Proper Scoring Rules, Prediction, and
Estimation, J. Am. Stat. Assoc., 102, 359–378,
<a href="https://doi.org/10.1198/016214506000001437" target="_blank">https://doi.org/10.1198/016214506000001437</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Gneiting et al.(2005)Gneiting, Raftery, Westveld III, and
Goldman</label><mixed-citation>
Gneiting, T., Raftery, A. E., Westveld III, A. H., and Goldman, T.: Calibrated
Probabilistic Forecasting Using Ensemble Model Output Statistics and Minimum
CRPS Estimation, Mon. Weather Rev., 133, 1098–1118,
<a href="https://doi.org/10.1175/MWR2904.1" target="_blank">https://doi.org/10.1175/MWR2904.1</a>, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Hamill(2018)</label><mixed-citation>
Hamill, T. M.: Practical Aspects of Statistical Postprocessing, in: Statistical
Postprocessing of Ensemble Forecasts, edited by: Vannitsem, S., Wilks, D. S.,
and Messner, J. W.,  187–217, Elsevier, <a href="https://doi.org/10.1016/C2016-0-03244-8" target="_blank">https://doi.org/10.1016/C2016-0-03244-8</a>,
2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Hamill et al.(2008)Hamill, Hagedorn, and Whitaker</label><mixed-citation>
Hamill, T. M., Hagedorn, R., and Whitaker, J. S.: Probabilistic forecast
calibration using ECMWF and GFS ensemble reforecasts. Part II:
Precipitation, Mon. Weather Rev., 136, 2620–2632,
<a href="https://doi.org/10.1175/2007MWR2411.1" target="_blank">https://doi.org/10.1175/2007MWR2411.1</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Hastie and Tibshirani(1986)</label><mixed-citation>
Hastie, T. and Tibshirani, R.: Generalized Additive Models, Stat.
Sci., 1, 297–310,
1986.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Hemri et al.(2016)Hemri, Haiden, and Pappenberger</label><mixed-citation>
Hemri, S., Haiden, T., and Pappenberger, F.: Discrete Postprocessing of Total
Cloud Cover Ensemble Forecasts, Mon. Weather Rev., 144, 2565–2577,
<a href="https://doi.org/10.1175/mwr-d-15-0426.1" target="_blank">https://doi.org/10.1175/mwr-d-15-0426.1</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Henzi et al.(2019)Henzi, Ziegel, and Gneiting</label><mixed-citation>
Henzi, A., Ziegel, J. F., and Gneiting, T.: Isotonic Distributional Regression,
arXiv 1909.03725, arXiv.org E-Print Archive,
available at: <a href="http://arxiv.org/abs/1909.03725" target="_blank"/>, last access: 10 December 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Jordan et al.(2019)Jordan, Krüger, and Lerch</label><mixed-citation>
Jordan, A., Krüger, F., and Lerch, S.: Evaluating Probabilistic Forecasts
with scoringRules, J. Stat. Softw., 90, 1–37,
<a href="https://doi.org/10.18637/jss.v090.i12" target="_blank">https://doi.org/10.18637/jss.v090.i12</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Junk et al.(2015)Junk, Monache, and Alessandrini</label><mixed-citation>
Junk, C., Monache, L. D., and Alessandrini, S.: Analog-Based Ensemble Model
Output Statistics, Mon. Weather Rev., 143, 2909–2917,
<a href="https://doi.org/10.1175/mwr-d-15-0095.1" target="_blank">https://doi.org/10.1175/mwr-d-15-0095.1</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Klein et al.(2014)Klein, Kneib, Klasen, and Lang</label><mixed-citation>
Klein, N., Kneib, T., Klasen, S., and Lang, S.: Bayesian Structured Additive
Distributional Regression for Multivariate Responses,
J. R. Stat. Soc. C-Appl., 64, 569–591,
<a href="https://doi.org/10.1111/rssc.12090" target="_blank">https://doi.org/10.1111/rssc.12090</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Lang et al.(2019)Lang, Mayr, Stauffer, and Zeileis</label><mixed-citation>
Lang, M. N., Mayr, G. J., Stauffer, R., and Zeileis, A.: Bivariate Gaussian models for wind vectors in a distributional regression framework, Adv. Stat. Clim. Meteorol. Oceanogr., 5, 115–132, <a href="https://doi.org/10.5194/ascmo-5-115-2019" target="_blank">https://doi.org/10.5194/ascmo-5-115-2019</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Lerch and Baran(2017)</label><mixed-citation>
Lerch, S. and Baran, S.: Similarity-based semilocal estimation of
post-processing models, J. R. Stat. Soc. C-Appl., 66, 29–51, <a href="https://doi.org/10.1111/rssc.12153" target="_blank">https://doi.org/10.1111/rssc.12153</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Messner et al.(2016)Messner, Mayr, and Zeileis</label><mixed-citation>
Messner, J. W., Mayr, G. J., and Zeileis, A.: Heteroscedastic Censored and
Truncated Regression with crch, R J., 8, 173–181,
<a href="https://doi.org/10.32614/RJ-2016-012" target="_blank">https://doi.org/10.32614/RJ-2016-012</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Möller et al.(2018)Möller, Spazzini, Kraus, Nagler, and
Czado</label><mixed-citation>
Möller, A., Spazzini, L., Kraus, D., Nagler, T., and Czado, C.: Vine Copula
Based Post-Processing of Ensemble Forecasts for Temperature, arXiv
1811.02255, arXiv.org E-Print Archive,
available at: <a href="http://arxiv.org/abs/1811.02255" target="_blank"/> (last access: 10 December 2019), 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>NASA JPL(2013)</label><mixed-citation>
NASA JPL: NASA Shuttle Radar Topography Mission Global 30 Arc Second [Data
Set], NASA EOSDIS Land Processes DAAC,
<a href="https://doi.org/10.5067/MEaSUREs/SRTM/SRTMGL30.002" target="_blank">https://doi.org/10.5067/MEaSUREs/SRTM/SRTMGL30.002</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Palmer(2002)</label><mixed-citation>
Palmer, T. N.: The Economic Value of Ensemble Forecasts as a Tool for Risk
Assessment: From Days to Decades, Q. J. Roy. Meteor. Soc., 128, 747–774, <a href="https://doi.org/10.1256/0035900021643593" target="_blank">https://doi.org/10.1256/0035900021643593</a>, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Pantillon et al.(2018)Pantillon, Lerch, Knippertz, and
Corsmeier</label><mixed-citation>
Pantillon, F., Lerch, S., Knippertz, P., and Corsmeier, U.: Forecasting Wind
Gusts in Winter Storms Using a Calibrated Convection-Permitting Ensemble,
Q. J. Roy. Meteor. Soc., 144, 1864–1881,
<a href="https://doi.org/10.1002/qj.3380" target="_blank">https://doi.org/10.1002/qj.3380</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Rasp and Lerch(2018)</label><mixed-citation>
Rasp, S. and Lerch, S.: Neural Networks for Postprocessing Ensemble Weather
Forecasts, Mon. Weather Rev., 146, 3885–3900,
<a href="https://doi.org/10.1175/MWR-D-18-0187.1" target="_blank">https://doi.org/10.1175/MWR-D-18-0187.1</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>R Core Team(2019)</label><mixed-citation>
R Core Team: R: A Language and Environment for Statistical Computing, R
Foundation for Statistical Computing, Vienna, Austria,
available at: <a href="https://www.R-project.org/" target="_blank"/>, last access: 10 December 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Rodwell et al.(2018)Rodwell, Richardson, Parsons, and
Wernli</label><mixed-citation>
Rodwell, M. J., Richardson, D. S., Parsons, D. B., and Wernli, H.:
Flow-dependent reliability: A path to more skillful ensemble forecasts,
B. Am. Meteorol. Soc., 99, 1015–1026,
<a href="https://doi.org/10.1175/BAMS-D-17-0027.1" target="_blank">https://doi.org/10.1175/BAMS-D-17-0027.1</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Scheuerer(2014)</label><mixed-citation>
Scheuerer, M.: Probabilistic Quantitative Precipitation Forecasting Using
Ensemble Model Output Statistics, Q. J. Roy. Meteor. Soc., 140, 1086–1096, <a href="https://doi.org/10.1002/qj.2183" target="_blank">https://doi.org/10.1002/qj.2183</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Schlosser et al.(2019)Schlosser, Hothorn, Stauffer, and
Zeileis</label><mixed-citation>
Schlosser, L., Hothorn, T., Stauffer, R., and Zeileis, A.: Distributional
Regression Forests for Probabilistic Precipitation Forecasting in Complex
Terrain, Ann. Appl. Stat., 13, 1564–1589,
<a href="https://doi.org/10.1214/19-AOAS1247" target="_blank">https://doi.org/10.1214/19-AOAS1247</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Taillardat et al.(2016)Taillardat, Mestre, Zamo, and
Naveau</label><mixed-citation>
Taillardat, M., Mestre, O., Zamo, M., and Naveau, P.: Calibrated Ensemble
Forecasts Using Quantile Regression Forests and Ensemble Model Output
Statistics, Mon. Weather Rev., 144, 2375–2393,
<a href="https://doi.org/10.1175/mwr-d-15-0260.1" target="_blank">https://doi.org/10.1175/mwr-d-15-0260.1</a>, 2016.

</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Umlauf et al.(2018)Umlauf, Klein, and Zeileis</label><mixed-citation>
Umlauf, N., Klein, N., and Zeileis, A.: BAMLSS: Bayesian Additive Models
for Location, Scale, and Shape (and Beyond),
J. Comput. Graph. Stat., 27, 612–627, <a href="https://doi.org/10.1080/10618600.2017.1407325" target="_blank">https://doi.org/10.1080/10618600.2017.1407325</a>,
2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Vogel et al.(2018)Vogel, Knippertz, Fink, Schlueter, and
Gneiting</label><mixed-citation>
Vogel, P., Knippertz, P., Fink, A. H., Schlueter, A., and Gneiting, T.: Skill
of Global Raw and Postprocessed Ensemble Predictions of Rainfall over
Northern Tropical Africa, Weather Forecast., 33, 369–388,
<a href="https://doi.org/10.1175/waf-d-17-0127.1" target="_blank">https://doi.org/10.1175/waf-d-17-0127.1</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Wilson et al.(2007)Wilson, Beauregard, Raftery, and
Verret</label><mixed-citation>
Wilson, L. J., Beauregard, S., Raftery, A. E., and Verret, R.: Calibrated
Surface Temperature Forecasts from the Canadian Ensemble Prediction System
Using Bayesian Model Averaging, Mon. Weather Rev., 135, 1364–1385,
<a href="https://doi.org/10.1175/MWR3347.1" target="_blank">https://doi.org/10.1175/MWR3347.1</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Wood(2017)</label><mixed-citation>
Wood, S. N.: Generalized Additive Models: An Introduction with R, Chapman and
Hall/CRC, <a href="https://doi.org/10.1201/9781315370279" target="_blank">https://doi.org/10.1201/9781315370279</a>, 2017.
</mixed-citation></ref-html>--></article>
