<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" dtd-version="3.0">
  <front>
    <journal-meta>
<journal-id journal-id-type="publisher">NPG</journal-id>
<journal-title-group>
<journal-title>Nonlinear Processes in Geophysics</journal-title>
<abbrev-journal-title abbrev-type="publisher">NPG</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">Nonlin. Processes Geophys.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1607-7946</issn>
<publisher><publisher-name>Copernicus GmbH</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>

    <article-meta>
      <article-id pub-id-type="doi">10.5194/npg-22-403-2015</article-id><title-group><article-title>Verification against perturbed analyses and observations</article-title>
      </title-group><?xmltex \runningtitle{Verification against perturbed analyses and observations}?><?xmltex \runningauthor{N.~E.~Bowler et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Bowler</surname><given-names>N. E.</given-names></name>
          <email>neill.bowler@metoffice.gov.uk</email>
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Cullen</surname><given-names>M. J. P.</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Piccolo</surname><given-names>C.</given-names></name>
          
        </contrib>
        <aff id="aff1"><institution>Met Office, Fitzroy Road, Exeter, EX1 3PB, UK</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">N. E. Bowler (neill.bowler@metoffice.gov.uk)</corresp></author-notes><pub-date><day>24</day><month>July</month><year>2015</year></pub-date>
      
      <volume>22</volume>
      <issue>4</issue>
      <fpage>403</fpage><lpage>411</lpage>
      <history>
        <date date-type="received"><day>6</day><month>December</month><year>2013</year></date>
           <date date-type="rev-recd"><day>13</day><month>April</month><year>2015</year></date>
           <date date-type="accepted"><day>15</day><month>June</month><year>2015</year></date>
      </history>
      <permissions>
<license license-type="open-access">
<license-p>This work is licensed under a Creative Commons Attribution 3.0 Unported License. To view a copy of this license, visit <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/3.0/">http://creativecommons.org/licenses/by/3.0/</ext-link></license-p>
</license>
</permissions><self-uri xlink:href="https://npg.copernicus.org/articles/22/403/2015/npg-22-403-2015.html">This article is available from https://npg.copernicus.org/articles/22/403/2015/npg-22-403-2015.html</self-uri>
<self-uri xlink:href="https://npg.copernicus.org/articles/22/403/2015/npg-22-403-2015.pdf">The full text article is available as a PDF file from https://npg.copernicus.org/articles/22/403/2015/npg-22-403-2015.pdf</self-uri>


      <abstract>
    <p>It has long been known that verification of a forecast against the sequence
of analyses used to produce those forecasts can under-estimate the magnitude
of forecast errors. Here we show that under certain conditions the
verification of a short-range forecast against a perturbed analysis coming
from an ensemble data assimilation scheme can give the same root-mean-square
error as verification against the truth. This means that a perturbed analysis
can be used as a reliable proxy for the truth. However, the conditions
required for this result to hold are rather restrictive: the analysis must be
optimal, the ensemble spread must be equal to the error in the mean, the
ensemble size must be large and the forecast being verified must be the
background forecast used in the data assimilation. Although these criteria
are unlikely to be met exactly, it becomes clear that in most cases
verification against a perturbed analysis gives better results than
verification against an unperturbed analysis.</p>
    <p>We demonstrate the application of these results in an idealised model
framework and a numerical weather prediction context. In deriving this result
we recall that an optimal (Kalman) analysis is one for which the analysis
increments are uncorrelated with the analysis errors.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <title>Introduction</title>
      <p>Verification is an important part of developing a forecasting system. Any
change to the forecasting system should be tested to demonstrate that the
forecasts are genuinely improved. Each forecast is
typically launched from an analysis state which is a combination of
observations with a previous short-range forecast from the system. A common
practice is to use the analysis from such a system as the truth against which
to verify (for instance see <xref ref-type="bibr" rid="bib1.bibx5" id="altparen.1"/>). Since each analysis depends
on the forecasts from previous cycles this is a dangerous practice,
particularly at short forecast lead times <xref ref-type="bibr" rid="bib1.bibx3" id="paren.2"/>. Nonetheless
the convenience of performing verification against a state which is available
on the model grid means that this remains a common practice with its
attendant problems (as observed in <xref ref-type="bibr" rid="bib1.bibx8" id="altparen.3"/>).</p>
      <p>One solution to the problem of verification against analyses is to verify
forecasts against observations. The observations do not depend on the
forecast, and so provide an independent measurement of the true state of the
system<fn id="Ch1.Footn1"><p>Although any time correlation in observation errors can
create a correlation between forecast and observation errors.</p></fn>. However,
observations themselves are contaminated by errors. Methods exist to account
for the effect of these errors on verification statistics
<xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx17 bib1.bibx2 bib1.bibx6" id="paren.4"/>. These errors
are often poorly known, however, so accounting for their effect is difficult.
Additionally, there are often few conventional observations over the oceans,
which means that verification statistics can be blind to these areas.</p>
      <p>As an alternative solution to these problems, we offer the idea of performing
the verification against a perturbed analysis.</p>
</sec>
<sec id="Ch1.S2">
  <title>Verification against perturbed analysis</title>
      <p>We are looking to verify a forecast <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> using the
root-mean-square (RMS) error. This forecast is a single realisation, and so
could either be a forecast from a deterministic system or an ensemble mean
forecast. Ideally one would verify this forecast against the true state of
the system <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula>, but this state is generally unknown.
Given that the truth is unknown, we choose to verify instead against some other state,
in this case an analysis. We consider that rather than having a single
analysis we have an ensemble of analyses and verify against a randomly chosen
analysis ensemble member. We assume that the analysis ensemble represents its
own errors correctly. Since we are considering mean-square errors, we
only need this last statement to hold to second order; that is, we require
that

              <disp-formula id="Ch1.E1" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="bold-italic">x</mml:mi></mml:mrow></mml:math></inline-formula> denotes the inner product
(<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mtext>T</mml:mtext></mml:msup></mml:math></inline-formula> indicates the matrix transpose), and the angle brackets
<inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>.</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula> indicate the average over a large number of cases. The
ensemble states are denoted by <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup></mml:mrow></mml:math></inline-formula> where <inline-formula><mml:math display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> is the ensemble
member number and the overbar (<inline-formula><mml:math display="inline"><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:math></inline-formula>) indicates the
ensemble mean.</p>
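      <p>As an illustration of Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>), the following minimal sketch estimates both sides of the equation for a toy scalar ensemble. It rests on an assumption that is not part of the text above: the truth and every analysis member are taken to be independent draws from the same Gaussian distribution. The function name <monospace>spread_and_error</monospace> and its parameters are hypothetical and chosen only for this example.</p>
<preformat preformat-type="code" xml:space="preserve">
```python
import random


def spread_and_error(n_members, n_cases, sigma=1.0, seed=0):
    """Estimate both sides of Eq. (1) for a toy scalar ensemble.

    Assumption (illustrative only): the truth and every analysis member
    are independent draws from the same Gaussian, so they are exchangeable.
    """
    rng = random.Random(seed)
    err_sum = 0.0     # accumulates |mean analysis - truth|^2
    spread_sum = 0.0  # accumulates |chosen member - mean analysis|^2
    for _ in range(n_cases):
        truth = rng.gauss(0.0, sigma)
        members = [rng.gauss(0.0, sigma) for _ in range(n_members)]
        mean = sum(members) / n_members
        chosen = members[0]  # members are exchangeable, so any index will do
        err_sum += (mean - truth) ** 2
        spread_sum += (chosen - mean) ** 2
    # Left-hand side and right-hand side of Eq. (1), averaged over cases
    return err_sum / n_cases, spread_sum / n_cases


if __name__ == "__main__":
    for n in (2, 10, 50):
        err, spread = spread_and_error(n_members=n, n_cases=20000)
        print(n, round(err, 3), round(spread, 3))
```
</preformat>
      <p>In this toy setting the mean-square error of the ensemble mean is larger than the variance of the draws by a factor (1 + 1/N), and the mean-square spread about the sample mean is smaller by a factor (1 - 1/N), where N is the number of members. The two sides of Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) therefore agree only as N becomes large.</p>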
      <p>Given the above definitions we consider the RMS error calculated against a
perturbed analysis, that is a randomly chosen member of the analysis
ensemble. The mean-square error of the forecast against this analysis is

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>-</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn 
mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>+</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E2"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          In this case we are considering the verification against a given, chosen
ensemble member <inline-formula><mml:math display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, not against each ensemble member in turn. However, since
all ensemble members are typically exchangeable, this distinction is not
important. We do not include a time index in this notation since all
quantities are valid at the same time.</p>
      <p>To continue the analysis, we consider that there exists the truth state,
<inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula>, against which we would ideally conduct the
verification. Using this we expand one of the terms appearing on the
right-hand side of Eq. (<xref ref-type="disp-formula" rid="Ch1.E2"/>):

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>-</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>+</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo 
mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E3"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          Combining Eqs. (<xref ref-type="disp-formula" rid="Ch1.E2"/>) and (<xref ref-type="disp-formula" rid="Ch1.E3"/>), we find that

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>+</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn 
mathvariant="normal">2</mml:mn><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E4"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          The last term in this equation can be further re-arranged:

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>+</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo 
mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E5"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula></p>
      <p>We have previously assumed that the ensemble of analyses is ideal
(Eq. <xref ref-type="disp-formula" rid="Ch1.E1"/>). Using this assumption and substituting
Eq. (<xref ref-type="disp-formula" rid="Ch1.E5"/>) into Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>), various terms
cancel and we find
<?xmltex \hack{\newpage}?><?xmltex \hack{\vspace*{-8mm}}?>

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E6"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn 
mathvariant="normal">2</mml:mn><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          So, if the last two terms in this equation are zero (or cancel), then we
would expect that verifying against a perturbed analysis would give the same
result as verification against the truth.</p>
      <p>In the second-to-last term, the second bracket is the difference between a
random analysis ensemble member and the ensemble mean. If this term were
averaged over all possible choices of the random member, it would be exactly
zero, since the mean of the second bracket would be precisely zero. If all
the ensemble members are statistically equivalent to each other, then this
term should vanish provided the number of cases is large enough.</p>
      <p>If the final term also vanishes, then we can consider that the
data-assimilation system is in some sense optimal. If the final term were not
zero, then it would be possible to make the ensemble mean analysis closer to
the truth by post-processing it using the difference
<inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula>. A
statistically optimal analysis will not benefit from post-processing in this
way because it is by design as close to the truth as possible, and so the
final term must also be zero. This is a somewhat different definition of an
“optimal” data assimilation scheme from the usual one. This difference is
explored in more detail in Sect. <xref ref-type="sec" rid="Ch1.S4"/>.</p>
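      <p>The dependence of the final term of Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) on the optimality of the analysis can be sketched numerically. The example below is a toy illustration under assumptions that are not part of the derivation: a scalar state, Gaussian background and observation errors with variances <monospace>b_var</monospace> and <monospace>r_var</monospace>, an observation operator equal to one, and a forecast that is the background used in the analysis. The function name <monospace>final_cross_term</monospace> is hypothetical. It returns the averaged product of the analysis increment and the analysis error, without the factor of -2.</p>
<preformat preformat-type="code" xml:space="preserve">
```python
import random


def final_cross_term(gain, b_var=1.0, r_var=1.0, n_cases=200000, seed=2):
    """Estimate the average of (x_f - x_a) * (x_a - x_t) for a scalar analysis.

    Assumptions (illustrative only): scalar state, Gaussian errors,
    observation operator equal to 1, forecast equal to the background.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_cases):
        truth = 0.0
        forecast = truth + rng.gauss(0.0, b_var ** 0.5)
        obs = truth + rng.gauss(0.0, r_var ** 0.5)
        # Linear analysis update with a prescribed (possibly suboptimal) gain
        analysis = forecast + gain * (obs - forecast)
        total += (forecast - analysis) * (analysis - truth)
    return total / n_cases


if __name__ == "__main__":
    optimal = 1.0 / (1.0 + 1.0)  # b_var / (b_var + r_var) for unit variances
    print("optimal gain:", round(final_cross_term(optimal), 3))
    print("gain 0.8:", round(final_cross_term(0.8), 3))
```
</preformat>
      <p>For this scalar setup the expected value of the product is -g (g R - (1 - g) B), where g is the gain and B and R are the background and observation error variances. It vanishes only at the Kalman gain g = B / (B + R). With unit variances a gain of 0.8 gives -0.48, so the increments then carry information about the analysis error that post-processing could exploit.</p>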
      <p>Therefore, we conclude that verification against a perturbed analysis will
give the same RMS error as verification against the truth if the analysis
ensemble is ideal (the spread equals the error of the mean analysis) and the
analysis is statistically optimal (could not be improved by simple
post-processing). In a sense Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) is a simple
result, since we have assumed that the analysis ensemble correctly represents
the errors in the ensemble mean analysis. However, this re-arrangement allows
us to see that all that is required for a perturbed analysis to be a good proxy
for the truth is for two cross-terms to be zero. The first of these is
straightforwardly zero; the condition for the second to be zero is more
challenging, as will be seen below.</p>
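      <p>The conclusion of this section can be checked with a minimal Monte Carlo sketch. The setup is an assumption made for illustration and is not the experiment of this paper: a scalar state, Gaussian errors, an observation operator equal to one, an optimal scalar Kalman gain, a forecast that is the background, and a perturbed analysis formed by adding noise with the exact analysis-error variance to the mean analysis (mimicking a large, ideal ensemble). The function name <monospace>verification_scores</monospace> is hypothetical.</p>
<preformat preformat-type="code" xml:space="preserve">
```python
import random


def verification_scores(b_var=1.0, r_var=1.0, n_cases=200000, seed=1):
    """Toy scalar check of Eq. (6) with an optimal (Kalman) analysis.

    Returns the mean-square difference of the forecast from the truth,
    from the mean (unperturbed) analysis and from a perturbed analysis.
    All modelling choices here are illustrative assumptions.
    """
    rng = random.Random(seed)
    gain = b_var / (b_var + r_var)  # optimal scalar Kalman gain
    a_var = (1.0 - gain) * b_var    # analysis-error variance
    mse_truth = 0.0
    mse_mean = 0.0
    mse_pert = 0.0
    for _ in range(n_cases):
        truth = 0.0
        forecast = truth + rng.gauss(0.0, b_var ** 0.5)
        obs = truth + rng.gauss(0.0, r_var ** 0.5)
        mean_analysis = forecast + gain * (obs - forecast)
        # Perturbed analysis: spread equals the error of the mean analysis
        pert_analysis = mean_analysis + rng.gauss(0.0, a_var ** 0.5)
        mse_truth += (forecast - truth) ** 2
        mse_mean += (forecast - mean_analysis) ** 2
        mse_pert += (forecast - pert_analysis) ** 2
    return mse_truth / n_cases, mse_mean / n_cases, mse_pert / n_cases


if __name__ == "__main__":
    vs_truth, vs_mean, vs_pert = verification_scores()
    print("against truth:", round(vs_truth, 3))
    print("against mean analysis:", round(vs_mean, 3))
    print("against perturbed analysis:", round(vs_pert, 3))
```
</preformat>
      <p>With unit background and observation error variances, the analytic values for this toy setup are 1 against the truth, 0.5 against the mean analysis and 1 against the perturbed analysis. Verification against the unperturbed analysis thus under-estimates the forecast error, while verification against the perturbed analysis recovers it.</p>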
</sec>
<sec id="Ch1.S3">
  <title>Verification against perturbed observations</title>
      <p>It might be thought that, because a true observation is statistically
indistinguishable from a random member of a set of perturbed observations,
verification against perturbed observations would also be equivalent to
verification against the truth. However, we show that this is not the case.</p>
      <p>Consider the final term in Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>). If we replace the
references to the analysis with the observations, then this term becomes
<inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="bold-italic">y</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="bold">H</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula>
where <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> are the observations and
<inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">H</mml:mi></mml:math></inline-formula> is the observation operator, which we will assume to be linear
for simplicity. Now, we choose to define the observation using <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula>,
its departure from the truth:

              <disp-formula id="Ch1.E7" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo>+</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        Using this definition, we find

              <disp-formula id="Ch1.E8" content-type="numbered"><mml:math display="block"><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{9}{9}\selectfont$\displaystyle}?><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo><mml:mo>.</mml:mo><?xmltex \hack{$\egroup}?></mml:mrow></mml:math></disp-formula>

        If we assume that forecast and observation errors are uncorrelated, then this
reduces to

              <disp-formula id="Ch1.E9" content-type="numbered"><mml:math display="block"><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{9.5}{9.5}\selectfont$\displaystyle}?><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">Tr</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold">R</mml:mi><mml:mo>)</mml:mo><?xmltex \hack{$\egroup}?><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        which is minus the trace of the observation-error covariance matrix and is
non-zero whenever the observations contain errors. Therefore
verification against perturbed observations will not give the same result as
verification against the truth.</p>
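      <p>A numerical check of Eq. (<xref ref-type="disp-formula" rid="Ch1.E9"/>) is
sketched below. It is an illustration rather than part of the derivation: the
matrices, sample size and function name are arbitrary choices, and the forecast
and observation errors are drawn as independent Gaussian samples so that the
uncorrelated-error assumption holds by construction.</p>
<preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np

def cross_term(h, b_cov, r_cov, n=200_000, seed=1):
    """Sample average of (H x_f - y)^T (y - H x_t) over n independent cases."""
    rng = np.random.default_rng(seed)
    n_state, n_obs = b_cov.shape[0], r_cov.shape[0]
    e_f = rng.multivariate_normal(np.zeros(n_state), b_cov, size=n)  # forecast errors x_f - x_t
    e_o = rng.multivariate_normal(np.zeros(n_obs), r_cov, size=n)    # observation errors
    # H x_f - y = H (x_f - x_t) - e_o, and y - H x_t = e_o
    left = e_f @ h.T - e_o
    return np.mean(np.sum(left * e_o, axis=1))

h = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]])   # linear observation operator
b_cov = np.diag([1.0, 2.0, 0.5])                   # forecast-error covariance
r_cov = np.array([[0.4, 0.1], [0.1, 0.3]])         # observation-error covariance

# The two printed numbers should agree to within sampling noise.
print(cross_term(h, b_cov, r_cov), -np.trace(r_cov))
```
</preformat>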
      <p>Although the use of perturbed observations is unhelpful, it is possible to
subtract the estimated observation-error variance from the mean-square error
calculated using unperturbed observations. This approach has been used
successfully by some authors
<xref ref-type="bibr" rid="bib1.bibx4" id="paren.5"><named-content content-type="pre">for instance see</named-content></xref>, but retains the limitation that
observations do not cover the whole globe.</p>
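      <p>The correction described above might be sketched as follows. This is not
necessarily how the cited authors implement it: the function name is
hypothetical, the observation-error variances are assumed to be known exactly,
and the synthetic forecasts, observations and truth exist only to show that the
corrected score approaches the score against the truth.</p>
<preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np

def obs_corrected_mse(hx_f, y, obs_err_var):
    """Mean-square error against observations minus the summed observation-error variance.

    hx_f and y have shape (cases, n_obs); obs_err_var holds the diagonal of R.
    """
    mse_vs_obs = np.mean(np.sum((hx_f - y) ** 2, axis=1))
    return mse_vs_obs - np.sum(obs_err_var)

rng = np.random.default_rng(2)
n, n_obs = 200_000, 3
obs_err_var = np.array([0.2, 0.5, 0.1])
hx_t = rng.normal(size=(n, n_obs))                              # truth in observation space
hx_f = hx_t + rng.normal(size=(n, n_obs))                       # forecasts with unit error variance
y = hx_t + rng.normal(size=(n, n_obs)) * np.sqrt(obs_err_var)   # unperturbed observations

mse_vs_truth = np.mean(np.sum((hx_f - hx_t) ** 2, axis=1))
# The two printed numbers should agree to within sampling noise.
print(mse_vs_truth, obs_corrected_mse(hx_f, y, obs_err_var))
```
</preformat>
      <p>In practice the variances would themselves be estimates, so any error in
them passes directly into the corrected score.</p>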
</sec>
<sec id="Ch1.S4">
  <title>Definitions of an optimal analysis</title>
      <p>Earlier we indicated that an analysis for which <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo>)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0
should be considered an optimal analysis, since it would not be possible
to improve this analysis by a simple post-processing. This is the same as
saying that the analysis increments are orthogonal to the analysis errors.
However, a more usual definition of an optimal analysis is one which uses the
Kalman gain in calculating the analysis state. In the following we will
demonstrate that these two definitions of an optimal analysis are equivalent.
The orthogonality of analysis increments and errors for an optimal filter has
been known for many years <xref ref-type="bibr" rid="bib1.bibx13" id="paren.6"><named-content content-type="pre">see for instance</named-content></xref>. We include
a derivation of this fact here as it highlights certain assumptions which
need to be made.</p>
      <p>To calculate an analysis state we use the following formula:

              <disp-formula id="Ch1.E10" content-type="numbered"><mml:math display="block"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="bold">K</mml:mi><mml:mo mathsize="1.1em">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        In this equation and the following paragraphs we refer to <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> and
<inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> without an overbar because this derivation can apply to any
forecast and analysis and not simply one coming from an ensemble system.
<inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">K</mml:mi></mml:math></inline-formula> is the gain matrix applied to the innovations – this does not
need to be the optimal (Kalman) gain. As in Eq. (<xref ref-type="disp-formula" rid="Ch1.E7"/>) the
observation is defined by its departure from the truth, <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula>. This
allows us to re-arrange Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>) as

              <disp-formula id="Ch1.E11" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mi mathvariant="bold">KH</mml:mi><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>=</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:mi mathvariant="bold">I</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="bold">KH</mml:mi><mml:mo mathsize="1.1em">)</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>+</mml:mo><mml:mi mathvariant="bold">K</mml:mi><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        <?xmltex \hack{\newpage}?><?xmltex \hack{\noindent}?>We post-multiply this equation by <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo>)</mml:mo><mml:mtext>T</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> and take
the average over a large number of cases. This yields

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mi mathvariant="bold">KH</mml:mi></mml:mtd><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mi mathvariant="normal">T</mml:mi></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="bold">I</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="bold">KH</mml:mi><mml:mo>)</mml:mo><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E12"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mi mathvariant="bold">K</mml:mi><mml:mo>&lt;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:mo 
mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          where we have assumed that <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">K</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">H</mml:mi></mml:math></inline-formula> are constant in
time. Note that in this equation the terms appear as
<inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula>, which is the outer product, whereas
previously we have been dealing with terms like
<inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula>, which is the inner product. Now, to
deal with the terms on the right-hand side of this equation, we re-arrange
the analysis equation, Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>), to give

              <disp-formula id="Ch1.E13" content-type="numbered"><mml:math display="block"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:mo>=</mml:mo><mml:mi mathvariant="bold">KH</mml:mi><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="bold">K</mml:mi><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        We post-multiply this equation by its own transpose and take the average
over a long time series to give

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="bold">KH</mml:mi><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E14"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:msup><mml:mi mathvariant="bold">H</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:msup><mml:mi mathvariant="bold">K</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="bold">K</mml:mi><mml:mo>&lt;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi 
mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo><mml:msup><mml:mi mathvariant="bold">K</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          where we have assumed that the forecast and observation errors are
uncorrelated. We write the forecast- and observation-error covariance matrices
using their usual symbols <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">B</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">R</mml:mi></mml:math></inline-formula> to give

              <disp-formula id="Ch1.E15" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="bold">KHBH</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:msup><mml:mi mathvariant="bold">K</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>+</mml:mo><mml:msup><mml:mi mathvariant="bold">KRK</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        Returning to Eq. (<xref ref-type="disp-formula" rid="Ch1.E13"/>), we may transpose it, pre-multiply by
<inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> and average to evaluate the second term on the right-hand side of Eq. (<xref ref-type="disp-formula" rid="Ch1.E12"/>) as

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mi mathvariant="normal">T</mml:mi></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo><mml:msup><mml:mi mathvariant="bold">H</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:msup><mml:mi mathvariant="bold">K</mml:mi><mml:mtext>T</mml:mtext></mml:msup></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E16"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mo>&lt;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo><mml:msup><mml:mi mathvariant="bold">K</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          If we assume that forecast and observation errors are uncorrelated, then we
find that

              <disp-formula id="Ch1.E17" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mo>&lt;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mtext>o</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold">RK</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        Substituting Eqs. (<xref ref-type="disp-formula" rid="Ch1.E15"/>) and (<xref ref-type="disp-formula" rid="Ch1.E17"/>)
into Eq. (<xref ref-type="disp-formula" rid="Ch1.E12"/>) we find that

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mi mathvariant="bold">KH</mml:mi></mml:mtd><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E18"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="bold">I</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="bold">KH</mml:mi><mml:mo>)</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold">KHBH</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:msup><mml:mi mathvariant="bold">K</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>+</mml:mo><mml:msup><mml:mi mathvariant="bold">KRK</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold">KRK</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

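          Expanding the right-hand side and cancelling terms leaves a common pre-factor <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">KH</mml:mi></mml:math></inline-formula> on both sides. Removing this factor, we get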

              <disp-formula id="Ch1.E19" content-type="numbered"><mml:math display="block"><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{8.5}{8.5}\selectfont$\displaystyle}?><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="bold">BH</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:msup><mml:mi mathvariant="bold">K</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold">KHBH</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:msup><mml:mi mathvariant="bold">K</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold">KRK</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>.</mml:mo><?xmltex \hack{$\egroup}?></mml:mrow></mml:math></disp-formula>

        In Eq. (<xref ref-type="disp-formula" rid="Ch1.E19"/>) we have not made any assumption about the form
of <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">K</mml:mi></mml:math></inline-formula>, and the terms labelled <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">B</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">R</mml:mi></mml:math></inline-formula> are the
true forecast- and observation-error covariance matrices. We claimed above
that the left-hand side of Eq. (<xref ref-type="disp-formula" rid="Ch1.E19"/>) is zero if the gain matrix is equal to
the Kalman gain. So, we substitute the Kalman gain <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">K</mml:mi></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="bold">B</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="bold">H</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>(</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="bold">B</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="bold">H</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>+</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="bold">R</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>
for some of the terms in Eq. (<xref ref-type="disp-formula" rid="Ch1.E19"/>) to give

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mfenced close="" open="["><mml:msup><mml:mi mathvariant="bold">BH</mml:mi><mml:mtext>T</mml:mtext></mml:msup></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mfenced open="." 
close=""><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold">BH</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="bold">HBH</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="bold">R</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msup><mml:mi mathvariant="bold">HBH</mml:mi><mml:mtext>T</mml:mtext></mml:msup></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mfenced close="]" open="."><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold">BH</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mi mathvariant="bold">HBH</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="bold">R</mml:mi><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mi mathvariant="bold">R</mml:mi></mml:mfenced><mml:msup><mml:mi mathvariant="bold">K</mml:mi><mml:mtext>T</mml:mtext></mml:msup></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E20"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mn>0.</mml:mn></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          So, if we assume that the gain used in the data assimilation is optimal, then
the key cross-term in Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) is zero. This is one
of the conditions required for verification against a perturbed analysis to
give the same RMS error as verification against the truth.</p>
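      <p>The algebra leading to Eq. (<xref ref-type="disp-formula" rid="Ch1.E20"/>) can be
checked numerically by evaluating the right-hand side of
Eq. (<xref ref-type="disp-formula" rid="Ch1.E19"/>) for specific matrices. The
sketch below uses arbitrary illustrative values for the observation operator
and the two covariance matrices, and hypothetical function names; it is a
check of the matrix identity only, not a data assimilation system.</p>
<preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np

def increment_error_covariance(k, h, b_cov, r_cov):
    """Right-hand side of Eq. (19): B H^T K^T - K H B H^T K^T - K R K^T."""
    return b_cov @ h.T @ k.T - k @ h @ b_cov @ h.T @ k.T - k @ r_cov @ k.T

def kalman_gain(h, b_cov, r_cov):
    """Kalman gain K = B H^T (H B H^T + R)^-1."""
    return b_cov @ h.T @ np.linalg.inv(h @ b_cov @ h.T + r_cov)

h = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]])
b_cov = np.array([[1.0, 0.3, 0.0], [0.3, 2.0, 0.2], [0.0, 0.2, 0.5]])
r_cov = np.array([[0.4, 0.1], [0.1, 0.3]])

k_opt = kalman_gain(h, b_cov, r_cov)
# Largest absolute element: at rounding level for the Kalman gain ...
print(np.abs(increment_error_covariance(k_opt, h, b_cov, r_cov)).max())
# ... but clearly non-zero when the gain is halved.
print(np.abs(increment_error_covariance(0.5 * k_opt, h, b_cov, r_cov)).max())
```
</preformat>
      <p>Halving the gain is only one example of a suboptimal choice; any gain
other than the Kalman gain can be passed to the same function.</p>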
      <p>Now, Eq. (<xref ref-type="disp-formula" rid="Ch1.E20"/>) states that the outer product of the
analysis errors with the analysis increment is zero. However, for the
verification against a perturbed analysis to be a suitable substitute for
verification against the truth we require the inner product of these two
terms to be zero. If we have two vectors <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> then stating
that the average of the outer product of these vectors is zero,
<inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0, is the same as stating that

              <disp-formula id="Ch1.E21" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mo>&lt;</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mi mathvariant="italic">&gt;=</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mtext>for</mml:mtext><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mtext>all</mml:mtext><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        If the inner product is to be zero, then we require that

              <disp-formula id="Ch1.E22" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mo>&lt;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi mathvariant="italic">&gt;=</mml:mi><mml:mo>&lt;</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi mathvariant="italic">&gt;=</mml:mi><mml:mn>0.</mml:mn></mml:mrow></mml:math></disp-formula>

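        Since Eq. (<xref ref-type="disp-formula" rid="Ch1.E22"/>) sums only the <inline-formula><mml:math display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:math></inline-formula> terms of Eq. (<xref ref-type="disp-formula" rid="Ch1.E21"/>), a vanishing outer product implies a vanishing inner product. Hence Eq. (<xref ref-type="disp-formula" rid="Ch1.E20"/>) implies that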
<inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo>)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.</p>
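      <p>The step from Eq. (<xref ref-type="disp-formula" rid="Ch1.E21"/>) to Eq. (<xref ref-type="disp-formula" rid="Ch1.E22"/>) can be checked numerically. The following is a minimal sketch and is not part of the experiments reported here; the function name, the synthetic random vectors and the use of NumPy are illustrative assumptions. It shows that the mean inner product is the trace of the mean outer product, so a vanishing outer product forces a vanishing inner product.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def mean_outer_and_inner(y, x):
    """Return the sample-mean outer product <y x^T> and inner product <y^T x>.

    y, x: arrays of shape (n_samples, n_dim); each row is one sample vector.
    """
    n_samples = y.shape[0]
    mean_outer = y.T @ x / n_samples        # entry [i, j] is <y_i x_j>
    mean_inner = np.sum(y * x) / n_samples  # <sum_i y_i x_i>
    return mean_outer, mean_inner


# Synthetic, arbitrary vectors (assumption: purely illustrative data).
rng = np.random.default_rng(0)
y = rng.standard_normal((1000, 3))
x = rng.standard_normal((1000, 3))
mean_outer, mean_inner = mean_outer_and_inner(y, x)

# The inner product is the sum of the diagonal (i = j) terms of the outer product.
print(np.isclose(np.trace(mean_outer), mean_inner))
```
]]></preformat>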
      <p>In this calculation the forecast <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> is the one used in
calculating the new analysis. Given that the analysis referred to in the last
term of Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) is an ensemble mean, then
<inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> should be the ensemble mean background forecast to the data
assimilation. That is, we must re-write Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) as

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo 
mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E23"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>≈</mml:mo><mml:mo>&lt;</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> is the ensemble-mean background for the
ensemble data assimilation. Thus the above argument does not apply to
deterministic forecasts or longer lead time forecasts. The issue of longer
lead times is discussed further in Sect. <xref ref-type="sec" rid="Ch1.S7"/>.</p>
      <p>This derivation also informs how the analysis ensemble is created. Following
Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>) the update of the ensemble mean will follow
<?xmltex \hack{\newpage}?><?xmltex \hack{\vspace*{-8mm}}?>

              <disp-formula id="Ch1.E24" content-type="numbered"><mml:math display="block"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>=</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="bold">K</mml:mi><mml:mo mathsize="1.1em">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">K</mml:mi></mml:math></inline-formula> is the optimal (Kalman) gain matrix. In
Sect. <xref ref-type="sec" rid="Ch1.S2"/> we assumed that the analysis ensemble
perturbations are drawn from the same distribution as the analysis errors.
One way to ensure this <xref ref-type="bibr" rid="bib1.bibx1" id="paren.7"/> is to update each ensemble member
according to

              <disp-formula id="Ch1.E25" content-type="numbered"><mml:math display="block"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>f</mml:mtext></mml:msubsup><mml:mo>+</mml:mo><mml:mi mathvariant="bold">K</mml:mi><mml:mo mathsize="1.1em">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mtext>f</mml:mtext></mml:msubsup><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is a perturbation to the observations created using the
(true) observation error covariance matrix, <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">R</mml:mi></mml:math></inline-formula>. Note that in both
the above equations <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">K</mml:mi></mml:math></inline-formula> is the Kalman gain calculated using the true
(unknown) background and observation error covariance matrices. This matrix
is approximated in the ensemble Kalman filter and ensemble-variational
methods used with geophysical models <xref ref-type="bibr" rid="bib1.bibx12 bib1.bibx9" id="paren.8"/>. In
the following tests we use <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">K</mml:mi></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="bold">B</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="bold">H</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>(</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="bold">B</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="bold">H</mml:mi><mml:mtext>T</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>+</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="bold">R</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>
with <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">B</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">R</mml:mi></mml:math></inline-formula> fixed.</p>
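      <p>The updates in Eqs. (<xref ref-type="disp-formula" rid="Ch1.E24"/>) and (<xref ref-type="disp-formula" rid="Ch1.E25"/>) can be sketched in a few lines of code. This is an illustrative sketch only, assuming a linear observation operator and fixed covariance matrices; the function names and all numerical values are hypothetical and are not those used in the tests of the following section.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def kalman_gain(B, H, R):
    """Gain K = B H^T (H B H^T + R)^-1 for fixed B and R."""
    S = H @ B @ H.T + R
    return B @ H.T @ np.linalg.inv(S)


def update_mean(xf_mean, y, K, H):
    """Ensemble-mean update: xa_mean = xf_mean + K (y - H xf_mean)."""
    return xf_mean + K @ (y - H @ xf_mean)


def update_members(xf, y, K, H, R, rng):
    """Member update: each member assimilates its own perturbed observation.

    xf has shape (n_members, n_state). The observation perturbations y_i
    are drawn from N(0, R), one independent draw per member.
    """
    n_members = xf.shape[0]
    y_pert = rng.multivariate_normal(np.zeros(len(y)), R, size=n_members)
    innovations = y + y_pert - xf @ H.T   # shape (n_members, n_obs)
    return xf + innovations @ K.T


# Hypothetical scalar example (values chosen only for illustration).
rng = np.random.default_rng(0)
B = np.array([[0.04]])
R = np.array([[0.01]])
H = np.array([[1.0]])
K = kalman_gain(B, H, R)
xf = rng.normal(0.5, 0.2, size=(5, 1))
y = np.array([0.6])
xa_mean = update_mean(xf.mean(axis=0), y, K, H)
xa = update_members(xf, y, K, H, R, rng)
print(K, xa_mean, xa.shape)
```
]]></preformat>
      <p>Note that, in this sketch, the mean of the updated members differs from the output of <monospace>update_mean</monospace> by the gain applied to the sample mean of the observation perturbations, which vanishes only in the limit of a large ensemble.</p>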
</sec>
<sec id="Ch1.S5">
  <title>Testing using a simple model</title>
      <p>A toy-model data assimilation system was created to test whether the above
assumptions can hold in an idealised context. For this, the logistic map was
used <xref ref-type="bibr" rid="bib1.bibx16" id="paren.9"><named-content content-type="pre">see for instance</named-content></xref>. The logistic map is a
single-variable chaotic map, iterated according to

              <disp-formula id="Ch1.E26" content-type="numbered"><mml:math display="block"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>C</mml:mi><mml:msub><mml:mi>x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo mathsize="1.1em">(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math display="inline"><mml:mi>C</mml:mi></mml:math></inline-formula> is a constant. The basin of attraction for this map is the
range (0, 1); states <inline-formula><mml:math display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> outside the interval [0, 1] diverge (towards minus infinity). The map
is chaotic when <inline-formula><mml:math display="inline"><mml:mi>C</mml:mi></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>&gt;</mml:mo></mml:math></inline-formula> 3.57 (approx.) and has a Hausdorff dimension of
about 0.538 <xref ref-type="bibr" rid="bib1.bibx10" id="paren.10"/>. In our experiments we choose
<inline-formula><mml:math display="inline"><mml:mi>C</mml:mi></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 3.7 as for this value the map exhibits chaotic behaviour.</p>
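      <p>A minimal sketch of Eq. (<xref ref-type="disp-formula" rid="Ch1.E26"/>) follows; the function names and the starting value are illustrative assumptions. For <italic>C</italic> = 3.7 the map attains its maximum value <italic>C</italic>/4 = 0.925 at <italic>x</italic> = 0.5, so a trajectory started inside (0, 1) remains inside it.</p>
      <preformat preformat-type="code"><![CDATA[
```python
def logistic_map(x, C=3.7):
    """One iteration of the logistic map, x_{n+1} = C x_n (1 - x_n)."""
    return C * x * (1.0 - x)


def iterate(x0, n_steps, C=3.7):
    """Return the trajectory [x_0, x_1, ..., x_{n_steps}]."""
    trajectory = [x0]
    for _ in range(n_steps):
        trajectory.append(logistic_map(trajectory[-1], C))
    return trajectory


# Arbitrary starting value inside (0, 1) (assumption for illustration).
trajectory = iterate(0.3, 1000)
print(min(trajectory), max(trajectory))

# A state just outside the unit interval maps to a negative value,
# after which the iterates grow without bound in magnitude.
print(logistic_map(1.1))
```
]]></preformat>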
      <p>We initialise an ensemble by randomly choosing states in the interval (0, 1).
The logistic map is applied to each member to create a forecast ensemble. The
forecast ensemble is transformed into an analysis ensemble by each member
assimilating a perturbed observation. The observations are created by adding
a perturbation to the run of the truth model. These perturbations are
distributed according to <inline-formula><mml:math display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>(0, 0.001). The perturbed observations
are created from the observations by adding a perturbation sampled from the
same distribution. The assimilation always uses a fixed background error
variance, <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, and we test the formulas derived above by varying the value of
<inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>. Because a fixed <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> is a poor approximation to the true background errors, the
assimilation will not in general be optimal, and we may find that
<inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>f</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo>)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula>
is non-zero. We examine this later. Observations are assimilated every time
step and Eq. (<xref ref-type="disp-formula" rid="Ch1.E26"/>) is used to iterate both the ensemble members
and the truth run. The first 2000 assimilation cycles are rejected as a
spin-up period. Analysis states which fall outside the basin of attraction
are reset to lie within it. The assimilation is run for a further
200 000 assimilation cycles and 400 ensemble members are used. Confidence
intervals were calculated using the bootstrap method assuming each of the
assimilation cycles gives an independent sample of the analysis error. Since
we use a long run the estimated confidence intervals are very narrow, and
correspond approximately to the line width in the plots. Therefore these are
not shown in order to aid clarity. In order to be consistent with the results
of the previous section the only forecasts verified are the ensemble mean
background forecasts. All results shown here have used the logistic map.
Similar results have also been found with the models of <xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx15" id="text.11"/>.</p>
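      <p>The assimilation cycle described above can be sketched as follows. This is an illustrative reconstruction under stated assumptions and is not the code used to produce the results in this paper: the state is a scalar observed directly, so the gain reduces to <italic>B</italic>/(<italic>B</italic> + <italic>R</italic>) with <italic>B</italic> and <italic>R</italic> treated as variances; the value 0.001 is read as the observation error variance; analysis states leaving the basin of attraction are reset by clipping to a small margin inside (0, 1), since the reset procedure is not specified in the text; and the ensemble size and run length are much smaller than the 400 members and 200 000 cycles quoted above, to keep the example fast. All function and variable names are hypothetical.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def logistic_map(x, C=3.7):
    """One iteration of the logistic map (works on scalars and arrays)."""
    return C * x * (1.0 - x)


def run_cycle(b_var, r_var=0.001, n_members=20, n_cycles=2000,
              n_spinup=200, seed=0):
    """Perturbed-observation assimilation with a fixed background variance.

    Returns (RMSE of the ensemble-mean background forecast against the truth,
             RMSE of the same forecast against the perturbed analyses,
             mean cross-term <(xf_mean - xa_mean)(xa_mean - truth)>).
    """
    rng = np.random.default_rng(seed)
    gain = b_var / (b_var + r_var)          # scalar gain, H = 1 (assumption)
    margin = 1e-6                           # reset margin (assumption)
    truth = rng.uniform(0.0, 1.0)
    ens = rng.uniform(0.0, 1.0, n_members)
    sq_truth, sq_pert, cross = [], [], []
    for cycle in range(n_spinup + n_cycles):
        # Forecast step for the truth run and every ensemble member.
        truth = logistic_map(truth)
        ens_f = logistic_map(ens)
        # Observation of the truth, then one perturbed observation per member.
        obs = truth + rng.normal(0.0, np.sqrt(r_var))
        obs_pert = obs + rng.normal(0.0, np.sqrt(r_var), n_members)
        # Analysis step; states outside the basin are reset by clipping.
        ens = ens_f + gain * (obs_pert - ens_f)
        ens = np.clip(ens, margin, 1.0 - margin)
        if cycle >= n_spinup:               # discard the spin-up period
            xf_mean, xa_mean = ens_f.mean(), ens.mean()
            sq_truth.append((xf_mean - truth) ** 2)
            sq_pert.append(np.mean((xf_mean - ens) ** 2))
            cross.append((xf_mean - xa_mean) * (xa_mean - truth))
    return (np.sqrt(np.mean(sq_truth)),
            np.sqrt(np.mean(sq_pert)),
            np.mean(cross))


# Scan a few hypothetical values of the fixed background variance.
for b_var in (0.0005, 0.001, 0.0025, 0.005):
    rmse_truth, rmse_pert, cross_term = run_cycle(b_var)
    print(b_var, rmse_truth, rmse_pert, cross_term)
```
]]></preformat>
      <p>In this sketch the second returned value corresponds to the left-hand side of Eq. (<xref ref-type="disp-formula" rid="Ch1.E23"/>) and the third to its last cross-term, so the two RMSE estimates can be compared as the fixed background variance is varied.</p>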
      <p>Figure <xref ref-type="fig" rid="Ch1.F1"/> shows the RMS background-forecast and
analysis errors as a function of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>. When <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> is small the forecast and
analysis errors (dark blue line and red line, respectively) are large and the
system is sub-optimal for these values. Verification against a perturbed
analysis gives a systematically lower RMS error (RMSE) than verification
against the truth (dark blue line) for small values of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, since
insufficient weight is given to the observations. The RMS error for
verification against a perturbed analysis becomes equal to that when
verifying against the truth for moderate values of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> (<inline-formula><mml:math display="inline"><mml:mo>∼</mml:mo></mml:math></inline-formula> 0.049). This
point is also where the RMS error crosses the diagonal, indicating that the
background errors used in the assimilation are equal to the actual background
errors, and the assimilation is optimal. Verification against observations
gives RMS errors which are systematically higher than all the other
estimates. If observation errors are accounted for, then verification against
observations becomes very similar to verification against the truth (not
shown). Verification against unperturbed analyses gives smaller RMSEs than
all the other methods.</p>
      <p>The circles in Fig. <xref ref-type="fig" rid="Ch1.F1"/> indicate the point at which
the RMS errors are minimised for each curve. The minimum RMSE for
verification against analysis (purple line) occurs at a value of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> around 0.026,
which is much lower than the optimal value of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> for verification against
the truth. The black line shows verification against perturbed analyses, and
the minimum RMS error for this curve occurs when <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> is around 0.03. This is much
larger than the value of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> for the minimum RMS error for verification
against (unperturbed) analyses. However, this value of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, around 0.03, is
much lower than the optimal (Kalman) value of around 0.049. When verifying
forecasts against the truth (dark blue line) the minimum value of the
forecast error is found for <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> around 0.036, lower than the optimal (Kalman)
value. This statement may seem counter-intuitive – the lowest forecast error
is found when the value of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> used in the analysis is not equal to the
forecast error. However, recall that the logistic map is a non-linear map and
that the Kalman filter is only optimal for linear models. We have found a
similar result with other models (the models of <xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx15" id="altparen.12"/>). For both these models the forecast error is minimised when the
value of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> used is larger than the actual forecast error (the value given
for the Kalman filter). For the logistic map the value of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> which minimises
the analysis error is around 0.044, closer to the Kalman value than for the
forecast error – this appears to be a result consistent across the different
models.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1"><caption><p>RMS error of the forecast and analysis using the logistic model as a
function of the background error standard deviation used in calculating the
analysis. The red and blue lines show the RMSE for the analysis and forecast
measured against the truth state. The other lines show the RMSE of the
forecast, when verified against a different proxy for the truth. Verification
is calculated over 200 000 analysis and forecast cycles.</p></caption>
        <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="npg-2013-160-f01.pdf"/>

      </fig>

      <p>The vertical line in Fig. <xref ref-type="fig" rid="Ch1.F1"/> is the point at
which the cross-term <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo>)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula>
(last term of Eq. <xref ref-type="disp-formula" rid="Ch1.E23"/>) is zero. We can see that
this vertical line is at approximately the same value of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> where the
actual forecast error equals the background error assumed in the assimilation. This cross-term is plotted in
Fig. <xref ref-type="fig" rid="Ch1.F2"/>, as the solid green line, as a function of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>. Also
plotted is the correlation between the forecast and analysis errors
<inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo>)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula>
(blue dashed line). This is non-zero for all the values
of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> run in these experiments. This demonstrates the problem in verifying
against an unperturbed analysis: for all the values of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> used here, the
errors in the forecast are correlated with the errors in the analysis.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><caption><p>Important cross-terms calculated from a long analysis cycle using
the logistic model, as a function of the background error standard deviation
used in calculating the analysis. These are <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo>)</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula> (blue
dashed) and <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>)</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula>
(green solid).</p></caption>
        <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="npg-2013-160-f02.pdf"/>

      </fig>

      <p>One of the conditions required for verification against perturbed analyses to
give similar results to verification against the truth is for the analysis
ensemble spread to equal the RMS analysis errors (Eq. <xref ref-type="disp-formula" rid="Ch1.E1"/>).
The analysis and forecast ensemble spread and error are plotted
in Fig. <xref ref-type="fig" rid="Ch1.F3"/>. The ensembles appear to be well calibrated
for most values of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>. This might change if model error were introduced into
the system.</p>
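      <p>The comparison of ensemble spread with the RMS error of the ensemble mean can be sketched as follows. This is an illustrative sketch on synthetic data, not the diagnostic code used for the figures in this paper: the truth and every member are drawn from the same Gaussian distribution, and the spread is taken as the square root of the cycle-averaged unbiased ensemble variance. Both choices, and all names and values, are assumptions.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def spread_and_rmse(ens, truth):
    """Ensemble spread and RMS error of the ensemble mean.

    ens:   array of shape (n_cycles, n_members)
    truth: array of shape (n_cycles,)
    """
    spread = np.sqrt(np.mean(np.var(ens, axis=1, ddof=1)))
    rmse = np.sqrt(np.mean((ens.mean(axis=1) - truth) ** 2))
    return spread, rmse


# Synthetic, perfectly calibrated ensemble (assumption for illustration).
rng = np.random.default_rng(0)
sigma, n_cycles, n_members = 0.03, 5000, 50
truth = rng.normal(0.5, sigma, n_cycles)
ens = rng.normal(0.5, sigma, (n_cycles, n_members))
spread, rmse = spread_and_rmse(ens, truth)
print(spread, rmse)
```
]]></preformat>
      <p>For an ensemble constructed in this way the two quantities should agree closely, with the RMS error of the ensemble mean slightly larger because of the finite ensemble size.</p>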

      <?xmltex \floatpos{t}?><fig id="Ch1.F3"><caption><p>RMS error and ensemble spread of the forecast and analysis using the
logistic model, as a function of the background error standard deviation used
in calculating the analysis. The ensembles were created by each ensemble
member using the same assimilation method, assimilating perturbed observations.</p></caption>
        <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="npg-2013-160-f03.pdf"/>

      </fig>

</sec>
<sec id="Ch1.S6">
  <title>Considering the effects of ensemble size</title>
      <p>Next, we consider whether these results change substantially if fewer
ensemble members are used. Results with a 10-member ensemble are shown in
Fig. <xref ref-type="fig" rid="Ch1.F4"/>. This figure is rather similar to
Fig. <xref ref-type="fig" rid="Ch1.F1"/>, with the most notable difference being
that the vertical line no longer passes through the point where the other lines cross.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4"><caption><p>RMS error of the forecast and analysis as plotted in
Fig. <xref ref-type="fig" rid="Ch1.F1"/>, but using an ensemble with only 10 members.</p></caption>
        <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="npg-2013-160-f04.pdf"/>

      </fig>

      <p><?xmltex \hack{\newpage}?>To understand how ensemble size can affect the results, we need to return to
estimates of the analysis error and spread. In Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) we
relied on a cancellation of the analysis ensemble spread with the error of
the ensemble mean. For a limited-size ensemble this cancellation does not
hold precisely. As has been shown by <xref ref-type="bibr" rid="bib1.bibx18" id="text.13"/> the RMS error of an
ensemble mean is slightly increased by effects related to the limited
ensemble size. To show the limitations, suppose that the true state and each
ensemble member are random draws from the same distribution, which has mean
<inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> and variance <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. We can thus write the truth as the mean of
this distribution plus a deviation from the mean
<?xmltex \hack{\newpage}?><?xmltex \hack{\vspace*{-8mm}}?>

              <disp-formula id="Ch1.E27" content-type="numbered"><mml:math display="block"><mml:mrow><mml:msup><mml:mi>x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:mo>=</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>+</mml:mo><mml:mi>s</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mi>s</mml:mi><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0 and <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:msup><mml:mi>s</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. For an analysis
ensemble member we would have

              <disp-formula id="Ch1.E28" content-type="numbered"><mml:math display="block"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>=</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is a random draw from the same distribution as <inline-formula><mml:math display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula>. Thus we may
write the ensemble mean as

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>+</mml:mo><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E29"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>+</mml:mo><mml:mfrac><mml:mn mathvariant="normal">1</mml:mn><mml:mi>N</mml:mi></mml:mfrac><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          We see that <inline-formula><mml:math display="inline"><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:math></inline-formula> has mean zero and variance <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> where <inline-formula><mml:math display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>
is the ensemble size. Using this, <xref ref-type="bibr" rid="bib1.bibx18" id="text.14"/> showed that the
mean-square error of the ensemble mean is

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi>x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mi>s</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mi>s</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E30"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mfrac><mml:mn mathvariant="normal">1</mml:mn><mml:mi>N</mml:mi></mml:mfrac><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          since <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>s</mml:mi><mml:mo>&gt;</mml:mo></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0. Because the ensemble mean is
not exactly equal to the mean of the distribution, the mean-square error of the ensemble
mean is slightly larger than the variance of the distribution. This is a
standard mathematical result <xref ref-type="bibr" rid="bib1.bibx11" id="paren.15"><named-content content-type="pre">for instance see</named-content><named-content content-type="post">p. 128</named-content></xref>.
Using a similar argument, let us now consider the ensemble perturbations

              <disp-formula id="Ch1.E31" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:mo>&lt;</mml:mo><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        From the definition of <inline-formula><mml:math display="inline"><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:math></inline-formula> and recalling that the <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>'s are
independent samples we see that
<?xmltex \hack{\newpage}?><?xmltex \hack{\vspace*{-8mm}}?>

              <disp-formula id="Ch1.E32" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mo>&lt;</mml:mo><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mover accent="true"><mml:mi>w</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:mfrac><mml:mn mathvariant="normal">1</mml:mn><mml:mi>N</mml:mi></mml:mfrac><mml:mo>&lt;</mml:mo><mml:msubsup><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mi>N</mml:mi></mml:mfrac></mml:mrow></mml:math></disp-formula>

        and so

              <disp-formula id="Ch1.E33" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo mathsize="1.1em">(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mfrac><mml:mn mathvariant="normal">1</mml:mn><mml:mi>N</mml:mi></mml:mfrac><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        So, the ensemble spread is slightly smaller than the variance of the
distribution due to correlations between deviations of the ensemble mean from
the distribution mean and the perturbations. This is often accounted for by
using the unbiased estimator of the ensemble spread. Putting all this
together, we find that for a well-calibrated ensemble

              <disp-formula id="Ch1.E34" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mfrac><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi>x</mml:mi><mml:mtext>t</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow><mml:mrow><mml:mo>&lt;</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mtext>a</mml:mtext></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup><mml:msup><mml:mo mathsize="1.1em">)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>&gt;</mml:mo></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        As the ensemble size goes to infinity this ratio tends to 1 and
Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) holds. However, for a limited ensemble size these
differences mean that verification against analysis is not the same as
verification against the truth, even when the other conditions hold. This
could be corrected for if the analysis spread is known.</p>
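      <p>The finite-size results in Eqs. (<xref ref-type="disp-formula" rid="Ch1.E30"/>), (<xref ref-type="disp-formula" rid="Ch1.E33"/>)
and (<xref ref-type="disp-formula" rid="Ch1.E34"/>) can be checked numerically. The following Python
sketch is an illustration only and is not part of the original analysis: the
function name <monospace>spread_error_ratio</monospace>, the choice of a Gaussian distribution, the
number of trials and the random seed are all assumptions made for the example.</p>
<preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def spread_error_ratio(n_members, n_trials=100_000, mu=0.0, sigma=1.0, seed=0):
    """Monte Carlo estimate of the mean-square error of the ensemble mean,
    the mean-square ensemble perturbation, and their ratio.

    Truth and members are i.i.d. draws from one distribution (Gaussian here,
    which is an assumption of this sketch, not a requirement of the argument).
    """
    rng = np.random.default_rng(seed)
    truth = rng.normal(mu, sigma, size=n_trials)                 # x^t = mu + s
    members = rng.normal(mu, sigma, size=(n_trials, n_members))  # x^a_i = mu + w_i
    ens_mean = members.mean(axis=1)                              # mu + w-bar

    # Expected value: sigma^2 (1 + 1/N)
    mse_mean = np.mean((ens_mean - truth) ** 2)
    # Expected value: sigma^2 (1 - 1/N), i.e. the spread without the N-1 correction
    spread = np.mean((members - ens_mean[:, None]) ** 2)
    return mse_mean, spread, mse_mean / spread


if __name__ == "__main__":
    for n in (2, 5, 12, 50):
        mse, spr, ratio = spread_error_ratio(n)
        print(f"N={n:3d}  MSE={mse:.3f}  spread={spr:.3f}  "
              f"ratio={ratio:.3f}  (N+1)/(N-1)={(n + 1) / (n - 1):.3f}")
```
]]></preformat>
      <p>For an ideal 12-member ensemble, for example, the expected ratio is 13/11, or
about 1.18, so the effect is not negligible for ensembles of this size.</p>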
</sec>
<sec id="Ch1.S7">
  <title>Longer lead times</title>
      <p>As was discussed in Sect. <xref ref-type="sec" rid="Ch1.S4"/> the argument that the final term in
Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) is zero requires the forecast being verified to
be the background for the analysis. However, we might expect that this term
is zero for longer lead times, since otherwise it should be possible to
produce a superior analysis. To investigate this further we turn to the
simple model tests used earlier.</p>
      <p>Verification results for longer lead times, using the system described in
Sect. <xref ref-type="sec" rid="Ch1.S5"/>, are given in Fig. <xref ref-type="fig" rid="Ch1.F5"/>. This shows the
ratio of the RMSE measured against truth to the RMSE measured against
perturbed analyses. This ratio is plotted for two choices of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>. When the
Kalman value of <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> is used the two verifications give the same RMS error at
the first lead time (i.e. where the forecast is the background for the
analysis). At longer lead times the RMS error when verifying against a
perturbed analysis becomes larger than when verifying against the truth. This
is caused by the final term in Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) giving a
positive contribution to the verification against perturbed analysis. The
interpretation is that
<inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mtext>t</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> and
<inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>f</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>a</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula>
are positively correlated – errors in the analysis are anti-correlated with
differences between the forecast and the analysis. The correlation of
analysis errors with forecast-analysis differences may be related to the use
of a nonlinear model. Nonlinearity can make the errors non-random, which in
turn leads to the correlation.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5"><caption><p>Ratio of the RMS errors of forecasts verified against truth and
perturbed analyses using the logistic map for various lead times. For the
solid line the background error was taken as the approximate Kalman value.
For the dashed line <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> was taken as the value which minimises the
short-period forecast error.</p></caption>
        <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="npg-2013-160-f05.pdf"/>

      </fig>

      <p>Also shown in Fig. <xref ref-type="fig" rid="Ch1.F5"/> is the ratio when <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> is chosen to be
the value which gives the minimum forecast error – for the logistic map this
value is lower than the Kalman value for <inline-formula><mml:math display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>. In this case verification
against perturbed analysis gives smaller RMSEs than verification against the
truth at short lead times. At longer lead times the verifications cross over
and the RMSE against perturbed analyses is greater than the RMSE against the
truth.</p>
      <p>This behaviour at long lead times suggests that verification against a
perturbed analysis is most useful at short lead times. Nonetheless it avoids
the worst problems of verification against an unperturbed analysis.
Therefore, we argue that it is still a useful replacement for that method of verification.</p>
</sec>
<sec id="Ch1.S8">
  <title>Verification of NWP forecasts</title>
      <p>In order to understand whether this method can be applied to numerical
weather prediction (NWP) systems we calculated the RMS error of a forecast
ensemble mean against observations and perturbed analyses. The RMS error
against analyses was calculated at observation locations so that the
quantities are directly comparable.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><caption><p>RMS errors of MOGREPS ensemble mean as a function of forecast lead
time for forecasts of 500 hPa geopotential height. The forecast errors are
reported for verification against observations and perturbed and unperturbed
analyses.</p></caption>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="npg-2013-160-f06.pdf"/>

      </fig>

      <p>Figure <xref ref-type="fig" rid="Ch1.F6"/> shows the RMS error of the forecast ensemble mean as a
function of lead time for 500 hPa geopotential height for the Met Office
Global and Regional Ensemble Prediction System, MOGREPS <xref ref-type="bibr" rid="bib1.bibx4" id="paren.16"/>.
At the time the forecasts were taken the MOGREPS ensemble consisted of a
random sample of 11 members selected from 22 perturbed members used to cycle
the ETKF every 6 h, plus the control member. The time average has been taken
over 1 month of data. The different panels in Fig. <xref ref-type="fig" rid="Ch1.F6"/> represent
means over different geographical areas: Northern Hemisphere, tropics,
Southern Hemisphere and the whole globe. Each panel shows the RMS error of
the ensemble mean against the unperturbed analysis (red), the perturbed
analyses (black), the observations (blue), and the observations with the
observation errors accounted for (green). An observation
error of 9.4 m (RMS) has been assumed.</p>
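      <p>As a minimal sketch of how an assumed observation error might be removed from
an RMS error measured against observations, consider the Python function below.
It assumes that observation errors are uncorrelated with forecast errors, so
that the correction is a subtraction in quadrature; the text here does not
spell out the formula used, and the function name and the example RMS error of
15 m are hypothetical.</p>
<preformat preformat-type="code"><![CDATA[
```python
import math


def rmse_without_obs_error(rmse_vs_obs, obs_error_rms):
    """Remove an assumed observation-error RMS from an RMSE measured against
    observations.

    Assumes observation and forecast errors are uncorrelated, so their
    variances add and the correction is a subtraction in quadrature.
    """
    variance = rmse_vs_obs ** 2 - obs_error_rms ** 2
    if variance < 0.0:
        raise ValueError("assumed observation error exceeds the measured RMSE")
    return math.sqrt(variance)


if __name__ == "__main__":
    # 9.4 m is the observation error assumed in the text;
    # 15 m is a made-up RMSE used only to exercise the function.
    print(rmse_without_obs_error(15.0, 9.4))
```
]]></preformat>
      <p>The explicit check for a negative variance is a design choice of this sketch:
if the assumed observation error is larger than the measured RMS error, the
assumption itself is suspect and no corrected value is returned.</p>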
      <p>Verification against observations gives RMS errors which are systematically
higher than all other estimates, while verification against unperturbed
analyses provides smaller RMS error than verification against observations
and perturbed analyses. This is in agreement with
Fig. <xref ref-type="fig" rid="Ch1.F1"/>. The exception is for the Southern
Hemisphere, where the error against observations becomes smaller than the
estimates against analyses after <inline-formula><mml:math display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 60 h. When observation errors are
accounted for, the verification against the observations is very similar to
the verification against perturbed analyses from <inline-formula><mml:math display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 0 h to
<inline-formula><mml:math display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 36 h for the Northern and Southern hemispheres, while for longer
lead times it gives lower RMS errors. This agreement is not seen in the tropics,
probably because the verification there includes a contribution from systematic
errors which are not accounted for in the analysis perturbations. This is
expected, since 500 hPa geopotential height is not a good
representation of what happens in the tropics.</p>
      <p>The consistency of the RMS errors for short lead times in the northern and
southern extra-tropics when calculated against perturbed analyses and
observations (when subtracting observation error) suggests that this ensemble
meets many of the required criteria. At longer lead times verification
against perturbed and unperturbed analyses gives larger RMS errors than
verification against observations with the observation error subtracted. This
is consistent with the results in Fig. <xref ref-type="fig" rid="Ch1.F5"/> – when
analysis and forecast errors are no longer correlated the effect of analysis
error is to over-estimate the RMSE.</p>
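      <p>The over-estimate for uncorrelated errors can be illustrated with a toy
calculation. The Python sketch below is not the experiment reported here: it
assumes Gaussian forecast errors with standard deviation
<monospace>sigma_f</monospace>, an independent analysis-mean error with standard deviation
<monospace>sigma_a</monospace>, and an independent perturbation whose spread equals the analysis
error (the ideal-ensemble condition). Under these assumptions the mean-square
error against the perturbed analysis is the forecast error variance plus twice
the analysis error variance. All names and values are hypothetical.</p>
<preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def rmse_truth_and_perturbed(sigma_f, sigma_a, n_samples=200_000, seed=0):
    """Return (RMSE against truth, RMSE against a perturbed analysis) for a
    toy case in which forecast and analysis errors are uncorrelated."""
    rng = np.random.default_rng(seed)
    truth = rng.normal(0.0, 1.0, n_samples)
    # Forecast error, independent of everything else (long-lead-time assumption)
    forecast = truth + rng.normal(0.0, sigma_f, n_samples)
    # Error of the analysis ensemble mean, independent of the forecast error
    analysis_mean = truth + rng.normal(0.0, sigma_a, n_samples)
    # Perturbation with spread equal to the analysis error (ideal ensemble)
    perturbed_analysis = analysis_mean + rng.normal(0.0, sigma_a, n_samples)

    rmse_truth = np.sqrt(np.mean((forecast - truth) ** 2))
    rmse_perturbed = np.sqrt(np.mean((forecast - perturbed_analysis) ** 2))
    return rmse_truth, rmse_perturbed


if __name__ == "__main__":
    r_truth, r_pert = rmse_truth_and_perturbed(sigma_f=2.0, sigma_a=0.5)
    print(f"RMSE vs truth:              {r_truth:.3f}")
    print(f"RMSE vs perturbed analysis: {r_pert:.3f}")
    print(f"ratio truth / perturbed:    {r_truth / r_pert:.3f}")
```
]]></preformat>
      <p>In this toy setting the ratio of the RMSE against truth to the RMSE against
the perturbed analysis is always below 1, and it approaches 1 as the analysis
error becomes small relative to the forecast error.</p>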
</sec>
<sec id="Ch1.S9" sec-type="conclusions">
  <title>Conclusions</title>
      <p>We have shown that verification against a perturbed analysis gives the same
RMS errors as verification against the truth, under certain conditions. These
conditions require that the analysis ensemble is ideal (its RMS spread
matches the RMS error in the mean analysis), that the analysis is optimal and
that the ensemble size is large. Although NWP data assimilation systems are
typically well tuned (to maximise forecast performance), none of these
conditions is likely to hold exactly in practice. Additionally, the above
results only apply to a forecast which is the background for the analysis
against which it is verified.</p>
      <p>In spite of these limitations we believe that this may be a useful approach
to verification. Firstly, it will give more realistic results than
verification against an unperturbed analysis in most situations. Secondly, the
alternative is to verify against observations and explicitly account for the
effect of observation error. Given the difficulty in estimating observation
error and the fact that many parts of the world are sparsely observed, this
has its own limitations. The verification results for NWP forecasts indicate
that our method gives very similar results to verification against observations, when
observation error is accounted for, for short lead times in the
extra-tropics. Given that the problems of verification against unperturbed
analyses are most pronounced at short lead times, our method is potentially
valuable for verification of short-term NWP forecasts.</p>
      <p>It would be interesting to further explore some of the aspects of this
method. For instance, what is the effect of using an analysis ensemble which
is over-spread in some areas and under-spread in others? This study also
demonstrated that for a nonlinear model the Kalman filter solution may not
minimise the system's forecast error. We feel that a better understanding of
this result would be beneficial.</p>
</sec>

      
      </body>
    <back><ack><title>Acknowledgement</title><p>The analysis of limited ensemble size came about through discussion with
Jonathan Flowerdew. Rob Darvell gave extensive assistance in the verification
of the NWP forecasts. <?xmltex \hack{\newline}?><?xmltex \hack{\newline}?>
Edited by: O. Talagrand <?xmltex \hack{\newline}?>
Reviewed by: five anonymous referees</p></ack><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Berre et al.(2006)Berre, Stefanescu, and Pereira</label><mixed-citation>
Berre, L., Stefanescu, S., and Pereira, M.: The representation of the analysis
effect in three error simulation techniques, Tellus A, 58, 196–209, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Bowler(2006)</label><mixed-citation>
Bowler, N. E.: Explicitly accounting for observation error in categorical
verification of forecasts, Mon. Weather Rev., 134, 1600–1606, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Bowler(2008)</label><mixed-citation>
Bowler, N. E.: Accounting for the effect of observation errors on verification
of MOGREPS, Meteorol. Appl., 15, 199–205, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Bowler et al.(2008)Bowler, Arribas, Mylne, Robertson, and
Beare</label><mixed-citation>Bowler, N. E., Arribas, A., Mylne, K. R., Robertson, K. B., and Beare, S. E.:
The MOGREPS short-range ensemble prediction system, Q. J. Roy. Meteorol.
Soc., 134, 703–722, 2008.
 </mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx5"><label>Buizza et al.(2005)Buizza, Houtekamer, Toth, Pellerin, Wei, and
Zhu</label><mixed-citation>
Buizza, R., Houtekamer, P., Toth, Z., Pellerin, G., Wei, M., and Zhu, Y.: A
comparison of the ECMWF, MSC, and NCEP global ensemble prediction systems,
Mon. Weather Rev., 133, 1076–1097, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Candille and Talagrand(2008)</label><mixed-citation>
Candille, G. and Talagrand, O.: Impact of observational error on the validation
of ensemble prediction systems, Q. J. Roy. Meteorol. Soc., 134, 957–971, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Ciach and Krajewski(1999)</label><mixed-citation>
Ciach, G. J. and Krajewski, W. F.: On the estimation of radar rainfall error
variance, Adv. Water Resour., 22, 585–595, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Clayton et al.(2013)Clayton, Lorenc, and Barker</label><mixed-citation>
Clayton, A. M., Lorenc, A. C., and Barker, D. M.: Operational implementation
of a hybrid ensemble/4D-Var global data assimilation system at the Met
Office, Q. J. Roy. Meteorol. Soc., 139, 1445–1461, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Evensen(1994)</label><mixed-citation>
Evensen, G.: Sequential data assimilation with a nonlinear quasi-geostrophic
model using monte-carlo methods to forecast error statistics, J.
Geophys. Res.-Oceans, 99, 10143–10162, 1994.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Grassberger and Procaccia(1983)</label><mixed-citation>
Grassberger, P. and Procaccia, I.: Measuring the strangeness of strange
attractors, Physica D, 9, 189–208, 1983.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Hoel(1984)</label><mixed-citation>
Hoel, P. G.: Introduction to mathematical statistics, 5th Edn., Wiley, 1984.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Houtekamer et al.(1996)Houtekamer, Lefaivre, Derome, Ritchie, and
Mitchell</label><mixed-citation>
Houtekamer, P., Lefaivre, L., Derome, J., Ritchie, H., and Mitchell, H.: A
system simulation approach to ensemble prediction, Mon. Weather Rev., 124, 1225–1242, 1996.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Kailath(1968)</label><mixed-citation>
Kailath, T.: An innovations approach to least-squares estimation, Part I:
linear filtering in additive white noise, IEEE T. Autom. Control, 13, 646–655, 1968.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Lorenz(1963)</label><mixed-citation>
Lorenz, E. N.: Deterministic nonperiodic flow, J. Atmos. Sci., 20, 130–148, 1963.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Lorenz(1995)</label><mixed-citation>
Lorenz, E. N.: Predictability: a problem partly solved, in: Proceedings of
the seminar on predictability, vol. I, ECMWF, Reading, Berkshire, UK, 1–18, 1995.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Peitgen et al.(1992)Peitgen, Jurgens, and Saupe</label><mixed-citation>
Peitgen, H. O., Jurgens, H., and Saupe, D.: Chaos and Fractals: New Frontiers
of Science, Springer-Verlag, New York, 1992.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Saetra et al.(2004)Saetra, Hersbach, Bidlot, and
Richardson</label><mixed-citation>
Saetra, O., Hersbach, H., Bidlot, J.-R., and Richardson, D. S.: Effects of
observation errors on the statistics for ensemble spread and reliability,
Mon. Weather Rev., 132, 1487–1501, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Weigel(2011)</label><mixed-citation>
Weigel, A. P.: Ensemble forecasts, in: Forecast verification: A practitioner's
guide in atmospheric science, edited by: Jolliffe, I. T. and Stephenson,
D. B., Wiley-Blackwell, Chichester, England, p. 144, 2011.</mixed-citation></ref>

  </ref-list><app-group content-type="float"><app><title/>

    </app></app-group></back>
    </article>
