<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">NPG</journal-id><journal-title-group>
    <journal-title>Nonlinear Processes in Geophysics</journal-title>
    <abbrev-journal-title abbrev-type="publisher">NPG</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Nonlin. Processes Geophys.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1607-7946</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/npg-33-233-2026</article-id><title-group><article-title>Boosting ensembles for statistics of tails at  conditionally optimal advance split times</article-title><alt-title>BEST COAST</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1 aff2">
          <name><surname>Finkel</surname><given-names>Justin</given-names></name>
          <email>jfinkel@uchicago.edu</email>
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>O'Gorman</surname><given-names>Paul A.</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, United States</institution>
        </aff>
        <aff id="aff2"><label>a</label><institution>Current affiliation: Department of Geophysical Sciences and the Data Science Institute,  University of Chicago, 5801 S. Ellis Ave, Chicago, IL 60637, United States</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Justin Finkel (jfinkel@uchicago.edu)</corresp></author-notes><pub-date><day>28</day><month>May</month><year>2026</year></pub-date>
      
      <volume>33</volume>
      <issue>2</issue>
      <fpage>233</fpage><lpage>265</lpage>
      <history>
        <date date-type="received"><day>18</day><month>October</month><year>2025</year></date>
           <date date-type="rev-request"><day>3</day><month>November</month><year>2025</year></date>
           <date date-type="rev-recd"><day>16</day><month>March</month><year>2026</year></date>
           <date date-type="accepted"><day>29</day><month>April</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Justin Finkel</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026.html">This article is available from https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026.html</self-uri><self-uri xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026.pdf">The full text article is available as a PDF file from https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e100">Climate science needs more efficient ways to study high-impact, low-probability extreme events, which are rare by definition and costly to simulate in large numbers. Rare event sampling (RES), including ensemble boosting, offers a novel strategy to extract more information from those occasional simulated events, by applying small perturbations to turn a moderate event into a severe one which otherwise might not come for many more simulation-years. But how severe the events can become, and their estimated probabilities, depend sensitively on the details of the perturbation. In particular, for sudden and transient events like precipitation, performance of boosting depends sensitively on the choice of <italic>advance split time</italic> (AST) of the perturbation. Heuristically, the perturbation must come early enough before the event to let the ensemble of simulations diversify, but not so early that they forget the special initial conditions that led to the extreme. In pursuit of guidelines for choosing the AST, we study the effect of AST in the task of sampling extreme fluctuations of a passive tracer in a quasigeostrophic turbulent channel flow. This model system is idealized, but captures key elements of midlatitude storm track dynamics while exposing similar algorithmic challenges. We formulate AST selection as a concrete optimization problem for statistical accuracy against a ground truth. Given that such a ground truth would not generally be available, we propose a proxy objective function to optimize in practice: <italic>thresholded entropy</italic>, which rewards ensembles with both a high mean and a large spread. We show that ensemble boosting, when given a well-chosen AST and equipped with methods to estimate probabilities, can accurately sample extremes at long return periods. We furthermore find evidence that thresholded entropy successfully identifies an optimal AST, which is roughly 1–3  ddy turnover timescales in the quasigeostrophic system. Moreover, this proxy captures the <italic>variation</italic> of AST with the target location of the tracer within the flow field, suggesting it can generalize to more general chaotic systems including realistic climate models. Applying our boosting methodology at scale will require further development in adaptive optimization strategies, but our work here is an essential first step for establishing what must be optimized.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
<sec id="Ch1.S1.SS1">
  <label>1.1</label><title>Background and motivation</title>
      <p id="d2e128">The outsize impact of extreme weather events, and the need to understand the physical processes that cause them, have driven substantial research interest in the tails of climatological probability distributions. The fundamental challenge is scarcity of data: the historical record is too short to enable robust estimation of extremes rarer than a few times per century, even if the climate were stationary. Different modeling paradigms have developed to confront the issue. The most straightforward is direct numerical simulation (DNS), whereby a climate model is integrated extensively and the extreme events tallied, either as a single long run with stationary forcing <xref ref-type="bibr" rid="bib1.bibx28 bib1.bibx51" id="paren.1"><named-content content-type="pre">e.g.,</named-content></xref> or as an ensemble with non-stationary forcing <xref ref-type="bibr" rid="bib1.bibx71 bib1.bibx32" id="paren.2"><named-content content-type="pre">e.g.,</named-content></xref>. This increases the sample size of extreme events, and reduces the relative error (mean/standard deviation) of an empirical estimate <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>p</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi mathvariant="italic">#</mml:mi><mml:mi mathvariant="normal">extremes</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="italic">#</mml:mi><mml:mi mathvariant="normal">total</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">samples</mml:mi></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula>, but at a slow rate of <inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:msqrt><mml:mrow><mml:mi mathvariant="double-struck">V</mml:mi><mml:mo>[</mml:mo><mml:mover accent="true"><mml:mi>p</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>]</mml:mo></mml:mrow></mml:msqrt><mml:mrow><mml:mi mathvariant="double-struck">E</mml:mi><mml:mo>[</mml:mo><mml:mover accent="true"><mml:mi>p</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>]</mml:mo></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:msqrt><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>p</mml:mi><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:msqrt><mml:mi>p</mml:mi></mml:mfrac></mml:mstyle><mml:mo>∼</mml:mo><mml:mo>(</mml:mo><mml:mi>N</mml:mi><mml:mi>p</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M3" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>≪</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> <xref ref-type="bibr" rid="bib1.bibx84" id="paren.3"/>. For example, estimating the probability of a once-per-century storm (<inline-formula><mml:math id="M4" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn></mml:mrow></mml:math></inline-formula> yr<sup>−1</sup>) to within 10 % relative error would take roughly <inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">0.01</mml:mn></mml:mfrac></mml:mstyle><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:msup><mml:mo>)</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">4</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> model years. Most of that simulation time is wasted, just waiting for the next event.</p>
      <p id="d2e325">Rare event sampling (RES) takes a shortcut by repurposing that time to generate more extremes instead, perturbing simulations in a targeted way to favor extreme behavior – with the tradeoff of having to account for bias properly. The need to track probabilities makes rare event <italic>sampling</italic> distinct from <italic>optimization</italic>, i.e., finding the most extreme event possible (or plausible) given physical constraints. That problem has been attacked successfully with constrained optimization algorithms by <xref ref-type="bibr" rid="bib1.bibx14" id="text.4"/> and <xref ref-type="bibr" rid="bib1.bibx5" id="text.5"/> for extreme dissipation events in turbulence, and in AI-based weather forecasting by <xref ref-type="bibr" rid="bib1.bibx80" id="text.6"/> for extreme heat waves. RES can benefit from these techniques, but aims to represent the entire tail <italic>distribution</italic> of extremes with statistical fidelity and not just the maximum.</p>
      <p id="d2e347">RES was first developed for nuclear safety assessment <xref ref-type="bibr" rid="bib1.bibx35" id="paren.7"/>, and has since been generalized for diverse applications including structural reliability engineering <xref ref-type="bibr" rid="bib1.bibx1" id="paren.8"/>, molecular dynamics <xref ref-type="bibr" rid="bib1.bibx83" id="paren.9"/>, and more recently climate and weather <xref ref-type="bibr" rid="bib1.bibx63 bib1.bibx79 bib1.bibx2" id="paren.10"><named-content content-type="pre">e.g.,</named-content></xref>.  RES stands in contrast to many other strategies which, in one way or another, replace the expensive physical model with a cheaper approximation. Extreme value theory gives principles for parametrically estimating distributions tails <xref ref-type="bibr" rid="bib1.bibx10" id="paren.11"/>, but its asymptotic assumptions are not always justified by the finite datasets available, and it is best suited to model univariate distributions (e.g., average temperature over a region) rather than full spatiotemporal processes like storms, although spatial extreme value modeling is steadily progressing <xref ref-type="bibr" rid="bib1.bibx29 bib1.bibx30" id="paren.12"/>. Hybrid statistical/physical models aim to parameterize physical processes rather than the final output statistics, and include linear inverse models <xref ref-type="bibr" rid="bib1.bibx54" id="paren.13"/>; stochastic weather generators based on analogues or Markov state models <xref ref-type="bibr" rid="bib1.bibx74 bib1.bibx24 bib1.bibx82 bib1.bibx19 bib1.bibx57" id="paren.14"/>; empirical downscaling <xref ref-type="bibr" rid="bib1.bibx73 bib1.bibx66 bib1.bibx64" id="paren.15"/>; statistical (including machine-learned) emulation <xref ref-type="bibr" rid="bib1.bibx69 bib1.bibx6 bib1.bibx44 bib1.bibx45" id="paren.16"/>; and generative modeling <xref ref-type="bibr" rid="bib1.bibx78 bib1.bibx68 bib1.bibx25" id="paren.17"/>. Machine learning models in particular are proliferating at a dizzying pace, and they can indeed generate new samples at low cost, but their ability to represent physics outside their training data – perhaps the most essential requirement for extreme event modeling – is rightly regarded with suspicion.</p>
      <p id="d2e386">In light of these options, modelers have several tools to help deal with the tradeoff between bias (incorrect physics or limited resolution) and variance (erratic statistical estimates due to limited sample size). The methods are not mutually exclusive, with many interesting synergies possible <xref ref-type="bibr" rid="bib1.bibx43" id="paren.18"><named-content content-type="pre">e.g., as conceptualized in</named-content></xref>, but RES in particular is our focus here as an under-utilized and under-developed strategy to reduce variance without incurring extra bias.</p>
</sec>
<sec id="Ch1.S1.SS2">
  <label>1.2</label><title>Rare event sampling: promise and pitfalls</title>
      <p id="d2e402">The generic RES procedure can be summarized as follows. We denote the full state vector by <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>∈</mml:mo><mml:msup><mml:mi mathvariant="double-struck">R</mml:mi><mml:mi>d</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, and the measure of <italic>severity</italic> by <inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>: some functional of a trajectory <inline-formula><mml:math id="M9" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> that is user-defined, e.g., rainfall averaged over any time interval and spatial region of interest. <list list-type="order"><list-item>
      <p id="d2e449">Generate an ensemble of initial conditions to serve as candidate extreme events. Call these “ancestors”.</p></list-item><list-item>
      <p id="d2e453">Select a subset of ancestors with high propensity to produce extreme events (large <inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>), discarding the others. Apply small perturbations to this subset to generate “descendants”: new simulations likely to generate large <inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> like their parents, but to do so in diverse ways.</p></list-item><list-item>
      <p id="d2e479">Adjust the probability weights downward on these selected ancestors, spreading their weight across their descendants to correct for the over-sampling.</p></list-item><list-item>
      <p id="d2e483">Repeat steps 2–3 multiple times on the new, extreme-skewed population, until hitting a termination criterion.</p></list-item><list-item>
      <p id="d2e487">Estimate any climatological statistics of interest by taking weighted averages of all the simulations.</p></list-item></list></p>
      <p id="d2e490">This template must be specialized for the kind of target event. Diffusion Monte Carlo (DMC), as applied to season-long hot extremes <xref ref-type="bibr" rid="bib1.bibx63" id="paren.19"><named-content content-type="pre">with a variant called “GKTL” after its inventors;</named-content></xref> and tropical cyclones <xref ref-type="bibr" rid="bib1.bibx79" id="paren.20"><named-content content-type="pre">with a variant called “QDMC” that applies quantile mapping to intensity values;</named-content></xref>, performs the split/kill operation at a chronological sequence of time points, extending the timespan of surviving members while aborting discarded members before they can run to completion—thus, before their <inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> values can even be measured. This is appropriate when the propensity for a <italic>future</italic> extreme <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is well-approximated by some property <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> measurable at the <italic>present</italic>: for example, if <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is the mean temperature from June to August, <inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M17" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> (running average temperature from 1 June to <inline-formula><mml:math id="M18" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>) is a good splitting criterion <xref ref-type="bibr" rid="bib1.bibx63" id="paren.21"/>. If <inline-formula><mml:math id="M19" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is peak wind speed over a tropical cyclone's lifetime, <inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>=</mml:mo></mml:mrow></mml:math></inline-formula> (minimum sea-level pressure in the eye) is a good splitting criterion <xref ref-type="bibr" rid="bib1.bibx79" id="paren.22"/>.</p>
      <p id="d2e637">But suppose that no good predictor exists. In particular, assume that the severity function <inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> of a simulation is the maximum over the event's timespan of a user-defined observable <inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, such as the accumulated rainfall over a small region between <inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> day and <inline-formula><mml:math id="M24" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, which we generically call the <italic>intensity</italic> function. Assume further that no better predictor for <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is known besides <inline-formula><mml:math id="M26" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> itself at the present time. In this case, a better choice of RES algorithm might be adaptive multilevel splitting <xref ref-type="bibr" rid="bib1.bibx11" id="paren.23"><named-content content-type="pre">AMS;</named-content></xref>, or more general versions such as “anticipated AMS” <xref ref-type="bibr" rid="bib1.bibx65" id="paren.24"/> and “trying-early” AMS (TEAMS), which we previously introduced in <xref ref-type="bibr" rid="bib1.bibx17" id="text.25"/> – itself a special case of subset simulation <xref ref-type="bibr" rid="bib1.bibx1" id="paren.26"/> from engineering – in which every ensemble member runs to completion and produces an actual value of <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, not some proxy for it. Descendants are then spawned from the ancestor at some <italic>advance split time</italic> (AST) <inline-formula><mml:math id="M28" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> before <inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is achieved, to give them enough time to diversify and perhaps exceed their ancestor's severity, but not so much time to forget their ancestor's special initial conditions. Figure <xref ref-type="fig" rid="F1"/> illustrates this tradeoff when selecting AST in the context of a simple stochastic system, namely Langevin dynamics <xref ref-type="bibr" rid="bib1.bibx53" id="paren.27"/> with a logarithmic potential which is specified in Appendix A, but the picture alone conveys the essential phenomenon of an <italic>optimal</italic> AST. The existence of a nontrivial (i.e., strictly positive) optimum is obvious when looking at isolated events, but its precise value is subtle to quantify when our purpose relates to <italic>climatological</italic> statistics, i.e., averages over many events.</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e775">Schematic summarizing the ensemble boosting and tail estimation procedure, using a simple Langevin dynamics with a potential that is quadratic for <inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mi>x</mml:mi><mml:mo>∈</mml:mo><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.25</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0.25</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> – the blue-shaded region in <bold>(a)</bold>- – and logarithmic outside this range. Appendix A specifies the system completely. The position variable <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:mi>X</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> exhibits intermittent, transient extremes <bold>(a.i)</bold> and power law tails <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:mi mathvariant="double-struck">P</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:mo>|</mml:mo><mml:mi>X</mml:mi><mml:mo>|</mml:mo><mml:mo>&gt;</mml:mo><mml:mo>|</mml:mo><mml:mi>x</mml:mi><mml:mo>|</mml:mo><mml:mo mathvariant="italic">}</mml:mo><mml:mo>∼</mml:mo><mml:mo>|</mml:mo><mml:mi>x</mml:mi><mml:msup><mml:mo>|</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3.1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> <bold>(a.ii)</bold>. We set a threshold for severity (horizontal black dashed line) at roughly the minimum probability estimable from a relatively short (duration 1600) timeseries (see the black empirical PDF in a.ii and the black empirical complementary CDFs (CCDFs) in <bold>(b, c, d).iii</bold>, as compared with the true PDF and CCDF in gray). We then identify the peaks over the threshold (marked by vertical black dashed lines in <bold>a.i</bold>), and perturb the simulation in advance of these peaks. Three choices of advance split time (AST) are shown in rows <bold>(b)</bold>–<bold>(d)</bold> marked by vertical red lines, each resulting in “boosted” peak ensembles, shown as red curves in <bold>(b–d).(i,ii)</bold> and summarized by CCDFs shown in light red in <bold>(b–d).(iii)</bold>. Combining these conditional CCDFs together using the “MoCTail” estimator introduced later in Eq. (<xref ref-type="disp-formula" rid="Ch1.E17"/>) gives the dark red dashed line, which is meant to approximate the ground truth (gray line) better than the short DNS alone can do, including by going to higher values of <inline-formula><mml:math id="M33" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>. The intermediate AST <bold>(c)</bold> is best among the three for this task, and our goal is to formulate and characterize this optimal AST more generally.</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f01.png"/>

        </fig>

      <p id="d2e902">There is no general procedure for selecting AST and other hyperparameters, which impedes the application of RES methods to arbitrary target events and models. We have shown empirically in <xref ref-type="bibr" rid="bib1.bibx17" id="text.28"/> the existence of an optimal AST – in the sense of accuracy of long return period estimates – that is roughly approximated by the time until <inline-formula><mml:math id="M34" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> of error saturation. But this result might be specific to the Lorenz-96 system and a number of choices made in <xref ref-type="bibr" rid="bib1.bibx17" id="text.29"/>, in particular relating to <list list-type="order"><list-item>
      <p id="d2e924">The target variable defining intensity (energy density, <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>, with site index <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, though for Lorenz-96 all sites are statistically equivalent).</p></list-item><list-item>
      <p id="d2e953">The spatial and temporal scale for averaging the target variable (we simply studied the instantaneous maximum at a single site, <inline-formula><mml:math id="M37" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>).</p></list-item><list-item>
      <p id="d2e969">The stochastic parameterization (smooth in space, white in time).</p></list-item><list-item>
      <p id="d2e973">The metric in which to measure distances between ensemble members (Euclidean distance, <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:mi>D</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>′</mml:mo></mml:msup><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>K</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:mrow></mml:math></inline-formula>).</p></list-item></list></p>
      <p id="d2e1040">Practitioners working with models more complicated than Lorenz-96 face a vast menu of choices in all four domains, the first two falling under the purview of domain science and the last two falling under algorithm design. If the physical model or the choice of target variable changes, it stands to reason that the choice of metric should also change, and any single prescription of AST <inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:mi mathvariant="normal">like</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">the</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle><mml:mtext>-</mml:mtext><mml:mi mathvariant="normal">saturation</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">time</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula> is unlikely to work for all cases. Indeed, in our recent application of TEAMS to extremes of temperature and daily precipitation in a general circulation model, we found that the <inline-formula><mml:math id="M40" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> rule provided some guidance but underestimated the optimal AST for both temperature and precipitation <xref ref-type="bibr" rid="bib1.bibx18" id="paren.30"/>. Error norms incorporating global information will be less relevant than local norms around the target region, which tend to saturate more slowly <xref ref-type="bibr" rid="bib1.bibx18" id="paren.31"/>.</p>
      <p id="d2e1087">Our primary goal in this study is to establish a general principle for optimizing AST for intermittent extreme events in meteorologically relevant dynamical systems. To balance computational economy with physical realism, we select a system of intermediate complexity between Lorenz-96 and a moist GCM: a 2-layer quasigeostrophic (QG) flow with a passive tracer. Since its original formulation in <xref ref-type="bibr" rid="bib1.bibx55" id="text.32"/>, the 2-layer QG model has served as a useful paradigmatic minimal model for baroclinic instability and associated jets, waves, and vortices in the atmosphere and ocean. It has been augmented in many ways to study specific processes, for example by <xref ref-type="bibr" rid="bib1.bibx36" id="text.33"/>, who coupled in a moisture component and found the resulting precipitation and latent heating to strongly affect the balance between waves and vortices in the underlying flow. However, even the simpler addition of a <italic>passive</italic> tracer – one without feedback through latent heating – is enough to advance the algorithmic questions we pursue here. Passive tracer dynamics is physically interesting in its own right, as seen by many studies of <italic>intermittency</italic> and heavy-tailed tracer statistics in turbulent flows <xref ref-type="bibr" rid="bib1.bibx9 bib1.bibx26 bib1.bibx58" id="paren.34"/>. In climate science too, extremes of pollution concentration and temperature can be captured partially through passive tracer advection <xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx48 bib1.bibx39" id="paren.35"/>.</p>
      <p id="d2e1109">Our choice of the 2-layer QG model as a test system is thus a major upgrade in physical relevance as well as algorithmic difficulty from Lorenz-96, which resembles QG dynamics only loosely via its Hopf bifurcation structure <xref ref-type="bibr" rid="bib1.bibx75" id="paren.36"/>. This path up the model hierarchy has been trodden before by <xref ref-type="bibr" rid="bib1.bibx59 bib1.bibx60" id="text.37"/>, who added passive tracers to Lorenz-96 and a QG model respectively and studied extreme fluctuations in the tracer's Fourier modes. Also,  <xref ref-type="bibr" rid="bib1.bibx21" id="text.38"/> quantified extreme value statistics – including local and global statistics – of QG wind fields themselves. All these works have inspired and guided this one, but we focus distinctly on the link between <italic>short-time perturbation dynamics</italic> and <italic>long-term climate statistics</italic>.</p>
      <p id="d2e1127">The QG model has enough “space” to explore the effects of all four decision axes listed above on optimal AST. In principle, one can do this with an exhaustive suite of experiments: for every target region (location, size) and every version of stochastic input (e.g., perturbation magnitude and spatial scale) of interest, run TEAMS with a wide range of AST parameters, measure the skill of each AST in matching a reference ground truth distribution, and select the optimal AST. In practice, this exhaustive procedure is not feasible, in part because of the huge number of potential targets, but more  fundamentally because TEAMS' performance is <italic>highly subject to randomness</italic>. Measuring the effect of any parameter change on the algorithm's performance requires many repetitions – several dozen at least – to average out the variability inherent in Monte Carlo. Moreover, other hyperparameters related to “population management” exist within TEAMS and other rare event algorithms: the number of initial ensemble members, how many of them to kill and clone at every iteration, and the termination criterion, to name a few. Randomness appears not only as physical forcing, but also in selecting which members to clone, thus interacting tightly with the population hyperparameters. One can think of this as confounding due to sampling bias, which further blurs the imprint of AST itself on performance.</p>
      <p id="d2e1134">So instead of using TEAMS for our investigation, we turn to a related method of ensemble boosting <xref ref-type="bibr" rid="bib1.bibx23 bib1.bibx20" id="paren.39"/>. The idea of ensemble boosting is simple: identify some extremes from an initial climatic timeseries, and re-simulate them with perturbed antecedent conditions to generate unrealized but physically plausible (and possibly more extreme) scenarios. By focusing on a limited set of ancestor events to boost, we avoid the additional randomness that occurs in TEAMS as the level is raised and additional ensemble members are stochastically added, which simplifies our investigation. In addition, <xref ref-type="bibr" rid="bib1.bibx4" id="text.40"/> has developed an approach to estimate probabilities based on the boosted ensembles, and we have also been developing such an estimator that is introduced below. With the addition of an ability to estimate probabilities, ensemble boosting may now be viewed as an RES algorithm.</p>
      <p id="d2e1143">We suspect that the optimal AST is closely related to a physically intrinsic quantity that is not particular to a given algorithm. Analogously to Lyapunov exponents, which encode the timescale for small perturbations to double, the optimal AST should encode the timescale for <italic>extreme values of some target variable</italic> to <italic>maximize in variability</italic>. This statement is heuristic, and a primary goal here is to propose some quantities that are very close to the optimal AST and that, like Lyapunov exponents, are intrinsic to the system and do not depend on arbitrary algorithmic choices. We propose and evaluate several candidates, including <italic>entropy</italic> and <italic>expected improvement</italic>: two functionals of ensemble distributions which are drawn from reinforcement learning.</p>
      <p id="d2e1158">We have three major contributions. First, we develop a new estimator for low probabilities of extreme fluctuations from boosted ensembles, similar to the estimator of <xref ref-type="bibr" rid="bib1.bibx4" id="text.41"/> but distinct in the aggregation step. Our approach includes an optional parametric fit of the response function to perturbations (applicable to both estimators), a simple quadratic regression model that imposes regularity on the resulting severity distribution. Second, we use the two estimators to measure the quality of a range of ASTs across a range of target events (tracer concentration at different target locations), finding evidence for an entropy-based optimality principle. Third, and most importantly from a practical perspective, we demonstrate that both estimators successfully approximate low probabilities when the ensembles are launched from a good AST, which the optimality principle can help to select efficiently. Our goal here is not to demonstrate a performant rare event algorithm – only to elucidate a necessary ingredient (AST) to be optimized in future algorithms – but even when comparing statistical errors at equal cost, we find (and report at the end of the analysis, in Fig. <xref ref-type="fig" rid="F13"/>) that our boosted ensembles are already competitive with an equal-cost DNS.</p>
      <p id="d2e1166">The rest of the paper is organized as follows. Section <xref ref-type="sec" rid="Ch1.S2"/> details the procedure of generating samples and estimating tail statistics, at a model-agnostic level, and proposes several candidate indicators of measuring ensemble dispersion that may help select an optimal AST. Section <xref ref-type="sec" rid="Ch1.S3"/> specifies the QG system, its numerical simulation, and its extreme value statistics. Section <xref ref-type="sec" rid="Ch1.S4"/> specifies the perturbed-ensemble design at a model-specific level. Section <xref ref-type="sec" rid="Ch1.S5"/> visualizes some examples of perturbed events, and how the AST selection criteria behave on these examples. Section <xref ref-type="sec" rid="Ch1.S6"/> reports the performance of different AST choices, and visualizes the overall “optimization landscape”. Section <xref ref-type="sec" rid="Ch1.S7"/> concludes with an outlook and proposed roadmap for subsequent research – theoretical, algorithmic, and applied.</p>
</sec>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Sampling and estimation methodology</title>
      <p id="d2e1191">Our methodology can be separated into three parts, summarized here and expounded in three subsections. For a given target variable and location defining the extreme event, we <list list-type="order"><list-item>
      <p id="d2e1196">run a relatively short direct numerical simulation (“short DNS”), identify the extreme events within it, and generate a dataset of boosted ensembles for each event at a range of ASTs;</p></list-item><list-item>
      <p id="d2e1200">estimate tail distributions, conditional on the event and the AST;</p></list-item><list-item>
      <p id="d2e1204">combine the conditional tails into an unconditional (“climatological”) tail, using the estimators specified below, for a range of ASTs, and select the optimal AST based on the skill of the corresponding tail estimate in reproducing the tail of a “long DNS”.</p></list-item></list></p>
      <p id="d2e1207">We then display the results of applying this procedure to a range of target locations in the model flow domain.</p>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Generating the dataset of boosted ensembles</title>
      <p id="d2e1217">There are many design choices in ensemble boosting <xref ref-type="bibr" rid="bib1.bibx23" id="paren.42"/>: how to select extreme events to boost, how many boosts to generate, when to launch them, etc. This subsection details the choices used here.</p>
      <p id="d2e1223">We run a direct numerical simulation (“short DNS”) <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>:</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>≤</mml:mo><mml:mi>t</mml:mi><mml:mo>≤</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, long enough to generate some extremes but not enough to estimate probabilities smaller than <inline-formula><mml:math id="M42" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:msub><mml:mi>T</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">100</mml:mn><mml:mo>/</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for a relative error tolerance of <inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn></mml:mrow></mml:math></inline-formula>. The premise of RES, and ensemble boosting, is that the extremes it does generate might have been even worse, perhaps just a butterfly flap away from the more intense extremes one would see with a “long DNS” of duration <inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi mathvariant="normal">long</mml:mi></mml:msub><mml:mo>≫</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. We generate such a long DNS as well to serve as a ground-truth for validation. Following the ensemble boosting methodology laid out in <xref ref-type="bibr" rid="bib1.bibx23 bib1.bibx22" id="text.43"/> <xref ref-type="bibr" rid="bib1.bibx20" id="text.44"/> and <xref ref-type="bibr" rid="bib1.bibx50" id="text.45"/>, we first identify a threshold <inline-formula><mml:math id="M45" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> with exceedance probability <inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> that is moderate enough to estimate precisely with the short DNS. In other words, <inline-formula><mml:math id="M47" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> is the <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>th quantile, or “<inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>th complementary quantile”. Equivalently, <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the <italic>complementary cumulative density function</italic> (CCDF) of the random variable <inline-formula><mml:math id="M51" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>, evaluated at <inline-formula><mml:math id="M52" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>. In line with the <italic>peaks-over-threshold</italic> procedure <xref ref-type="bibr" rid="bib1.bibx10" id="paren.46"/>, we take cluster maxima of exceedances above <inline-formula><mml:math id="M53" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> as the “ancestral” extreme events. Concretely, a cluster maximum is a state from the DNS,  <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>=</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, such that

            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M55" display="block"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>=</mml:mo><mml:mi>R</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mi mathvariant="normal">max</mml:mi><mml:mfenced open="{" close="}"><mml:mrow><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>:</mml:mo><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub><mml:mo>≤</mml:mo><mml:mi>t</mml:mi><mml:mo>≤</mml:mo><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>+</mml:mo><mml:mi>B</mml:mi></mml:mrow></mml:mfenced><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M57" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> are buffer times longer than the mixing timescale of the dynamics (i.e., how long two perturbed simulations need to become independent), ensuring that two consecutive events <inline-formula><mml:math id="M58" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are genuinely independent from each other. <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is an upper bound on the ASTs used for boosting.</p>
      <p id="d2e1626">We collect all such peaks occurring in the short DNS,

            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M60" display="block"><mml:mrow><mml:mfenced close="}" open="{"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>=</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>:</mml:mo><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

          and for a sequence of increasing ASTs <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>:</mml:mo><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, …, <inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:mi>J</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> bounded between 0 and <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, launch an ensemble of descendants <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>:</mml:mo><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, …, <inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> by applying <inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> different perturbations to the DNS at time <inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, and running each simulation to time <inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>+</mml:mo><mml:mi>B</mml:mi></mml:mrow></mml:math></inline-formula>. Note that <inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> could in principle vary between ancestors <inline-formula><mml:math id="M70" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> and lead times <inline-formula><mml:math id="M71" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>, which is not needed for our exhaustive sweeps in this paper, but certainly would be needed in an “online” rare event sampling procedure that iteratively homes in on a subset of the most extreme-ogenic ancestors <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mi>n</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> and ASTs <inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mi>j</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> to draw more samples from.</p>
      <p id="d2e1882">A bit more notation helps clarify how the perturbing is done, abstractly at first and concretely in Sect. <xref ref-type="sec" rid="Ch1.S3"/> when we specialize to the QG system. For each (<inline-formula><mml:math id="M74" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M75" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M76" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula>), we draw a random sample <inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ω</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> from some sample space <inline-formula><mml:math id="M78" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula>. Denoting by <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msup><mml:mo>:</mml:mo><mml:msup><mml:mi mathvariant="double-struck">R</mml:mi><mml:mi>d</mml:mi></mml:msup><mml:mo>×</mml:mo><mml:mi mathvariant="normal">Ω</mml:mi><mml:mo>→</mml:mo><mml:msup><mml:mi mathvariant="double-struck">R</mml:mi><mml:mi>d</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> the flow map that integrates the perturbed dynamics forward by a time interval <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:math></inline-formula>, the (<inline-formula><mml:math id="M81" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M82" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M83" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula>)th descendant's trajectory through state space <inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="double-struck">R</mml:mi><mml:mi>d</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> can be written

            <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M85" display="block"><mml:mrow><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mfenced close="" open="{"><mml:mtable class="array" columnalign="left left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="normal">for</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub><mml:mo>≤</mml:mo><mml:mi>t</mml:mi><mml:mo>≤</mml:mo><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">ω</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="normal">for</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&lt;</mml:mo><mml:mi>t</mml:mi><mml:mo>≤</mml:mo><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>+</mml:mo><mml:mi>B</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:math></disp-formula>

          In words, the descendant shares its ancestor's past up until the time of perturbation <inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, after which it diverges.</p>
      <p id="d2e2225">There are two main forms of commonly used perturbation. An <italic>impulsive</italic> perturbation is a kick applied at a single time (which is used in ensemble boosting), in which case <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="double-struck">R</mml:mi><mml:mi>k</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> or <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="double-struck">C</mml:mi><mml:mi>k</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, typically with <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>≪</mml:mo><mml:mi>d</mml:mi></mml:mrow></mml:math></inline-formula>, and a sample <inline-formula><mml:math id="M90" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula> is transformed to spate space via a function <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mi>G</mml:mi><mml:mo>:</mml:mo><mml:msup><mml:mi mathvariant="double-struck">R</mml:mi><mml:mi>k</mml:mi></mml:msup><mml:mo>→</mml:mo><mml:msup><mml:mi mathvariant="double-struck">R</mml:mi><mml:mi>d</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> (e.g., a low-rank matrix multiplication). Then, the perturbed dynamics can be written <inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>+</mml:mo><mml:mi>G</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> with only one argument is the unperturbed dynamics. We also use the convention that <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:mi>G</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, i.e., <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> corresponds to no perturbation.</p>
      <p id="d2e2394">The other common case is where <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is a stochastic process, e.g., an Itô diffusion forced by white noise, as we used in <xref ref-type="bibr" rid="bib1.bibx17" id="text.47"/> as well as the schematic in Fig. <xref ref-type="fig" rid="F1"/>. In that case, <inline-formula><mml:math id="M97" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula> is a white noise process sampled at discrete times, whose dimensionality scales with the number of timesteps. In the QG experiments, we adhere to impulsive perturbations for three reasons: it introduces fewer arbitrary parameters, it is less disruptive to the system's intrinsic dynamics, and it keeps the dimensionality of the random space low. If, as we conjecture, even low-dimensional butterfly flaps are sufficient to excite the more extreme fluctuations, it would make deterministic search methods – which should always be preferred over Monte Carlo – more viable.</p>
      <p id="d2e2423">Following the perturbation, the descendant drifts away from the parent and achieves its own severity <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> (peak of its intensity function <inline-formula><mml:math id="M99" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>) at some time <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> possibly different from its ancestor's peak time <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>:

            <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M102" display="block"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>=</mml:mo><mml:mi>R</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ω</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></disp-formula>

          where the latter notation emphasizes dependence on <inline-formula><mml:math id="M103" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula>, while recognizing that each (<inline-formula><mml:math id="M104" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M105" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>) induces a different severity function <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> because perturbations may be felt differently depending on the initial condition.</p>
      <p id="d2e2602">If the perturbation is small, the descendant's peak time <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> will be close to the ancestor's peak time <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>. However, if the intensity function <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> tends to oscillate, e.g., with each passing Rossby wave crest, a large-enough perturbation might cause the next wave crest after <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> to outgrow the original peak, misappropriating the imposed perturbation to fuel a different event than the original target. Tersely, <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="normal">argmax</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> might be a discontinuous function of <inline-formula><mml:math id="M112" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> a non-differentiable function of <inline-formula><mml:math id="M114" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula>, which is a nuisance for our goal to optimize over <inline-formula><mml:math id="M115" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula> and, more importantly, complicates the causal chain between perturbation and response. We explicitly prohibit this behavior by restricting the range of <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> as follows. <list list-type="bullet"><list-item>
      <p id="d2e2768">Set an “argmax drift” parameter <inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:mi mathvariant="italic">δ</mml:mi><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> based on physical timescales, e.g., half an oscillation period. Initially set <inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>=</mml:mo><mml:mi mathvariant="normal">argmax</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>:</mml:mo><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:mi mathvariant="italic">δ</mml:mi><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>≤</mml:mo><mml:mi>t</mml:mi><mml:mo>≤</mml:mo><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>+</mml:mo><mml:mi mathvariant="italic">δ</mml:mi><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>.</p></list-item><list-item>
      <p id="d2e2883">If <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> is a local maximum in <inline-formula><mml:math id="M120" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>, then do not change it.</p></list-item><list-item>
      <p id="d2e2916">Otherwise, shift <inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> backward (if at the beginning of the interval) or forward (if at the end of the interval) until it is at a local maximum.</p></list-item></list> Although it is ad-hoc, this adjustment aims to uphold the core idea of ensemble boosting to <italic>augment existing events</italic>, while preserving their basic identity, rather than <italic>discover totally new events</italic> – which may as well be done by extending the DNS. In general this is a nontrivial condition to impose, as multiple spikes in a sequence may be dynamically correlated to each other, but we use only this simple strategy as demonstration.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Estimating conditional and climatological probabilities from boosted ensembles</title>
      <p id="d2e2957">Assume now there is a probability measure <inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="double-struck">P</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> on <inline-formula><mml:math id="M123" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula> with associated density function <inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:msup><mml:mi>p</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, which might for example place higher weight on smaller kicks. The <inline-formula><mml:math id="M125" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula> superscript will generally relate to statistics over this conditional probability measure, to distinguish it from long-term climatological statistics. A major aim of this paper is to show how they relate to each other. Each ensemble of descendants at each lead time gives rise to its own conditional severity distribution: 

            <disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M126" display="block"><mml:mrow><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="double-struck">P</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mfenced open="{" close="}"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mi mathvariant="normal">Ω</mml:mi></mml:munder><mml:mi mathvariant="double-struck">I</mml:mi><mml:mfenced open="{" close="}"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:msup><mml:mi>p</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

          which can be estimated from the samples <inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>:</mml:mo><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, …, <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>. Here <italic>conditional</italic> means starting with a perturbation of the <inline-formula><mml:math id="M129" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>th ancestor's particular initial condition at time <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and running forward until time <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>+</mml:mo><mml:mi>B</mml:mi></mml:mrow></mml:math></inline-formula>. By contrast, we refer to the <italic>climatological</italic> severity distribution as that resulting from a long DNS.</p>
      <p id="d2e3209">Integrals of the form (Eq. <xref ref-type="disp-formula" rid="Ch1.E5"/>) arise in many diverse risk analysis tasks, such as reliability engineering, where <inline-formula><mml:math id="M132" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula> often represents wind, waves, or tremors buffeting a built structure <xref ref-type="bibr" rid="bib1.bibx1 bib1.bibx47" id="paren.48"/>, and is therefore <italic>high-dimensional</italic>. The default strategy for high-dimensional sampling is vanilla Monte Carlo, whose infamously slow convergence has motivated more efficient workarounds. A particular class of “variational” <xref ref-type="bibr" rid="bib1.bibx12 bib1.bibx72" id="paren.49"/> and “first- and second-order reliability” methods <xref ref-type="bibr" rid="bib1.bibx8" id="paren.50"/> approximate the sampling by constrained optimization, relying on the large-deviation principle that increasingly rare events have a shrinking space of possible pathways, concentrating around a single point of <inline-formula><mml:math id="M133" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula>. We could certainly make use of those methods here, but there is a crucial distinction: in our setting, the perturbation space is an arbitrary design choice aiming at an indirect goal (climate estimation), rather than some externally imposed distribution (e.g., a Gaussian process model for ocean bathymetry in <xref ref-type="bibr" rid="bib1.bibx12" id="altparen.51"/> and <xref ref-type="bibr" rid="bib1.bibx72" id="altparen.52"/>). Therefore, nothing stops us here from deliberately choosing low-dimensional perturbations instead of high-dimensional ones as in <xref ref-type="bibr" rid="bib1.bibx63" id="text.53"/> and <xref ref-type="bibr" rid="bib1.bibx4" id="text.54"/>. This enables numerical quadrature instead of Monte Carlo or elaborate large-deviation approaches, and saves on cost by allowing sample re-use across different input distributions.</p>
      <p id="d2e3253">It is possible that higher-dimensional spaces are more effective for exciting extreme fluctuations, which would make the above-cited methodologies very useful for our purpose in future research. They can also be useful when conditional risk estimation (for near-term weather forecasting) is the end goal, as well as the previously-mentioned optimization methods demonstrated in <xref ref-type="bibr" rid="bib1.bibx14" id="text.55"/>, <xref ref-type="bibr" rid="bib1.bibx5" id="text.56"/>, and <xref ref-type="bibr" rid="bib1.bibx80" id="text.57"/>. But our first goal is to determine whether our chosen low-dimensional kicks can suffice for climatological estimation.</p>
      <p id="d2e3265">Based on the samples drawn from <inline-formula><mml:math id="M134" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula>, we fit a regression model <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">θ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> with parameters <inline-formula><mml:math id="M136" display="inline"><mml:mi mathvariant="italic">θ</mml:mi></mml:math></inline-formula>, in our case coefficients for linear and quadratic polynomials. In general <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> could be a more elaborate parametric model, e.g., a Gaussian process or neural network with learned weights <inline-formula><mml:math id="M138" display="inline"><mml:mi mathvariant="italic">θ</mml:mi></mml:math></inline-formula>, as often used in modern uncertainty quantification <xref ref-type="bibr" rid="bib1.bibx34 bib1.bibx67 bib1.bibx56" id="paren.58"/>. Then the integral over <inline-formula><mml:math id="M139" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula> can be estimated, either analytically (if <inline-formula><mml:math id="M140" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> take simple enough forms) or numerically by densely filling <inline-formula><mml:math id="M142" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula> with a grid of points, evaluating <inline-formula><mml:math id="M143" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M144" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> at each point, and taking the inner product of <inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:mi mathvariant="double-struck">I</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mo>*</mml:mo></mml:msup><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> with <inline-formula><mml:math id="M146" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> for any <inline-formula><mml:math id="M147" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>. The result is an estimate <inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for the conditional CCDF, <inline-formula><mml:math id="M149" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, obtained by replacing the <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> with <inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> in Eq. (<xref ref-type="disp-formula" rid="Ch1.E5"/>). The final step is to estimate the <italic>tail</italic> of the conditional CCDF,

            <disp-formula id="Ch1.E6" content-type="numbered"><label>6</label><mml:math id="M152" display="block"><mml:mrow><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="double-struck">P</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mfenced open="{" close="}"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi><mml:mo>|</mml:mo><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

          which we could do just by putting hats <inline-formula><mml:math id="M153" display="inline"><mml:mover accent="true"><mml:mrow><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> on the <inline-formula><mml:math id="M154" display="inline"><mml:mi>Q</mml:mi></mml:math></inline-formula>s on the right-hand side. However, this risks dividing by zero, because the fitted function <inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> may imply zero probability of exceeding the threshold, particularly at long ASTs when descendants have enough time to decorrelate totally with their ancestor. This loss of ancestral “wisdom” is a more fundamental problem than the numerical issue of zero denominator, and we address it by implementing a continuous version of the “accept-reject” step of the TEAMS procedure in <xref ref-type="bibr" rid="bib1.bibx17" id="text.59"/>. Wherever the descendant severity <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> falls below <inline-formula><mml:math id="M157" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>, we replace it with the ancestor severity, denoted <inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> (with no second subscript):

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M159" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E7"><mml:mtd><mml:mtext>7</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>:=</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mi mathvariant="normal">Ω</mml:mi></mml:munder><mml:mfenced close="}" open="{"><mml:mtable class="array" columnalign="left left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="double-struck">I</mml:mi><mml:mfenced open="{" close="}"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="normal">if</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="double-struck">I</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi>R</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mi mathvariant="normal">otherwise</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:msup><mml:mi>p</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi mathvariant="italic">ω</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E8"><mml:mtd><mml:mtext>8</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mo>=</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mrow><mml:mfenced close="}" open="{"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:munder><mml:mi mathvariant="double-struck">I</mml:mi><mml:mfenced close="}" open="{"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:msup><mml:mi>p</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>+</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mrow><mml:mfenced close="}" open="{"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>≤</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:munder><mml:mi mathvariant="double-struck">I</mml:mi><mml:mfenced open="{" close="}"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:msup><mml:mi>p</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi mathvariant="italic">ω</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E9"><mml:mtd><mml:mtext>9</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mo>=</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mi mathvariant="normal">Ω</mml:mi></mml:munder><mml:mi mathvariant="double-struck">I</mml:mi><mml:mfenced close="}" open="{"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:msup><mml:mi>p</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>+</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="double-struck">I</mml:mi><mml:mfenced open="{" close="}"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mrow><mml:mfenced open="{" close="}"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>≤</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:munder><mml:msup><mml:mi>p</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi mathvariant="italic">ω</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E10"><mml:mtd><mml:mtext>10</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:mo>=</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi mathvariant="double-struck">I</mml:mi><mml:mfenced open="{" close="}"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mfenced close="]" open="["><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfenced></mml:mrow></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          (<inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> when <inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> since <inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> is decreasing, hence the two terms in the last expression correspond to the two cases).</p>
      <p id="d2e4349">This estimator can be extended to other expectations of interest conditional on the target variable being extreme. Denote by <inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo>[</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>]</mml:mo><mml:mo>=</mml:mo><mml:mo>:</mml:mo><mml:msub><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> a generic function of the trajectory <inline-formula><mml:math id="M164" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> launched at AST <inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> ahead of ancestor <inline-formula><mml:math id="M166" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>, such as time-averaged wind speed or air temperature. It is actually a random variable (a function of <inline-formula><mml:math id="M167" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>) and its mean can be estimated by replacing each <inline-formula><mml:math id="M169" display="inline"><mml:mrow><mml:mi mathvariant="double-struck">I</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> with <inline-formula><mml:math id="M170" display="inline"><mml:mrow><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> to obtain 

            <disp-formula id="Ch1.E11" content-type="numbered"><label>11</label><mml:math id="M171" display="block"><mml:mrow><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="double-struck">E</mml:mi></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mfenced close="]" open="["><mml:mrow><mml:msub><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:mfenced><mml:mo>≈</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mi mathvariant="normal">Ω</mml:mi></mml:munder><mml:msub><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="double-struck">I</mml:mi><mml:mfenced open="{" close="}"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:msup><mml:mi>p</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo><mml:mfenced open="[" close="]"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:math></disp-formula>

          The first term collects statistics of the part of the <inline-formula><mml:math id="M172" display="inline"><mml:mi>W</mml:mi></mml:math></inline-formula>-disc that contributes to the tail <inline-formula><mml:math id="M173" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, and the second term just moves the remaining (“rejected”) probability mass back onto the ancestor at <inline-formula><mml:math id="M174" display="inline"><mml:mrow><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>. We do not explore the properties of this estimator for different <inline-formula><mml:math id="M175" display="inline"><mml:mi mathvariant="normal">Φ</mml:mi></mml:math></inline-formula>s, but note it could be important to an applied study with RES.</p>
      <p id="d2e4722">Another heuristic way to justify the accept-reject expression for <inline-formula><mml:math id="M176" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> in Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>) is to stipulate that we care about approximating <italic>only the extreme part of the boosting distribution</italic>, i.e., those <inline-formula><mml:math id="M177" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula>s near enough to 0 that <inline-formula><mml:math id="M178" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:math></inline-formula>, excluding the descendants bound to fall below <inline-formula><mml:math id="M179" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>. We re-allocate the probability mass in the “non-extreme” region of the disc (where <inline-formula><mml:math id="M180" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>≤</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:math></inline-formula>) to the very center of the disc (the ancestor, where <inline-formula><mml:math id="M181" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:math></inline-formula> by construction). This rearrangement ensures that <inline-formula><mml:math id="M182" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is close to 1, justifying a Taylor series expansion in <inline-formula><mml:math id="M183" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M184" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E12"><mml:mtd><mml:mtext>12</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mrow><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E13"><mml:mtd><mml:mtext>13</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mfenced close="]" open="["><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfenced></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E14"><mml:mtd><mml:mtext>14</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:mo>≈</mml:mo><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mfenced open="[" close="]"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfenced><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E15"><mml:mtd><mml:mtext>15</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:mo>≈</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mfenced close="]" open="["><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfenced><mml:mi mathvariant="double-struck">I</mml:mi><mml:mfenced open="{" close="}"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E16"><mml:mtd><mml:mtext>16</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mrow><mml:mo>=</mml:mo><mml:mo>:</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          The crux of our hypothesis is that these conditional distributions from boosting can be aggregated across ancestors to approximate the climatological distribution <inline-formula><mml:math id="M185" display="inline"><mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi><mml:mo>|</mml:mo><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M186" display="inline"><mml:mi mathvariant="normal">Θ</mml:mi></mml:math></inline-formula> is used to denote the ground truth that would be obtained from a long DNS. We specifically propose to aggregate the conditional CCDFs as a uniform mixture over ancestors, selecting one representative AST <inline-formula><mml:math id="M187" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:msub><mml:mi>j</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> from each ancestor <inline-formula><mml:math id="M188" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> to best represent its alternate realities according to some selection rule (different rules will be evaluated thoroughly for the QG system in Sect. <xref ref-type="sec" rid="Ch1.S6"/>). We write the mixture as

            <disp-formula id="Ch1.E17" content-type="numbered"><label>17</label><mml:math id="M189" display="block"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">M</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>j</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

          and call it the “MoCTail” estimator of <inline-formula><mml:math id="M190" display="inline"><mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, for “Mixture of Conditional Tails”.</p>
      <p id="d2e5385">The recent works <xref ref-type="bibr" rid="bib1.bibx50" id="text.60"/> and <xref ref-type="bibr" rid="bib1.bibx4" id="text.61"/> formulate a different estimator, which makes for an interesting comparison. Rather than summing <inline-formula><mml:math id="M191" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> tail CCDFs, each approximating a ratio of the form (Eq. <xref ref-type="disp-formula" rid="Ch1.E6"/>), they construct a single ratio by summing <inline-formula><mml:math id="M192" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> numerators and <inline-formula><mml:math id="M193" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> denominators. Translated into our own notation, this becomes

            <disp-formula id="Ch1.E18" content-type="numbered"><label>18</label><mml:math id="M194" display="block"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mi>P</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>j</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>j</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

          We call this the “PoPTail” estimator of <inline-formula><mml:math id="M195" display="inline"><mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, for “Pool of Perturbed Tails”. <xref ref-type="bibr" rid="bib1.bibx4" id="text.62"/> do not model <inline-formula><mml:math id="M196" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> parametrically, but instead use a standard Monte Carlo estimate <inline-formula><mml:math id="M197" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo></mml:mrow></mml:math></inline-formula> (fraction of descendants exceeding <inline-formula><mml:math id="M198" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>), which is probably necessary for their high-dimensional perturbations. However, we can convert the PoPTail estimator to our parametric version just by thinking in terms of CCDFs, hence the formulation in Eq. (<xref ref-type="disp-formula" rid="Ch1.E18"/>). The more important difference is that PoPTail avoids the potential degeneracy <inline-formula><mml:math id="M199" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> by “pooling” non-extreme descendants together with extreme ones in the denominator.</p>
      <p id="d2e5646">One could argue for either estimator based on the validity of its underlying assumptions which are challenging to rigorously verify. Here we adopt a more openly empirical perspective in testing the skill of both.</p>
      <p id="d2e5649">An important advantage of both estimators is <italic>extensibility</italic> with respect to the dataset: if the variance is too high, one can always either generate new ancestors by extending the short DNS, or extend the range of ASTs sampled, or enlarge the ensemble at any ASTs deemed promising, without discarding the laborious samples already generated. This is unfortunately not the case with an algorithm like AMS, TEAMS, GKTL, or QDMC: because of the random rules by which ancestors are selected and new members generated, a completed run cannot be enlarged while retaining its estimation properties unless we are willing to do an entirely new additional run and combine estimates from multiple runs as was done in <xref ref-type="bibr" rid="bib1.bibx63" id="text.63"/>, <xref ref-type="bibr" rid="bib1.bibx79" id="text.64"/> and <xref ref-type="bibr" rid="bib1.bibx17" id="text.65"/>. This results in waste during the fine-tuning process of calibrating TEAMS. For example, one might decide in retrospect that a TEAMS run was too aggressive in killing non-extreme simulations and raising the threshold and we cannot easily extend the run with a new set of hyperparameters. With boosting, we can simply go back, perturb those less-extreme simulations, and incorporate them into the dataset, without needing to re-generate everything. To make boosting competitive at sampling the highest levels of severity, we suspect it will be necessary to augment our current scheme with an iterative level-raising schedule, like TEAMS, but with less restriction on the sampling procedure.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Evaluating performance: statistical accuracy and computational cost</title>
      <p id="d2e5674">We evaluate the MoCTail and PoPTail estimators <inline-formula><mml:math id="M200" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">M</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M201" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mi mathvariant="normal">P</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> by comparing to the ground truth <inline-formula><mml:math id="M202" display="inline"><mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> as estimated from a long DNS. DNS is in fact a trivial special case of ensemble boosting with <inline-formula><mml:math id="M203" display="inline"><mml:mrow><mml:mi>M</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> (no descendants), reducing each summand of Eq. (<xref ref-type="disp-formula" rid="Ch1.E17"/>) and the numerator of Eq. (<xref ref-type="disp-formula" rid="Ch1.E18"/>) to <inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:mi mathvariant="double-struck">I</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi>R</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> and the denominator of Eq. (<xref ref-type="disp-formula" rid="Ch1.E18"/>) to <inline-formula><mml:math id="M205" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. Both estimators reduce to the same vanilla empirical CCDF in this case, and this is what we use to estimate <inline-formula><mml:math id="M206" display="inline"><mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e5780">We use <inline-formula><mml:math id="M207" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>-divergence to measure the disparity of <inline-formula><mml:math id="M208" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">M</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M209" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">P</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> from <inline-formula><mml:math id="M210" display="inline"><mml:mrow><mml:msup><mml:mi>Q</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>. This is estimated from a discrete histogram with a sequence of thresholds <inline-formula><mml:math id="M211" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>&lt;</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>&lt;</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>&lt;</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">…</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>&lt;</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mrow><mml:mi>K</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>&lt;</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mi>K</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="normal">∞</mml:mi></mml:mrow></mml:math></inline-formula>, and define the probability mass function <inline-formula><mml:math id="M212" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msubsup><mml:mi>Q</mml:mi><mml:mi>k</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mi>Q</mml:mi><mml:mi>k</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi mathvariant="normal">Θ</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> as the probability contained in the <inline-formula><mml:math id="M213" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>th bin (note that <inline-formula><mml:math id="M214" display="inline"><mml:mrow><mml:msubsup><mml:mi>Q</mml:mi><mml:mi>K</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> and so <inline-formula><mml:math id="M215" display="inline"><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msubsup><mml:mi>Q</mml:mi><mml:mrow><mml:mi>K</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi mathvariant="normal">Θ</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mi>K</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>). As described further in Sect. <xref ref-type="sec" rid="Ch1.S3.SS3"/>, we select the <inline-formula><mml:math id="M216" display="inline"><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>s as quantiles with consecutively halving exceedance probabilities, i.e., <inline-formula><mml:math id="M217" display="inline"><mml:mrow><mml:msubsup><mml:mi>Q</mml:mi><mml:mi>k</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:msup><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mrow><mml:mn mathvariant="normal">5</mml:mn><mml:mo>+</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M218" display="inline"><mml:mrow><mml:mn mathvariant="normal">0</mml:mn><mml:mo>≤</mml:mo><mml:mi>k</mml:mi><mml:mo>&lt;</mml:mo><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">11</mml:mn></mml:mrow></mml:math></inline-formula>. These quantiles change with latitude, as the tail is different for each. Note the same set of <inline-formula><mml:math id="M219" display="inline"><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>'s based on the climatological distribution is used also for evaluating estimated distributions. The <inline-formula><mml:math id="M220" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>-divergence of either estimator <inline-formula><mml:math id="M221" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>∈</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">M</mml:mi></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">P</mml:mi></mml:msup><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> is then defined as

            <disp-formula id="Ch1.E19" content-type="numbered"><label>19</label><mml:math id="M222" display="block"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="normal">Δ</mml:mi><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>‖</mml:mo><mml:mi mathvariant="normal">Δ</mml:mi><mml:msup><mml:mi>Q</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msup><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:munderover><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msubsup><mml:mi>Q</mml:mi><mml:mi>k</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msubsup><mml:mo>-</mml:mo><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msubsup><mml:mi>Q</mml:mi><mml:mi>k</mml:mi><mml:mi mathvariant="normal">Θ</mml:mi></mml:msubsup></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e6199">We will compute both the MoCTail and PoPTail estimates on the same dataset, and find them numerically quite similar, both in terms of skill and in terms of individual bin estimates. It would be interesting to develop test cases where they differ more systematically, to clarify which (if either) is generally superior.</p>
      <p id="d2e6202">Computational efficiency is another important consideration besides accuracy, as the entire goal of rare event algorithms is to improve efficiency or accuracy (or both) relative to DNS. For a boosting-like rare event algorithm to be useful, its error should decrease faster by perturbing existing ancestors (increasing <inline-formula><mml:math id="M223" display="inline"><mml:mi>M</mml:mi></mml:math></inline-formula>) than by extending DNS by generating new ancestors (increasing <inline-formula><mml:math id="M224" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> and not <inline-formula><mml:math id="M225" display="inline"><mml:mi>M</mml:mi></mml:math></inline-formula>), at least in some range of <inline-formula><mml:math id="M226" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> that samples the attractor broadly but not exhaustively. However, this paper does not present a complete rare event algorithm per se, in the sense we do not yet stake our claim on a speedup. Rather, we ask a pre-requisite question: does increasing <inline-formula><mml:math id="M227" display="inline"><mml:mi>M</mml:mi></mml:math></inline-formula> decrease the error <italic>at all</italic>? Clearly boosting can increase the maximum severity, but that could happen in ways that do not respect the tail CCDF's shape, e.g., if perturbations tend to maximize the event's severity while bypassing moderate severities that carry significant statistical weight.</p>
      <p id="d2e6245">We will thus make two comparisons between boosting and DNS: accuracy at fixed <inline-formula><mml:math id="M228" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>, and accuracy at fixed cost (where DNS runs an additional length equal to the cost of simulating descendants, allocating its full budget to “exploration” rather than “exploitation”). Specifically, we approximate the cost of the boosting approach for a given AST <inline-formula><mml:math id="M229" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> as

            <disp-formula id="Ch1.E20" content-type="numbered"><label>20</label><mml:math id="M230" display="block"><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="normal">Average</mml:mi></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="normal">boosting</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">cost</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">per</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">ancestor</mml:mi><mml:mo>=</mml:mo><mml:mi>M</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mi>A</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="italic">δ</mml:mi><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="normal">mean</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">return</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">period</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M231" display="inline"><mml:mrow><mml:mi mathvariant="italic">δ</mml:mi><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, the “argmax drift” parameter, accounts for the extra time needed to run after the ancestor's peak to account for changes in peak timing. “Mean return period” is the average time between consecutive independent peaks over the threshold <inline-formula><mml:math id="M232" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>, which will be longer than <inline-formula><mml:math id="M233" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> because of de-clustering. The dependence on <inline-formula><mml:math id="M234" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> is a complication, as each AST tried would merit a different-length DNS for cost comparison, and we do not want to penalize boosting too severely by summing over all ASTs because in practice we would not bother simulating the obviously sub-optimal ASTs. Rather, we optimistically estimate the cost if <inline-formula><mml:math id="M235" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> is already known. On the other hand, our chosen <inline-formula><mml:math id="M236" display="inline"><mml:mrow><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">21</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is likely more samples than necessary to fit a satisfactory parametric model, as we have deliberately sampled the perturbation space more generously than we would if chasing a speedup. We simplify the comparison by fixing <inline-formula><mml:math id="M237" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> to <inline-formula><mml:math id="M238" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:msub><mml:mi>A</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in Eq. (<xref ref-type="disp-formula" rid="Ch1.E20"/>), which is close to or slightly greater than the optimal values that we found empirically.</p>
      <p id="d2e6427">We will show (Fig. <xref ref-type="fig" rid="F13"/>b) that boosting is unambiguously more accurate than DNS when fixing the number of ancestors <inline-formula><mml:math id="M239" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>, and similarly accurate with marginal improvements when fixing cost, though with variation across latitudes and AST criteria. Thus, we do achieve some speedup, even though it is not (yet) our main objective. Any fixed-cost performance gains we achieve here (not our main objective) should be viewed as a lower bound for future algorithms, which will benefit from the conceptual insights into AST that we glean presently.</p>
      <p id="d2e6439">To emphasize the <italic>conditional</italic> nature of the AST – its possible dependence on the ancestor <inline-formula><mml:math id="M240" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> due to initial condition-dependent predictability – we refer to <inline-formula><mml:math id="M241" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:msub><mml:mi>j</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> as the “conditional advance split time” (CAST), and its optimal value (by <inline-formula><mml:math id="M242" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> or other criteria) as the “conditionally optimal advance split time” (COAST). Our goal is to define the COAST, calculate it given extensive sampling from boosted ensembles, and finally to suggest useful criteria to estimate it when sample size is limited, as in a real rare event algorithm deployment.</p>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>AST selection criteria</title>
      <p id="d2e6486">With a data-generating plan and an estimator in place, we return to our central question of interest: how to select the CASTs <inline-formula><mml:math id="M243" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:msub><mml:mi>j</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>? There are three natural kinds of criteria.</p>
      <p id="d2e6508"><list list-type="order">
            <list-item>

      <p id="d2e6513">Choose a single uniform AST <inline-formula><mml:math id="M244" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:msub><mml:mi>j</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> for all ancestors (“U” for “uniform”). In this case, the CAST is not really “conditional” at all. In <xref ref-type="bibr" rid="bib1.bibx17" id="text.66"/>, we found the COAST for TEAMS by systematic grid search through candidate ASTs, and found post-hoc an empirical relationship for the COAST: <inline-formula><mml:math id="M245" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup><mml:mo>≈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">8</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M246" display="inline"><mml:mrow><mml:msub><mml:mi>t</mml:mi><mml:mi mathvariant="italic">ϵ</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the time until an ensemble dispersing from initial condition <inline-formula><mml:math id="M247" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> (each member forced by a different noise realization) reaches a fraction <inline-formula><mml:math id="M248" display="inline"><mml:mi mathvariant="italic">ϵ</mml:mi></mml:math></inline-formula> of its asymptotic root-mean-squared-error (RMSE), and <inline-formula><mml:math id="M249" display="inline"><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>t</mml:mi><mml:mi mathvariant="italic">ϵ</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:math></inline-formula> is the average of <inline-formula><mml:math id="M250" display="inline"><mml:mrow><mml:msub><mml:mi>t</mml:mi><mml:mi mathvariant="italic">ϵ</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> over different initial conditions <inline-formula><mml:math id="M251" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. In <xref ref-type="bibr" rid="bib1.bibx17" id="text.67"/>, we sampled <inline-formula><mml:math id="M252" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> from the stationary distribution; here, for computational expediency, we will repurpose the boosting ensembles for estimating <inline-formula><mml:math id="M253" display="inline"><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>t</mml:mi><mml:mi mathvariant="italic">ϵ</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:math></inline-formula>, i.e., sampling <inline-formula><mml:math id="M254" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> from pre-peak antecedent conditions.</p>
            </list-item>
            <list-item>

      <p id="d2e6695">Choose the CAST <inline-formula><mml:math id="M255" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> separately for each ancestor <inline-formula><mml:math id="M256" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> such that that an ensemble launched at <inline-formula><mml:math id="M257" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> disperses to a pre-defined threshold at time <inline-formula><mml:math id="M258" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>. One could measure dispersal in different ways like RMSE, but here we opt instead for a <italic>pattern correlation</italic>, defined with respect to spatiotemporal fields <inline-formula><mml:math id="M259" display="inline"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> (from the ancestor) and <inline-formula><mml:math id="M260" display="inline"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (from the <inline-formula><mml:math id="M261" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula>th ensemble member) as

                  <disp-formula id="Ch1.E21" content-type="numbered"><label>21</label><mml:math id="M262" display="block"><mml:mrow><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>:=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:msub><mml:mi>f</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:msqrt><mml:mrow><mml:mfenced close=")" open="("><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mfenced><mml:mfenced close=")" open="("><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mfenced></mml:mrow></mml:msqrt></mml:mfrac></mml:mstyle><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">where</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi>f</mml:mi><mml:mo>:=</mml:mo><mml:mi>F</mml:mi><mml:mo>-</mml:mo><mml:mo>〈</mml:mo><mml:mi>F</mml:mi><mml:mo>〉</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>〈</mml:mo><mml:mo>⋅</mml:mo><mml:mo>〉</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="normal">time</mml:mi><mml:mtext>-</mml:mtext><mml:mi mathvariant="normal">average</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>(</mml:mo><mml:mi mathvariant="normal">climatology</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="normal">and</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mover accent="true"><mml:mrow><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mi mathvariant="normal">space</mml:mi><mml:mtext>-</mml:mtext><mml:mi mathvariant="normal">average</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:math></disp-formula>

                Unless  noted otherwise, <inline-formula><mml:math id="M263" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> will refer to the average of <inline-formula><mml:math id="M264" display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>[</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> over all members <inline-formula><mml:math id="M265" display="inline"><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, …, <inline-formula><mml:math id="M266" display="inline"><mml:mi>M</mml:mi></mml:math></inline-formula>. The reason for subtracting time-averages is to fairly weight spatial regions with smaller background <inline-formula><mml:math id="M267" display="inline"><mml:mrow><mml:mo>〈</mml:mo><mml:mi>F</mml:mi><mml:mo>〉</mml:mo></mml:mrow></mml:math></inline-formula>, e.g., poles if <inline-formula><mml:math id="M268" display="inline"><mml:mi>F</mml:mi></mml:math></inline-formula> is temperature. Dividing by spatial standard deviations is simply a useful normalization that restricts <inline-formula><mml:math id="M269" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> to the range [<inline-formula><mml:math id="M270" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, 1] by the Cauchy-Schwarz inequality. <inline-formula><mml:math id="M271" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula>  ends to decrease over time from 1 to 0 except for occasional negative values when <inline-formula><mml:math id="M272" display="inline"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M273" display="inline"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> are similar up to translation (but this effect usually disappears when averaging large-enough ensembles). We then choose some threshold <inline-formula><mml:math id="M274" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup><mml:mo>∈</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, and select the corresponding CAST <inline-formula><mml:math id="M275" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:msub><mml:mi>j</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mi>A</mml:mi><mml:mi>n</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msubsup><mml:mo>[</mml:mo><mml:msup><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> – a function of the threshold—as the smallest sampled AST <inline-formula><mml:math id="M276" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for which <inline-formula><mml:math id="M277" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula>  creases from 1 to <inline-formula><mml:math id="M278" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> between the split time <inline-formula><mml:math id="M279" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and the peak time <inline-formula><mml:math id="M280" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>. (PC stands for “pattern correlation”.) Note that the CAST varies with <inline-formula><mml:math id="M281" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>, but the correlation threshold, denoted <inline-formula><mml:math id="M282" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, is uniform. Finding the COASTs <inline-formula><mml:math id="M283" display="inline"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>n</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> then boils down to finding the optimal value of <inline-formula><mml:math id="M284" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>.</p>

      <p id="d2e7208">The <inline-formula><mml:math id="M285" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> rule from <xref ref-type="bibr" rid="bib1.bibx17" id="text.68"/>, which used Euclidean distance <inline-formula><mml:math id="M286" display="inline"><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>[</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>]</mml:mo><mml:mo>=</mml:mo><mml:mover accent="true"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mover accent="true"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mrow></mml:math></inline-formula> as the dispersion indicator, can be approximately restated in terms of pattern correlation: 

                      <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M287" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E22"><mml:mtd><mml:mtext>22</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msup><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>〈</mml:mo><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>〉</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>〈</mml:mo><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>〉</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="normal">saturation</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">value</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">of</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msup><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E23"><mml:mtd><mml:mtext>23</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mo>⇒</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:msub><mml:mi>f</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo><mml:mo>+</mml:mo><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="normal">Using</mml:mi><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:msub><mml:mi>f</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo><mml:mo>=</mml:mo><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E24"><mml:mtd><mml:mtext>24</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mfenced close=")" open="("><mml:mrow><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mfenced open="(" close=")"><mml:mrow><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo></mml:mrow></mml:mfenced></mml:mrow><mml:msqrt><mml:mrow><mml:mfenced open="(" close=")"><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mfenced><mml:mfenced close=")" open="("><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mfenced></mml:mrow></mml:msqrt></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:msub><mml:mi>f</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mrow><mml:msqrt><mml:mrow><mml:mfenced close=")" open="("><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mfenced><mml:mfenced open="(" close=")"><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mfenced></mml:mrow></mml:msqrt></mml:mfrac></mml:mstyle><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">ρ</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E25"><mml:mtd><mml:mtext>25</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfenced><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo><mml:mo>+</mml:mo><mml:mfenced open="(" close=")"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfenced><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo></mml:mrow><mml:msqrt><mml:mrow><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo></mml:mrow></mml:msqrt></mml:mfrac></mml:mstyle><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">ρ</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="normal">Approximating</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mover accent="true"><mml:mrow><mml:msup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>≈</mml:mo><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E26"><mml:mtd><mml:mtext>26</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>≈</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="normal">Using</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo><mml:mo>=</mml:mo><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

                (The approximation invoked in the second-to-last step, <inline-formula><mml:math id="M288" display="inline"><mml:mrow><mml:mover accent="true"><mml:mrow><mml:msup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>≈</mml:mo><mml:mo>〈</mml:mo><mml:mover accent="true"><mml:mrow><mml:msup><mml:mi>f</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mo>〉</mml:mo></mml:mrow></mml:math></inline-formula>, will hold when the spatial region is large enough that global fluctuations in the same direction are unlikely.) This calculation shows that the time until RMSE reaches <inline-formula><mml:math id="M289" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> of its saturation value is roughly equivalent to the time at which pattern correlation drops to <inline-formula><mml:math id="M290" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mfenced close=")" open="("><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.86</mml:mn></mml:mrow></mml:math></inline-formula>. We do not assume this threshold is optimal, but include it as a reference for comparison. And we stress that the <inline-formula><mml:math id="M291" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> rule implemented in <xref ref-type="bibr" rid="bib1.bibx17" id="text.69"/> determines a uniform <inline-formula><mml:math id="M292" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, not a conditional <inline-formula><mml:math id="M293" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, because their averaging was performed over the attractor, whereas here we will use <inline-formula><mml:math id="M294" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> as an initial condition-specific diagnostic.</p>
            </list-item>
            <list-item>

      <p id="d2e8061">Define the CAST as the solution to an optimization problem, where we seek to maximize a functional on the boosted severity distribution that favors both a high mean and high variability of the severity. This would implicitly favor intermediate ASTs, as short-AST ensembles have high mean but low variability while long-AST ensembles will have high variability but low mean (approaching the climatological distribution). We propose and evaluate two such functionals in this paper: <list list-type="custom"><list-item><label>a.</label>
      <p id="d2e8066">Expected improvement (EI):<disp-formula id="Ch1.E27" content-type="numbered"><label>27</label><mml:math id="M295" display="block"><mml:mrow><mml:mi mathvariant="double-struck">E</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:msub><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="normal">Δ</mml:mi><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:mfenced><mml:mo>+</mml:mo></mml:msub></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mi mathvariant="normal">Ω</mml:mi></mml:munder><mml:msup><mml:mi>p</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:msub><mml:mfenced open="[" close="]"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:mfenced><mml:mo>+</mml:mo></mml:msub><mml:mi mathvariant="normal">d</mml:mi><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>where <inline-formula><mml:math id="M296" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo></mml:msub><mml:mo>:=</mml:mo><mml:mi mathvariant="normal">max</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and we recall that <inline-formula><mml:math id="M297" display="inline"><mml:mrow><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> means no perturbation (i.e., the ancestor) </p></list-item><list-item><label>b.</label>
      <p id="d2e8190">Thresholded entropy (TE):<disp-formula id="Ch1.E28" content-type="numbered"><label>28</label><mml:math id="M298" display="block"><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi>S</mml:mi><mml:mfenced close="]" open="["><mml:mrow><mml:msub><mml:mfenced close=")" open="("><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>-</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:mfenced><mml:mo>+</mml:mo></mml:msub></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:munderover><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>Q</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mi>log⁡</mml:mi><mml:mi mathvariant="normal">Δ</mml:mi><mml:msub><mml:mi>Q</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>where the bin boundaries <inline-formula><mml:math id="M299" display="inline"><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> start at <inline-formula><mml:math id="M300" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>, and so only the tail part of the conditional CCDF contributes. The thresholded entropy is thus defined based on probability over discrete bins (with the bin boundaries <inline-formula><mml:math id="M301" display="inline"><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> set based on quantiles of the ground-truth distribution) and would change if the bins were changed.</p></list-item></list> Where it does not cause confusion, we will also call the CASTs <inline-formula><mml:math id="M302" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">EI</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M303" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">TE</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> themselves COASTs because they optimize something, although it is something different than <inline-formula><mml:math id="M304" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. We conjecture that that these two notions of optimality coincide: if each ancestor separately optimizes  I or TE, the resulting aggregate of distributions (via MoCTail or PoPTail estimators) will minimize <inline-formula><mml:math id="M305" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>-divergence from the true climatological tail. Our results will approximately confirm the conjecture in the case of TE.</p>
            </list-item>
          </list>These criteria are each in turn more complex, but also more theoretically appealing. The correlation-based CASTs <inline-formula><mml:math id="M306" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi>A</mml:mi><mml:mi>n</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msubsup><mml:msubsup><mml:mo mathvariant="italic">}</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, unlike the synchronized AST <inline-formula><mml:math id="M307" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, can vary with <inline-formula><mml:math id="M308" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> to respect differences in predictability between different initial conditions, a well-recognized phenomenon in chaotic systems <xref ref-type="bibr" rid="bib1.bibx46" id="paren.70"/>, including the atmosphere <xref ref-type="bibr" rid="bib1.bibx41" id="paren.71"/>. Still, both <inline-formula><mml:math id="M309" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M310" display="inline"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>n</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> require the user to set some arbitrary global threshold. The open question is whether optimizing <inline-formula><mml:math id="M311" display="inline"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>n</mml:mi><mml:mi mathvariant="normal">EI</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> or <inline-formula><mml:math id="M312" display="inline"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>n</mml:mi><mml:mi mathvariant="normal">TE</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> individually for each <inline-formula><mml:math id="M313" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> will also optimize the accuracy of the unconditional (MoCTail) climatological CCDF estimator against the ground truth climatological CCDF from a long DNS.</p>
      <p id="d2e8448"><bold>Main result</bold>: climatological tails are estimated more accurately with perturbed ensembles than with un-perturbed ancestors alone (fixed-<inline-formula><mml:math id="M314" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> comparison between DNS and boosting). This holds with few exceptions for both MoCTail and PoPTail estimators, for all COAST selection rules, and for all target spatial locations. At fixed cost, boosting and DNS are tied overall, but with some variation across latitudes and the value that cost is fixed to, suggesting that substantial speedups are possible with more highly optimized boosting-like algorithms. No single selection rule is superior across the board. The EI and TE criteria, however, have a distinct advantage of needing no arbitrary threshold choices. TE-based estimates strike a reasonable compromise between statistical error and arbitrariness, which is strong enough support that <italic>we recommend TE as a generic AST selection rule</italic>.</p>
      <p id="d2e8464">The remainder of the paper demonstrates the theoretical framework above on the QG system. Section <xref ref-type="sec" rid="Ch1.S3"/> specifies the dynamical model and its numerical simulation, displays some representative output, defines the target intensity functions of interest, and reports on their basic tail statistics. Section <xref ref-type="sec" rid="Ch1.S4"/> specifies the perturbation protocol (i.e., the space <inline-formula><mml:math id="M315" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula> and probability densities <inline-formula><mml:math id="M316" display="inline"><mml:mrow><mml:msup><mml:mi>p</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>) and visualizes representative examples of the system's response, providing motivation for our choices of AST selection criteria. Section <xref ref-type="sec" rid="Ch1.S6"/> compares the performances of all proposed AST selection criteria criteria in matching the climatological tail CCDF. Section <xref ref-type="sec" rid="Ch1.S7"/> concludes with a summary and outlook on important future lines of work.</p>
      <p id="d2e8501">Throughout, we present more in-depth results for one select target latitudes just south of the domain center, and only summarize for the wider range of target latitudes, which reveals large-scale variations in extreme event predictability and representability across space.</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>The quasigeostrophic model</title>
      <p id="d2e8513">The model setup aims to distill some challenges we have encountered with rare event algorithms. We first recognized the need for advance splitting (or “trying early”) to sample extreme precipitation in an aquaplanet GCM <xref ref-type="bibr" rid="bib1.bibx18" id="paren.72"/>. A minimal surrogate model replicating this challenge was found in Lorenz-96 <xref ref-type="bibr" rid="bib1.bibx40" id="paren.73"/>, which provided a testbed for the first working version of TEAMS and a recognition of an “optimal advance split time” <xref ref-type="bibr" rid="bib1.bibx17" id="paren.74"/>. There is a huge gap in model complexity between Lorenz-96 and the GCM (see Table <xref ref-type="table" rid="T1"/>), and we wish to test our idea in this middle ground where the target spatial location can have an effect. Lorenz-96, with a one-dimensional domain and homogeneous forcing, is too simple. For this reason, and to make closer contact with physics, we selected the two-layer QG model as a suitable intermediate between Lorenz-96 and the GCM.</p>

<table-wrap id="T1" specific-use="star"><label>Table 1</label><caption><p id="d2e8530">Three rungs on the model hierarchy. Left: the Lorenz-96 system used in <xref ref-type="bibr" rid="bib1.bibx17" id="text.75"/> has a one-dimensional spatial domain (“longitude”) divided into discrete sites <inline-formula><mml:math id="M317" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, …, 39, on which generic meteorological variables <inline-formula><mml:math id="M318" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> evolve in time. Its state space dimension is 40. Right: the aquaplanet model used in <xref ref-type="bibr" rid="bib1.bibx18" id="text.76"/> has a three-dimensional spatial domain: latitude <inline-formula><mml:math id="M319" display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula>, longitude <inline-formula><mml:math id="M320" display="inline"><mml:mi mathvariant="italic">ϕ</mml:mi></mml:math></inline-formula>, and pressure normalized by its surface value, <inline-formula><mml:math id="M321" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mi>p</mml:mi><mml:mo>/</mml:mo><mml:msub><mml:mi>p</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. It has six prognostic fields: zonal wind <inline-formula><mml:math id="M322" display="inline"><mml:mi>u</mml:mi></mml:math></inline-formula>, meridioal wind <inline-formula><mml:math id="M323" display="inline"><mml:mi>v</mml:mi></mml:math></inline-formula>, temperature <inline-formula><mml:math id="M324" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>, and humidity <inline-formula><mml:math id="M325" display="inline"><mml:mi>q</mml:mi></mml:math></inline-formula> vary in all three dimensions, whereas surface pressure <inline-formula><mml:math id="M326" display="inline"><mml:mrow><mml:msub><mml:mi>p</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and precipitation rate <inline-formula><mml:math id="M327" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> vary only in the horizontal. Center: the 2-layer quasigeostrophic model used in this study has two layers (<inline-formula><mml:math id="M328" display="inline"><mml:mrow><mml:mi>z</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula>) of two dimensions each (longitude <inline-formula><mml:math id="M329" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>, latitude <inline-formula><mml:math id="M330" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>), and two dynamical fields: streamfunction <inline-formula><mml:math id="M331" display="inline"><mml:mi mathvariant="italic">ψ</mml:mi></mml:math></inline-formula> which is discretized spectrally, and tracer concentration <inline-formula><mml:math id="M332" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> which is discretized on a grid.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Model</oasis:entry>
         <oasis:entry colname="col2">One-tier Lorenz-96</oasis:entry>
         <oasis:entry colname="col3">2-layer quasigeostrophic channel</oasis:entry>
         <oasis:entry colname="col4">Global aquaplanet</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Domain</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M333" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>∈</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">39</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M334" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mi>z</mml:mi><mml:mo>)</mml:mo><mml:mo>∈</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mi>L</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>×</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M335" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo><mml:mo>∈</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">360</mml:mn><mml:mo>)</mml:mo><mml:mo>×</mml:mo><mml:mo>[</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">90</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">90</mml:mn><mml:mo>)</mml:mo><mml:mo>×</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Fields</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M336" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M337" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mi>z</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mi>z</mml:mi></mml:msub><mml:mo mathvariant="italic">}</mml:mo><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M338" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mi>u</mml:mi><mml:mo>,</mml:mo><mml:mi>v</mml:mi><mml:mo>,</mml:mo><mml:mi>T</mml:mi><mml:mo>,</mml:mo><mml:mi>q</mml:mi><mml:mo mathvariant="italic">}</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo><mml:mo>∪</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi>p</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mi>R</mml:mi><mml:mo mathvariant="italic">}</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Equations of motion and numerical simulation</title>
      <p id="d2e8998">We implement a version of the QG model combining elements of several classic studies. Our numerical method and friction form follow <xref ref-type="bibr" rid="bib1.bibx27" id="text.77"/>, but on a smaller domain with weaker friction magnitude as in <xref ref-type="bibr" rid="bib1.bibx52" id="text.78"/> to contain only 1–2 more energetic zonal jets. We furthermore add bottom topography in the lower layer as in <xref ref-type="bibr" rid="bib1.bibx70" id="text.79"/> to fix preferred latitudes for jets while still allowing them to temporarily split, merge, and meander. Thus climate statistics, and hence the COAST itself, can vary with latitude. Further, we augment the system with a passive tracer to represent a key component of precipitation dynamics, following the spirit of <xref ref-type="bibr" rid="bib1.bibx7" id="text.80"/> and <xref ref-type="bibr" rid="bib1.bibx59 bib1.bibx60" id="text.81"/>  who used turbulent advection-diffusion as a paradigm for intermittency.</p>
      <p id="d2e9016">The model equations are as follows, in non-dimensionalized form using the deformation radius <inline-formula><mml:math id="M339" display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula> as the length scale and a velocity scale <inline-formula><mml:math id="M340" display="inline"><mml:mi mathvariant="script">U</mml:mi></mml:math></inline-formula>. To make plain the role of the background shear, we define a non-dimensional wind <inline-formula><mml:math id="M341" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula> as the ratio between the imposed upper-level zonal wind and <inline-formula><mml:math id="M342" display="inline"><mml:mi mathvariant="script">U</mml:mi></mml:math></inline-formula>. All non-dimensional parameter values are listed in Table <xref ref-type="table" rid="T2"/>. The horizontal coordinates (<inline-formula><mml:math id="M343" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M344" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>) each run from 0 to <inline-formula><mml:math id="M345" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula>. The integer-valued vertical coordinate <inline-formula><mml:math id="M346" display="inline"><mml:mi>z</mml:mi></mml:math></inline-formula> is an index for the layer (1 for the top and 2 for the bottom, appearing as a subscript). <inline-formula><mml:math id="M347" display="inline"><mml:mi mathvariant="italic">ψ</mml:mi></mml:math></inline-formula> represents the streamfunction minus a background of <inline-formula><mml:math id="M348" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mi>U</mml:mi><mml:mi>y</mml:mi><mml:msub><mml:mi mathvariant="italic">δ</mml:mi><mml:mrow><mml:mi>z</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. <inline-formula><mml:math id="M349" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula> is the bottom topography which is specified to vary sinusoidally with wavenumber 2 in latitude. <inline-formula><mml:math id="M350" display="inline"><mml:mi>q</mml:mi></mml:math></inline-formula> represents potential vorticity minus a background of <inline-formula><mml:math id="M351" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mi>y</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi><mml:msub><mml:mi mathvariant="italic">δ</mml:mi><mml:mrow><mml:mi>z</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, due to planetary vorticity gradient and topography. <inline-formula><mml:math id="M352" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> represents the passive tracer field.

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M353" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E29"><mml:mtd><mml:mtext>29</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mfenced open="[" close=""><mml:mrow><mml:msub><mml:mo>∂</mml:mo><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mfenced open="" close="]"><mml:mrow><mml:mo>+</mml:mo><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mo>∂</mml:mo><mml:mi>x</mml:mi></mml:msub><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mi>z</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:msub><mml:mo>∂</mml:mo><mml:mi>y</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>U</mml:mi><mml:msub><mml:mi mathvariant="italic">δ</mml:mi><mml:mrow><mml:mi>z</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mo>∂</mml:mo><mml:mi>y</mml:mi></mml:msub><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mi>z</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:msub><mml:mo>∂</mml:mo><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>z</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mi>h</mml:mi><mml:msub><mml:mi mathvariant="italic">δ</mml:mi><mml:mrow><mml:mi>z</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi mathvariant="italic">β</mml:mi><mml:mi>y</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="italic">κ</mml:mi><mml:msub><mml:mi mathvariant="italic">δ</mml:mi><mml:mrow><mml:mi>z</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub><mml:msup><mml:mi mathvariant="normal">∇</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mi>z</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mi mathvariant="italic">ν</mml:mi><mml:msup><mml:mi mathvariant="normal">∇</mml:mi><mml:mn mathvariant="normal">6</mml:mn></mml:msup><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mi>z</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E30"><mml:mtd><mml:mtext>30</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mrow><mml:mfenced close="]" open="["><mml:mrow><mml:msub><mml:mo>∂</mml:mo><mml:mi>t</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mo>∂</mml:mo><mml:mi>x</mml:mi></mml:msub><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mi>z</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:msub><mml:mo>∂</mml:mo><mml:mi>y</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>U</mml:mi><mml:msub><mml:mi mathvariant="italic">δ</mml:mi><mml:mrow><mml:mi>z</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mo>∂</mml:mo><mml:mi>y</mml:mi></mml:msub><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mi>z</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:msub><mml:mo>∂</mml:mo><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:msub><mml:mi>c</mml:mi><mml:mi>z</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E31"><mml:mtd><mml:mtext>31</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:mi mathvariant="normal">for</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mi>z</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>∈</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mi>L</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>×</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E32"><mml:mtd><mml:mtext>32</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mrow><mml:mi mathvariant="normal">where</mml:mi></mml:mrow></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E33"><mml:mtd><mml:mtext>33</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>z</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="normal">∇</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mi>z</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:msup><mml:mo>)</mml:mo><mml:mi>z</mml:mi></mml:msup><mml:mfenced close=")" open="("><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced></mml:mrow></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E34"><mml:mtd><mml:mtext>34</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:mi>h</mml:mi><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mi>sin⁡</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>⋅</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:mi>y</mml:mi></mml:mrow><mml:mi>L</mml:mi></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced></mml:mrow></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          For <inline-formula><mml:math id="M354" display="inline"><mml:mi mathvariant="italic">ψ</mml:mi></mml:math></inline-formula>, we impose doubly periodic boundary conditions and timestep with a pseudo-spectral method with 64 Fourier modes in each dimension and standard <inline-formula><mml:math id="M355" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">3</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula>-dealiasing (hence, an effective maximum wavenumber of 20). We time-step linear terms with the trapezoid rule (Crank–Nicolson) and nonlinear and topographic terms with a predictor-corrector (Heun's) method. Meanwhile, boundary conditions on <inline-formula><mml:math id="M356" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> are periodic in <inline-formula><mml:math id="M357" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> and Dirichlet in <inline-formula><mml:math id="M358" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>, with values (0, 1) at <inline-formula><mml:math id="M359" display="inline"><mml:mrow><mml:mi>y</mml:mi><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mi>L</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. Together with a first-order upwind monotone finite-volume scheme, this setup guarantees that <inline-formula><mml:math id="M360" display="inline"><mml:mrow><mml:mn mathvariant="normal">0</mml:mn><mml:mo>≤</mml:mo><mml:mi>c</mml:mi><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> everywhere, making clear that its probability distribution has compact support. Note there is no explicit dissipation for <inline-formula><mml:math id="M361" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula>, but the low-order discretization creates some effective diffusivity.</p>

<table-wrap id="T2"><label>Table 2</label><caption><p id="d2e9632">Non-dimensional physical parameters used for the numerical simulation, similar to those chosen in <xref ref-type="bibr" rid="bib1.bibx52" id="text.82"/>.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Description</oasis:entry>
         <oasis:entry colname="col2">Symbol</oasis:entry>
         <oasis:entry colname="col3">Value</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Coriolis gradient</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M362" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.25</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Ekman friction coefficient</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M363" display="inline"><mml:mi mathvariant="italic">κ</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.05</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Wind shear</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M364" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Hyperviscosity</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M365" display="inline"><mml:mi mathvariant="italic">ν</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">(0.292)<sup>3</sup></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Topography amplitude</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M367" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.25</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Domain size</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M368" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M369" display="inline"><mml:mrow><mml:mn mathvariant="normal">6</mml:mn><mml:mo>⋅</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e9795">The number of degrees of freedom, or state space dimension, is 

            <disp-formula id="Ch1.E35" content-type="numbered"><label>35</label><mml:math id="M370" display="block"><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi>d</mml:mi><mml:mo>=</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="normal">layers</mml:mi><mml:mo>)</mml:mo><mml:mo>×</mml:mo><mml:mfenced open="(" close=""><mml:mrow><mml:msup><mml:mn mathvariant="normal">41</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mi mathvariant="normal">Fourier</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">modes</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">for</mml:mi><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mn mathvariant="normal">64</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mi mathvariant="normal">grid</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mfenced open="" close=")"><mml:mrow><mml:mi mathvariant="normal">cells</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">for</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi>c</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mn mathvariant="normal">11</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mn mathvariant="normal">554</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          and we will sometimes refer to the full state vector as <inline-formula><mml:math id="M371" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi><mml:mo mathvariant="italic">}</mml:mo><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mi>z</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>∈</mml:mo><mml:msup><mml:mi mathvariant="double-struck">R</mml:mi><mml:mi>d</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> – not to be confused with the spatial coordinate <inline-formula><mml:math id="M372" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>. For simplicity, we refer to one time unit as a day, which is <inline-formula><mml:math id="M373" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">10</mml:mn></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula> of an eddy turnover timescale (see Fig. <xref ref-type="fig" rid="F3"/>). The common timestep for <inline-formula><mml:math id="M374" display="inline"><mml:mi mathvariant="italic">ψ</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M375" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> is 0.025 d, and the output frequency is once per day. The spatiotemporal resolution is coarse by modern standards, but we are not seeking to calculate any real-world physical quantity: we are seeking a general rule that will help make the COAST clear for a wide class of models.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e9967">Snapshots of the QG system configuration in the upper layer. Contours indicate the anomaly streamfunction <inline-formula><mml:math id="M376" display="inline"><mml:mi mathvariant="italic">ψ</mml:mi></mml:math></inline-formula>, which varies over a non-dimensional range of approximately <inline-formula><mml:math id="M377" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">18</mml:mn></mml:mrow></mml:math></inline-formula>, dashed contours indicating negative anomalies. Colors indicate <bold>(a)</bold> tracer concentration <inline-formula><mml:math id="M378" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula>, <bold>(b)</bold> zonal wind velocity <inline-formula><mml:math id="M379" display="inline"><mml:mrow><mml:mi>u</mml:mi><mml:mo>=</mml:mo><mml:mi>U</mml:mi><mml:mo>-</mml:mo><mml:msub><mml:mo>∂</mml:mo><mml:mi>y</mml:mi></mml:msub><mml:mi mathvariant="italic">ψ</mml:mi></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M380" display="inline"><mml:mrow><mml:mi>U</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> is the basic background shear, and <bold>(c)</bold> meridional velocity <inline-formula><mml:math id="M381" display="inline"><mml:mrow><mml:mi>v</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mo>∂</mml:mo><mml:mi>x</mml:mi></mml:msub><mml:mi mathvariant="italic">ψ</mml:mi></mml:mrow></mml:math></inline-formula>. The timestamps increase from left to right, and come from the long DNS. The small square represents an example target region in which to sample extremes of the local tracer concentration, in this case centered at <inline-formula><mml:math id="M382" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> and extending <inline-formula><mml:math id="M383" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> in both meridional and zonal directions. This same region is the target used in the following results, and we consistently refer to the domain coordinates in fractions of 64 across all figures.</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f02.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Baseline simulation and statistics</title>
      <p id="d2e10127">We run a “short DNS”  of length <inline-formula><mml:math id="M384" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi mathvariant="normal">short</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> d <inline-formula><mml:math id="M385" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">11</mml:mn></mml:mrow></mml:math></inline-formula> years (after a 500 d spinup) to supply the pool of initially un-perturbed (“ancestral”) events. Then, to provide “ground truth” statistics, we run a control simulation, or “long DNS”, of duration <inline-formula><mml:math id="M386" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi mathvariant="normal">long</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">16</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">d</mml:mi><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">44</mml:mn></mml:mrow></mml:math></inline-formula> years, which is <inline-formula><mml:math id="M387" display="inline"><mml:mi>O</mml:mi></mml:math></inline-formula>(1600) eddy turnover times and <inline-formula><mml:math id="M388" display="inline"><mml:mi>O</mml:mi></mml:math></inline-formula>(160) jet meandering times (see Fig. <xref ref-type="fig" rid="F3"/> caption for timescale definitions). However, in estimating climatological statistics from the long DNS, we take advantage of statistical zonal symmetry by concatenating the timeseries of all 64 longitudes, increasing the effective sample size by a factor of <inline-formula><mml:math id="M389" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mi>L</mml:mi><mml:mo>/</mml:mo></mml:mrow></mml:math></inline-formula>(some typical correlation length). Conceptually, the short and long DNS are analogous to “training” and “validation” datasets in standard machine learning procedures, in the sense that we want to infer properties of the validation set using only information extracted from the training set (for example, by perturbing and re-simulating events seen in training). As we show below, simply counting events from the short DNS gives probability estimates that deterioriate at high levels of severity, which we aim to rectify with boosting.</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e10222">Hovmöller diagrams of anomalies (departures from time-means) of zonal-mean concentration <bold>(a.i)</bold> and zonal-mean zonal wind <bold>(b.i)</bold>. Contours indicate zonal-mean streamfunction anomaly (range <inline-formula><mml:math id="M390" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula>, negatives values dashed). Column <bold>(ii)</bold> shows bottom topography, which <italic>directly</italic> affects the lower layer only, but indirectly sets the preferred jet positions in the upper layer as well. For the same quantities, column <bold>(iii)</bold> shows the zonal and time mean and column <bold>(iv)</bold> shows the zonal mean of the temporal standard deviation. The Hovmöller diagrams give context to the snapshots of <inline-formula><mml:math id="M391" display="inline"><mml:mi>u</mml:mi></mml:math></inline-formula> from Fig. <xref ref-type="fig" rid="F2"/>b, which come from times <bold>(i)</bold> 3300, when the upper and lower jets are both shifted south; <bold>(ii)</bold> 3400, when the jets are unusually far apart; and <bold>(iii)</bold> 3500, when the jets are unusually close together. These intermittent, discrete shifts in jet location happen every <inline-formula><mml:math id="M392" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> d, which we call the “jet meandering timescale”. During a typical 100 d timespan of stationary jet, the fields oscillate roughly 10 times (not shown here; see Fig. <xref ref-type="fig" rid="F7"/>); hence we assign the eddy turnover timescale a nominal value of 10 d.</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f03.png"/>

        </fig>

      <p id="d2e10291">Figure <xref ref-type="fig" rid="F2"/> shows representative snapshots of three dynamical fields in the upper layer from the long DNS: tracer concentration <inline-formula><mml:math id="M393" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula>, zonal velocity <inline-formula><mml:math id="M394" display="inline"><mml:mrow><mml:mi>u</mml:mi><mml:mo>=</mml:mo><mml:mi>U</mml:mi><mml:mo>-</mml:mo><mml:msub><mml:mo>∂</mml:mo><mml:mi>y</mml:mi></mml:msub><mml:mi mathvariant="italic">ψ</mml:mi></mml:mrow></mml:math></inline-formula>, and meridional velocity <inline-formula><mml:math id="M395" display="inline"><mml:mrow><mml:mi>v</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mo>∂</mml:mo><mml:mi>x</mml:mi></mml:msub><mml:mi mathvariant="italic">ψ</mml:mi></mml:mrow></mml:math></inline-formula>. Figure <xref ref-type="fig" rid="F3"/> shows Hovmöller diagrams of zonal-mean anomalies of <inline-formula><mml:math id="M396" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M397" display="inline"><mml:mi>u</mml:mi></mml:math></inline-formula> (not <inline-formula><mml:math id="M398" display="inline"><mml:mi>v</mml:mi></mml:math></inline-formula>, since zonal-mean meridional velocity is zero), as well as their climatological means and standard deviations plotted alongside the topography. These are statistics of the grid-cell values, not zonal means, but depend only on latitude because so does topography. Two eastward jets are prominent in the snapshots Fig. <xref ref-type="fig" rid="F2"/>b and in the zonal mean profile Fig. <xref ref-type="fig" rid="F3"/>b.iii, with preferred latitudes of <inline-formula><mml:math id="M399" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">4</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M400" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">4</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>. The Hovmöller diagram gives a sense of characteristic timescales: jets tend to remain roughly stationary for stretches of <inline-formula><mml:math id="M401" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> d at a time before shifting, as seen by the group of closed contours of <inline-formula><mml:math id="M402" display="inline"><mml:mi mathvariant="italic">ψ</mml:mi></mml:math></inline-formula> and associated dipole of <inline-formula><mml:math id="M403" display="inline"><mml:mi>u</mml:mi></mml:math></inline-formula> centered at time <inline-formula><mml:math id="M404" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3400</mml:mn></mml:mrow></mml:math></inline-formula>. and persisting <inline-formula><mml:math id="M405" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> d to either side. Within these stretches of quasi-stationarity, there are shorter undulations of duration <inline-formula><mml:math id="M406" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula>, which we identify as the eddy turnover timescale.</p>
      <p id="d2e10460">The tracer statistics (Fig. <xref ref-type="fig" rid="F3"/>a.iii and iv) have some easily explainable large-scale patterns and some subtler small-scale patterns. The tracer time-mean <inline-formula><mml:math id="M407" display="inline"><mml:mrow><mml:mo>〈</mml:mo><mml:mi>c</mml:mi><mml:mo>〉</mml:mo><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> increases linearly overall as <inline-formula><mml:math id="M408" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mi>y</mml:mi><mml:mi>L</mml:mi></mml:mfrac></mml:mstyle></mml:math></inline-formula>, in keeping with its Dirichlet boundary conditions. However, in the central region of the domain (inside the weak westward jet) the tracer mean varies more rapidly with latitude and has a larger standard deviation (see also dashed curves in Fig. <xref ref-type="fig" rid="F4"/>b and c).</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e10498">Summary statistics of latitude-dependent climatological tail distributions of local tracer concentrations, also called “intensities”, which are denoted <inline-formula><mml:math id="M409" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> and defined as the average concentration <inline-formula><mml:math id="M410" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> over a box <inline-formula><mml:math id="M411" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo><mml:mo>∈</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mo>[</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:msup><mml:mo>]</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. <inline-formula><mml:math id="M412" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M413" display="inline"><mml:mrow><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">32</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> are fixed, while <inline-formula><mml:math id="M414" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> varies across the midlatitudes from <inline-formula><mml:math id="M415" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> to <inline-formula><mml:math id="M416" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">54</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>. Panel <bold>(a)</bold> shows the lower-layer topography in this same range of middle latitudes, <bold>(b)</bold> shows the mean intensity <inline-formula><mml:math id="M417" display="inline"><mml:mrow><mml:mo>〈</mml:mo><mml:mi>R</mml:mi><mml:mo>〉</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, after subtracting a nominal trend of <inline-formula><mml:math id="M418" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow><mml:mi>L</mml:mi></mml:mfrac></mml:mstyle></mml:math></inline-formula> to reveal a finer-scale structure that resembles the underlying topography, and <bold>(c)</bold> shows the standard deviation of intensity <inline-formula><mml:math id="M419" display="inline"><mml:msqrt><mml:mrow><mml:mo>〈</mml:mo><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>〉</mml:mo><mml:mo>-</mml:mo><mml:mo>〈</mml:mo><mml:mi>R</mml:mi><mml:msup><mml:mo>〉</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:math></inline-formula>. Dashed curves in <bold>(b)</bold> and <bold>(c)</bold> indicate the mean and standard deviation, respectively, of the concentration field <inline-formula><mml:math id="M420" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> without box-averaging. Panels <bold>(d)</bold>–<bold>(f)</bold> summarize the distribution of intensities <inline-formula><mml:math id="M421" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> via the parameters of the generalized Pareto distributions (GPD), inferred by the peaks-over-threshold fitting procedure (see Sect. <xref ref-type="sec" rid="Ch1.S3.SS3"/> for details). The threshold is set to the <inline-formula><mml:math id="M422" display="inline"><mml:mrow><mml:msup><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>-complementary quantile, denoted <inline-formula><mml:math id="M423" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:msup><mml:mfenced close=")" open="("><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula> and shown in <bold>(d)</bold> with linear trend removed. Panels <bold>(e)</bold> and <bold>(f)</bold> display the estimated (scale, shape) parameters (<inline-formula><mml:math id="M424" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="italic">ξ</mml:mi></mml:mrow></mml:math></inline-formula>).</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f04.png"/>

        </fig>

      <p id="d2e10817">In the eastward jets, the tracer mean varies more slowly with latitude and has a smaller standard deviation. Comparison with the Hovmöller diagram (Fig. <xref ref-type="fig" rid="F3"/>a.i) suggests that the central region owes its high variance to short-lived anomalous pulses, both positive and negative, which are more intense than in surrounding regions. We won't try to explain these patterns from first principles, but simply state that the setup accomplishes our intention to provide a variety of statistical behaviors as a suite of test cases for our approach.</p>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Target variable</title>
      <p id="d2e10830">We define the intensity function of interest <inline-formula><mml:math id="M425" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> as the upper-level concentration, <inline-formula><mml:math id="M426" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> (henceforth, simply <inline-formula><mml:math id="M427" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula>), averaged over a small square box <inline-formula><mml:math id="M428" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>]</mml:mo><mml:mo>×</mml:mo><mml:mo>[</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> of half-width <inline-formula><mml:math id="M429" display="inline"><mml:mrow><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula>. This function is designed to capture the real-world considerations and algorithmic difficulties that originally motivated the AST: it describes <italic>localized</italic> conditions, similar to concentrated pollution, high heat, or heavy rainfall over a region on Earth, and it is mediated by traveling baroclinic waves, and as a result it displays intermittency, with extreme spikes that come and go quickly. The choice of upper- instead of lower-level concentration is simply to weaken the impact of arbitrary aspects of the model setup like the surface topography. Real-world applications would of course refine this choice in many ways, but our choice is suitable for the QG level of model idealization.</p>
      <p id="d2e10942">To investigate the effects of location-dependent flow regimes, we vary <inline-formula><mml:math id="M430" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> across 23 evenly spaced latitudes <inline-formula><mml:math id="M431" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>∈</mml:mo><mml:mfenced close="}" open="{"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">12</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">54</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>, restricted to the central region to avoid boundary effects. The central longitude <inline-formula><mml:math id="M432" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is fixed to <inline-formula><mml:math id="M433" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula>, but by zonal homogeneity any longitude would be statistically equivalent. We also repeated the analysis with double the box length, and found results to be qualitatively similar, so we only show results for the smaller box size. The effect of spatial scale is worth considering in its own right with a wider range, which we postpone to future work.</p>
      <p id="d2e11024">Figure <xref ref-type="fig" rid="F4"/> displays some  summary statistics of <inline-formula><mml:math id="M434" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> as functions of the target latitude <inline-formula><mml:math id="M435" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>: alongside (a) the topography for reference, we show (b) the meridionally de-trended time-mean <inline-formula><mml:math id="M436" display="inline"><mml:mrow><mml:mo>〈</mml:mo><mml:mi>R</mml:mi><mml:mo>〉</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow><mml:mi>L</mml:mi></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula> and (c) the standard deviation <inline-formula><mml:math id="M437" display="inline"><mml:msqrt><mml:mrow><mml:mo>〈</mml:mo><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>〉</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mo>〈</mml:mo><mml:mi>R</mml:mi><mml:msup><mml:mo>〉</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:msqrt></mml:math></inline-formula>. Note the restricted latitude range. In Fig. <xref ref-type="fig" rid="F4"/>a and b, dashed lines show the corresponding mean and standard deviation of <inline-formula><mml:math id="M438" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> itself, as in Fig. <xref ref-type="fig" rid="F3"/>c and d, of which <inline-formula><mml:math id="M439" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> is a regional average: note that <inline-formula><mml:math id="M440" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> has the same mean as <inline-formula><mml:math id="M441" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> but a smaller standard deviation, and larger box sizes would reduce it even further.</p>
      <p id="d2e11172">While the low-order moments capture ordinary behavior of intensities <inline-formula><mml:math id="M442" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>, the intensity peaks – i.e., severities <inline-formula><mml:math id="M443" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, defined in Sect. <xref ref-type="sec" rid="Ch1.S2"/> – are better viewed from an extreme value theory perspective, and summarized with the peaks-over-threshold procedure <xref ref-type="bibr" rid="bib1.bibx10" id="paren.83"/>. We set a threshold <inline-formula><mml:math id="M444" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> as the <inline-formula><mml:math id="M445" display="inline"><mml:mrow><mml:msup><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>th complementary quantile of <inline-formula><mml:math id="M446" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>, also denoted <inline-formula><mml:math id="M447" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mfenced close="]" open="["><mml:mrow><mml:msup><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, i.e., the level whose exceedance probability is <inline-formula><mml:math id="M448" display="inline"><mml:mrow><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mfenced close=")" open="("><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. Severities <inline-formula><mml:math id="M449" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> are extracted as cluster maxima above <inline-formula><mml:math id="M450" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>, with buffer times <inline-formula><mml:math id="M451" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> d and <inline-formula><mml:math id="M452" display="inline"><mml:mrow><mml:mi>B</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> d. All cluster maxima from the long DNS are used as input data points to infer the parameters (scale <inline-formula><mml:math id="M453" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>, shape <inline-formula><mml:math id="M454" display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula>) of a generalized Pareto distribution (GPD), using the maximum-likelihood routine of the <monospace>Extremes.jl</monospace> package <xref ref-type="bibr" rid="bib1.bibx31" id="paren.84"/>:

            <disp-formula id="Ch1.E36" content-type="numbered"><label>36</label><mml:math id="M455" display="block"><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="double-struck">P</mml:mi><mml:mfenced open="{" close="}"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>&gt;</mml:mo><mml:mi>r</mml:mi><mml:mo>|</mml:mo><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>≈</mml:mo><mml:msub><mml:mi>G</mml:mi><mml:mi mathvariant="italic">μ</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mfenced open="{" close=""><mml:mtable class="array" columnalign="left left"><mml:mtr><mml:mtd><mml:mrow><mml:msubsup><mml:mfenced open="[" close="]"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi>r</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mfrac></mml:mstyle></mml:mstyle></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mi>exp⁡</mml:mi><mml:mfenced close="]" open="["><mml:mrow><mml:mo>-</mml:mo><mml:msub><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi>r</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mfrac></mml:mstyle></mml:mstyle></mml:mfenced><mml:mo>+</mml:mo></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M456" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="normal">max</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. Figure <xref ref-type="fig" rid="F4"/>d–f displays the threshold (detrended by <inline-formula><mml:math id="M457" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow><mml:mi>L</mml:mi></mml:mfrac></mml:mstyle></mml:math></inline-formula>), scale parameter <inline-formula><mml:math id="M458" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>, and shape parameter <inline-formula><mml:math id="M459" display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula>. Several characteristics are noteworthy.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e11546">Probability distributions of local tracer concentrations at latitude <inline-formula><mml:math id="M460" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> and averaged over a box of half-width <inline-formula><mml:math id="M461" display="inline"><mml:mrow><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>. <bold>(a)</bold> The full PDF of intensity <inline-formula><mml:math id="M462" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>. <bold>(b)</bold> The CCDF (tail integral) of intensity <inline-formula><mml:math id="M463" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>, restricted to <inline-formula><mml:math id="M464" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mfenced open="[" close="]"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced></mml:mrow></mml:math></inline-formula>. <bold>(c)</bold> Further zoomed-in CCDF of the severity <inline-formula><mml:math id="M465" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="normal">peaks</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">of</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi>R</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">over</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="italic">μ</mml:mi><mml:mfenced close="]" open="["><mml:mrow><mml:msup><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>. In all three panels, solid black and red lines represent estimates from long and short DNS, respectively, with shaded 90 % confidence intervals obtained by repeating the inference 64 times, once for each possible longitudinal rotation of the dataset. Error bars become degenerate at levels experienced by <inline-formula><mml:math id="M466" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> % of longitudes. Black dashed lines show the mean over all longitudinal rotations, our best estimate of ground truth. The gray line in <bold>(b, c)</bold> represents the GPD fit to <inline-formula><mml:math id="M467" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> with <inline-formula><mml:math id="M468" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.52</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M469" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.06</mml:mn></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M470" display="inline"><mml:mrow><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.31</mml:mn></mml:mrow></mml:math></inline-formula>, and this is a much better fit to the severities in <bold>(c)</bold> which makes sense given they are defined in terms of peaks.</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f05.png"/>

        </fig>

      <p id="d2e11742"><list list-type="bullet">
            <list-item>

      <p id="d2e11747">The detrended threshold <inline-formula><mml:math id="M471" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>-</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow><mml:mi>L</mml:mi></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula> has a maximum-over-minimum profile similar to the the detrended mean intensity <inline-formula><mml:math id="M472" display="inline"><mml:mrow><mml:mo>〈</mml:mo><mml:mi>R</mml:mi><mml:mo>〉</mml:mo><mml:mo>-</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow><mml:mi>L</mml:mi></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula>, but shifted southward. The maximum of <inline-formula><mml:math id="M473" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>-</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow><mml:mi>L</mml:mi></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula> is close to the mid-channel maximum in the standard deviation of <inline-formula><mml:math id="M474" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>, perhaps because extremes depend more on variability than on average behavior. </p>
            </list-item>
            <list-item>

      <p id="d2e11825">As expected for an upper bounded tail, we find <inline-formula><mml:math id="M475" display="inline"><mml:mrow><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> at all latitudes (Fig. <xref ref-type="fig" rid="F4"/>f).</p>
            </list-item>
            <list-item>

      <p id="d2e11845">The GPD scale parameter, <inline-formula><mml:math id="M476" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>, is anti-correlated with the (detrended) mean. The constraint <inline-formula><mml:math id="M477" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> can explain this, as a higher distribution center leaves less room for an expansive tail. In addition, the threshold <inline-formula><mml:math id="M478" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> tracks approximately with the mean, and we can understand the anticorrelation mathematically through the non-uniqueness of GPD parameters: the same tail can be adequately described by two different choices of threshold <inline-formula><mml:math id="M479" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, and the two corresponding scale parameters will be related by <inline-formula><mml:math id="M480" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. Only the shape parameter, <inline-formula><mml:math id="M481" display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula>, is invariant with respect to <inline-formula><mml:math id="M482" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>. Seeing that <inline-formula><mml:math id="M483" display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula> is negative and varies only slightly with latitude, <inline-formula><mml:math id="M484" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M485" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> would vary inversely even if the tail itself were not changing.</p>
            </list-item>
            <list-item>

      <p id="d2e11976">The mean appears odd-symmetric and the standard deviation appears even-symmetric about the midline (Fig. <xref ref-type="fig" rid="F4"/>b and c), which is not surprising given the tracer boundary conditions which transform as <inline-formula><mml:math id="M486" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>↦</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:math></inline-formula> when <inline-formula><mml:math id="M487" display="inline"><mml:mrow><mml:mi>y</mml:mi><mml:mo>↦</mml:mo><mml:mi>L</mml:mi><mml:mo>-</mml:mo><mml:mi>y</mml:mi></mml:mrow></mml:math></inline-formula>, negating the sign of fluctuations but leaving their absolute value constant (or perhaps disrupted slightly by topography). However, the GPD parameters are not symmetric (Fig. <xref ref-type="fig" rid="F4"/>d–f), because they describe the <italic>upper</italic> tail of the local <inline-formula><mml:math id="M488" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> distribution, and the transformation <inline-formula><mml:math id="M489" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>↦</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:math></inline-formula> swaps the lower and upper tails. The subsequent figures (Figs. <xref ref-type="fig" rid="F5"/> and <xref ref-type="fig" rid="F7"/>) demonstrate pronounced skewness, so the upper and lower tails are markedly different. These partial symmetries will imprint upon the COAST's latitudinal variation seen later in Figs. <xref ref-type="fig" rid="F14"/> and <xref ref-type="fig" rid="F15"/>.</p>
            </list-item>
          </list></p>
      <p id="d2e12056">We implemented the boosting and estimation procedures for every latitude separately, but for illustration focus the in-depth analysis on <inline-formula><mml:math id="M490" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> (the small boxes in Fig. <xref ref-type="fig" rid="F2"/>), an interesting location where the (detrended) mean and threshold <inline-formula><mml:math id="M491" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:msup><mml:mfenced close=")" open="("><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula> are both low, the GPD scale <inline-formula><mml:math id="M492" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> is large, and the GPD shape slightly more negative than in surrounding regions. Figure <xref ref-type="fig" rid="F5"/> displays the underlying probability distributions at <inline-formula><mml:math id="M493" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> to show the nature of the tails of the distributions and also to help clarify the relationship between intensities, severities, and GPD parameters. The full PDF of intensity, in Fig. <xref ref-type="fig" rid="F5"/>a, has a positive skew and sub-Gaussian tail. Black and red solid curves are estimates obtained from the long and short DNS, respectively, and 90% confidence intervals are obtained by longitudinal translation. Specifically, the shaded intervals are the 5th–95th percentile ranges of intensities at the same <inline-formula><mml:math id="M494" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, but with <inline-formula><mml:math id="M495" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> shifted from its base location of <inline-formula><mml:math id="M496" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> by <inline-formula><mml:math id="M497" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula>, <inline-formula><mml:math id="M498" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M499" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>, …, <inline-formula><mml:math id="M500" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">63</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>. The dashed black curve is the mean of all 64 curves, our best available estimate of ground truth. The discrepancy between short and long DNS is most pronounced in the upper tail, which in Fig. <xref ref-type="fig" rid="F5"/>b is magnified and integrated from the top, giving the CCDF. Gray lines mark the threshold, <inline-formula><mml:math id="M501" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.52</mml:mn></mml:mrow></mml:math></inline-formula>, and its CCDF value <inline-formula><mml:math id="M502" display="inline"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">32</mml:mn></mml:mfrac></mml:mstyle><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">0.03</mml:mn></mml:mrow></mml:math></inline-formula>. Above this level, the short DNS becomes rapidly more uncertain (error bar widens), and severely underestimates probabilities smaller than <inline-formula><mml:math id="M503" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">0.005</mml:mn></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e12266">Both short and long DNS estimates diverge markedly from the GPD fit shown in gray in Fig. <xref ref-type="fig" rid="F5"/>b. This is where the distinction between intensity and severity comes into play: the GPD is fitted to <italic>peaks over the threshold</italic> <inline-formula><mml:math id="M504" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> – i.e., severities – whose distribution differs (most notably in the upward direction) from that of <italic>all</italic> exceedances over <inline-formula><mml:math id="M505" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>, which would include the clusters surrounding the peaks. Figure <xref ref-type="fig" rid="F5"/>c confirms that the GPD fit is much more appropriate for severities <inline-formula><mml:math id="M506" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> than for intensities <inline-formula><mml:math id="M507" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>, and thereby clarifies the distinction. If the threshold were raised, the clusters would shrink, the sequence of peaks would form a Poisson process, and the CCDFs of <inline-formula><mml:math id="M508" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M509" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> would converge. For computational economy and because non-asymptotic extremes are of interest for climate risk, we keep the threshold at <inline-formula><mml:math id="M510" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:msup><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula> and formally define our goal with boosting as correcting the distribution of severities – not intensities. Hence, our measure of success will be whether the short-DNS severity CCDF in Fig. <xref ref-type="fig" rid="F5"/>c, when augmented by boosting, will become closer to the long-DNS severity CCDF.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Ensemble design</title>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Stochastic inputs</title>
      <p id="d2e12369">We perturb the QG model with impulsive forcing, which we now specify as a concrete version of the generic form in Sect. <xref ref-type="sec" rid="Ch1.S2"/>. The stochastic input <inline-formula><mml:math id="M511" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula> lives in the complex plane <inline-formula><mml:math id="M512" display="inline"><mml:mrow><mml:mi mathvariant="double-struck">C</mml:mi><mml:mo>(</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="normal">Ω</mml:mi></mml:mrow></mml:math></inline-formula>, the “input space”), and the state-space perturbation <inline-formula><mml:math id="M513" display="inline"><mml:mrow><mml:mi>G</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> consists of a single Fourier mode to be added to <inline-formula><mml:math id="M514" display="inline"><mml:mi mathvariant="italic">ψ</mml:mi></mml:math></inline-formula>. We stress that our focus here is on optimizing AST, not the perturbation space <inline-formula><mml:math id="M515" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula>, so the choice of mode is arbitrary so long as <inline-formula><mml:math id="M516" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula> remains low-dimensional. The optimal AST would probably change if <inline-formula><mml:math id="M517" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula> changes, e.g., if we perturbed a different mode or multiple modes at once; but the <italic>rule for choosing it</italic> based on entropy may well generalize, which will have to be tested in follow-up research.</p>
      <p id="d2e12441">Bearing these caveats in mind, we select a mode  that is likely to grow fast, according to linear stability analysis, which is more easily explained as a procedure than as a closed formula: <list list-type="order"><list-item>
      <p id="d2e12446">Decompose <inline-formula><mml:math id="M518" display="inline"><mml:mi mathvariant="italic">ψ</mml:mi></mml:math></inline-formula> into a Fourier basis <inline-formula><mml:math id="M519" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mi>z</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi></mml:mrow></mml:munder><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mi>z</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mi>x</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mi>y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, and write the linearized dynamics (about the baroclinically unstable background state with vertical zonal wind shear and <inline-formula><mml:math id="M520" display="inline"><mml:mrow><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, and neglecting topography) into the abstract form<disp-formula id="Ch1.E37" content-type="numbered"><label>37</label><mml:math id="M521" display="block"><mml:mrow><mml:mi mathvariant="bold">C</mml:mi><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mi mathvariant="normal">d</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:mfrac></mml:mstyle><mml:mfenced open="[" close="]"><mml:mtable class="matrix" columnalign="center" framespacing="0em"><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>=</mml:mo><mml:mi mathvariant="bold">D</mml:mi><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo><mml:mfenced close="]" open="["><mml:mtable class="matrix" columnalign="center" framespacing="0em"><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></disp-formula>where <inline-formula><mml:math id="M522" display="inline"><mml:mrow><mml:mi mathvariant="bold">C</mml:mi><mml:mo>∈</mml:mo><mml:msup><mml:mi mathvariant="double-struck">C</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> represents the conversion from streamfunction to potential vorticity, and <inline-formula><mml:math id="M523" display="inline"><mml:mrow><mml:mi>D</mml:mi><mml:mo>∈</mml:mo><mml:msup><mml:mi mathvariant="double-struck">C</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> represents the advection and linear dissipation terms (excluding topography).</p></list-item><list-item>
      <p id="d2e12712">Calculate the eigenvalues and eigenvectors <inline-formula><mml:math id="M524" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="italic">λ</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="italic">φ</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>:</mml:mo><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> of the Jacobian matrix <inline-formula><mml:math id="M525" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold">C</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="bold">D</mml:mi><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, ordered by stability: <inline-formula><mml:math id="M526" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:msup><mml:mi mathvariant="italic">λ</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="italic">}</mml:mo><mml:mo>≥</mml:mo><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:msup><mml:mi mathvariant="italic">λ</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, and select <inline-formula><mml:math id="M527" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>k</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="normal">argmax</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="italic">{</mml:mo><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:msup><mml:mi mathvariant="italic">λ</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, i.e., the linearly most unstable mode from the background state. Restrict the optimization to <inline-formula><mml:math id="M528" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> both nonnegative, and not both zero.</p></list-item><list-item>
      <p id="d2e12951">For <inline-formula><mml:math id="M529" display="inline"><mml:mrow><mml:mi>z</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula>, increment <inline-formula><mml:math id="M530" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mi>z</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msup><mml:mi>k</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> by <inline-formula><mml:math id="M531" display="inline"><mml:mrow><mml:mi mathvariant="italic">ω</mml:mi><mml:msubsup><mml:mover accent="true"><mml:mi mathvariant="italic">φ</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>z</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:msup><mml:mi>k</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, and to maintain the solution's reality add the complex conjugate (c.c.) to <inline-formula><mml:math id="M532" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>z</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:msup><mml:mi>k</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>,</mml:mo><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. The perturbation can be written as a function of space,<disp-formula id="Ch1.E38" content-type="numbered"><label>38</label><mml:math id="M533" display="block"><mml:mrow><mml:mi>G</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">δ</mml:mi><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mi>z</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:msubsup><mml:mover accent="true"><mml:mi mathvariant="italic">φ</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mi>z</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mfenced open="(" close=")"><mml:mrow><mml:msup><mml:mi>k</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:mfenced><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msup><mml:mi>k</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mi>x</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mi>y</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="normal">c</mml:mi><mml:mo>.</mml:mo><mml:mi mathvariant="normal">c</mml:mi><mml:mo>.</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>which can have pointwise magnitudes up to <inline-formula><mml:math id="M534" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>|</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula>. In the QG model, the mode we identify is <inline-formula><mml:math id="M535" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi>k</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M536" display="inline"><mml:mrow><mml:mi>G</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is plotted in Fig. <xref ref-type="fig" rid="F6"/>c for three different example <inline-formula><mml:math id="M537" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula>s, which correspond to the points labeled 1, 2, 3 in Fig. <xref ref-type="fig" rid="F6"/>a. All share the same inter-layer <italic>relative</italic> phase and magnitude, as these are properties of <inline-formula><mml:math id="M538" display="inline"><mml:mrow><mml:msup><mml:mi>k</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M539" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi mathvariant="italic">φ</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>z</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:msup><mml:mi>k</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, but differ in <italic>absolute</italic> phase and magnitude. Note that points 2 and 3 are approximately diametrically opposed, and hence spatially <inline-formula><mml:math id="M540" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">180</mml:mn></mml:mrow></mml:math></inline-formula>° out of phase, whereas point 1 is approximately one-quarter revolution away and spatially <inline-formula><mml:math id="M541" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">90</mml:mn></mml:mrow></mml:math></inline-formula>° out of phase with both 2 and 3. Points (2, 3) are (closest to, farthest from) the center of the circle, and hence have the (smallest, largest)-magnitude spatial perturbations. of the three examples shown.</p></list-item></list></p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e13337">Structure of perturbations and their probability distribution. <bold>(a)</bold> Level sets of each considered input distribution from scales <inline-formula><mml:math id="M542" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.06</mml:mn></mml:mrow></mml:math></inline-formula> (red) to <inline-formula><mml:math id="M543" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula> (blue), each scale restricted to <inline-formula><mml:math id="M544" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">15</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> of the circle each so that all scales may be seen. Labels on the outer edge of the circle indicate the corresponding scale. Dots show the 21 impulses used at each AST before each ancestor, sampled by quasi-Monte Carlo. <bold>(b)</bold> One-dimensional transects of <inline-formula><mml:math id="M545" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>;</mml:mo><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mi>W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> at each scale. <bold>(c)</bold> The shape of perturbations to the streamfunction corresponding to <inline-formula><mml:math id="M546" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ω</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M547" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ω</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M548" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ω</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. Note that the absolute amplitudes and phases vary, sampling the two degrees of freedom in the disc, but the relative amplitudes and phases of the upper and lower layers are fixed.</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f06.png"/>

        </fig>

      <p id="d2e13447">The steps above completely specify <inline-formula><mml:math id="M549" display="inline"><mml:mrow><mml:mi>G</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, a linear map from <inline-formula><mml:math id="M550" display="inline"><mml:mi mathvariant="double-struck">C</mml:mi></mml:math></inline-formula> to functions of (<inline-formula><mml:math id="M551" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M552" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M553" display="inline"><mml:mi>z</mml:mi></mml:math></inline-formula>), which can be easily computed offline before running any ensembles. One could argue for two obvious refinements of this choice: (1) specializing the linearization to the actual initial state, not just the background state, by linearizing the quadratic form <inline-formula><mml:math id="M554" display="inline"><mml:mrow><mml:mi>J</mml:mi><mml:mo>(</mml:mo><mml:mi>q</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and including that in the calculation of <inline-formula><mml:math id="M555" display="inline"><mml:mrow><mml:mi>D</mml:mi><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ℓ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; and (2) accounting for finite time horizons by using the leading singular vector of the <italic>linear propagator</italic>, i.e., the initial <italic>infinitesimal</italic> error whose magnitude amplifies the most over a given time horizon <xref ref-type="bibr" rid="bib1.bibx15 bib1.bibx16" id="paren.85"/> and which could be estimated by a bred vector approach <xref ref-type="bibr" rid="bib1.bibx49" id="paren.86"/>.</p>
      <p id="d2e13541">For this study, we stick to the simpler choice of the most unstable modes of the background shear, choosing to focus attention on the less-studied optimization of the advance split time given a fixed perturbation shape. There are several reasons that singular vectors may not be suitable for our goals. First, it is easier to compare different initial conditions, different advance split times, and even different topographies (which we do not do here) when they are all subject to precisely the same perturbation. Second, as our results will demonstrate, the COAST tends to lie beyond the time range where linearized error dynamics are appropriate, which is natural because we aim for finite-amplitude boosts in extreme events. Third, singular vectors are typically designed to optimize global errors, which might not be as relevant for local extremes. Fourth, such highly specialized perturbation shapes might not be accessible in a generic GCM. Nonetheless, sensitivity analysis with respect to perturbation shape leads the agenda for follow-up work.</p>
      <p id="d2e13545">Having fixed a subspace <inline-formula><mml:math id="M556" display="inline"><mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="double-struck">C</mml:mi></mml:mrow></mml:math></inline-formula> for perturbations <inline-formula><mml:math id="M557" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula>, we need to specify an input distribution <inline-formula><mml:math id="M558" display="inline"><mml:mrow><mml:msup><mml:mi>p</mml:mi><mml:mi mathvariant="normal">Ω</mml:mi></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> over that space. We design the PDF for <inline-formula><mml:math id="M559" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula> as a radially symmetric, smooth, “bump function” which has compact support in order to prevent perturbations so large as to induce oscillatory transients. The PDF is parameterized by two scales: <inline-formula><mml:math id="M560" display="inline"><mml:mi>W</mml:mi></mml:math></inline-formula> which is the maximum permissible magnitude of <inline-formula><mml:math id="M561" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M562" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula> which sets the typical perturbation strength:

            <disp-formula id="Ch1.E39" content-type="numbered"><label>39</label><mml:math id="M563" display="block"><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>;</mml:mo><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mi>W</mml:mi><mml:mo>)</mml:mo><mml:mo>∝</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi>exp⁡</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:msup><mml:mi>s</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:msup><mml:mo>|</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:msup><mml:mi>W</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="normal">for</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>|</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>|</mml:mo><mml:mo>&lt;</mml:mo><mml:mi>W</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">and</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mn mathvariant="normal">0</mml:mn><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">for</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>|</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>|</mml:mo><mml:mo>≥</mml:mo><mml:mi>W</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          When <inline-formula><mml:math id="M564" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>≪</mml:mo><mml:mi>W</mml:mi></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M565" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> is approximately a bivariate Gaussian density with diagonal covariance <inline-formula><mml:math id="M566" display="inline"><mml:mrow><mml:msup><mml:mi>s</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mi>I</mml:mi></mml:mrow></mml:math></inline-formula>. When <inline-formula><mml:math id="M567" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>≳</mml:mo><mml:mi>W</mml:mi></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M568" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> is approximately uniform over the <inline-formula><mml:math id="M569" display="inline"><mml:mi>W</mml:mi></mml:math></inline-formula>-disc <inline-formula><mml:math id="M570" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>:</mml:mo><mml:mo>|</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>|</mml:mo><mml:mo>≤</mml:mo><mml:mi>W</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, with rapid (but mathematically smooth) transition to 0 on the boundary. We fix <inline-formula><mml:math id="M571" display="inline"><mml:mrow><mml:mi>W</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.3</mml:mn></mml:mrow></mml:math></inline-formula>, limiting the maximum possible perturbation amplitude to <inline-formula><mml:math id="M572" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="italic">δ</mml:mi><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo>|</mml:mo><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>W</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.6</mml:mn></mml:mrow></mml:math></inline-formula> (see text after Eq. <xref ref-type="disp-formula" rid="Ch1.E38"/>), which is small compared to the characteristic streamfunction amplitude of <inline-formula><mml:math id="M573" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="italic">ψ</mml:mi><mml:mo>|</mml:mo><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula>. We include <inline-formula><mml:math id="M574" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula> as a parameter to vary because there is no established principle to set the magnitude of impulses for the purpose of rare event sampling. In contrast, numerical weather prediction has an established (if heuristic) practice of tuning noise amplitude to match ensemble spread with model error <xref ref-type="bibr" rid="bib1.bibx3" id="paren.87"><named-content content-type="pre">e.g.,</named-content></xref>. Optimizing for climatological accuracy is a different, murkier goal calling for less prejudice with regard to perturbation magnitude. We therefore vary <inline-formula><mml:math id="M575" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula> widely from 0.06 to 0.9 in increments of 0.06 for 15 total values. <inline-formula><mml:math id="M576" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula> is the impulsive-forcing analogue to the continuous-forcing amplitude that we called <inline-formula><mml:math id="M577" display="inline"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> in <xref ref-type="bibr" rid="bib1.bibx17" id="text.88"/>, which strongly influenced the perturbation growth rate and therefore the optimal advance split time.</p>
      <p id="d2e13919">Figure <xref ref-type="fig" rid="F6"/>a and b depicts <inline-formula><mml:math id="M578" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>;</mml:mo><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mi>W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> in two ways: (a) two-dimensional level sets of the unnormalized density (Eq. <xref ref-type="disp-formula" rid="Ch1.E39"/>) logarithmically spaced from <inline-formula><mml:math id="M579" display="inline"><mml:mrow><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> to <inline-formula><mml:math id="M580" display="inline"><mml:mrow><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, each value of <inline-formula><mml:math id="M581" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula> occupying one of 15 sectors of the circle; and (b) one-dimensional transects across <inline-formula><mml:math id="M582" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>;</mml:mo><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mi>W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> fixing <inline-formula><mml:math id="M583" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo mathvariant="italic">}</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>. To save the labor of drawing Monte Carlo samples from <inline-formula><mml:math id="M584" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>;</mml:mo><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mi>W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> separately and simulating the perturbed children for each value of <inline-formula><mml:math id="M585" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula>, we compute the MoCTail and PoPTail estimators using numerical quadrature over the <inline-formula><mml:math id="M586" display="inline"><mml:mi>W</mml:mi></mml:math></inline-formula>-disc using a single set of samples drawn by <italic>quasi</italic>-Monte Carlo (QMC), and displayed as black dots in Fig. <xref ref-type="fig" rid="F6"/>a. QMC is a general strategy which places samples deterministically across the input space in a way that mimics properties of randomness, but with lower <italic>discrepancy</italic> (fewer clumps and patches), thereby aiming to reduce variance in estimated statistics <xref ref-type="bibr" rid="bib1.bibx37" id="paren.89"/>. We specifically use the <monospace>LatticeRuleSampler</monospace> from the <monospace>QuasiMonteCarlo.jl</monospace> Julia library <xref ref-type="bibr" rid="bib1.bibx61" id="paren.90"/> to distribute points <inline-formula><mml:math id="M587" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>U</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>V</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:msubsup><mml:mo mathvariant="italic">}</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>M</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> quasi-uniformly on the unit square [0, 1]<sup>2</sup>, and transform them to the <inline-formula><mml:math id="M589" display="inline"><mml:mi>W</mml:mi></mml:math></inline-formula>-disc with the formula

            <disp-formula id="Ch1.E40" content-type="numbered"><label>40</label><mml:math id="M590" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ω</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>W</mml:mi><mml:msqrt><mml:mrow><mml:msub><mml:mi>U</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:msqrt><mml:mi>exp⁡</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:mi>i</mml:mi><mml:msub><mml:mi>V</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

          Since <inline-formula><mml:math id="M591" display="inline"><mml:mrow><mml:msub><mml:mi>U</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is a “quasi-random sample” of the uniformly distributed random variable <inline-formula><mml:math id="M592" display="inline"><mml:mrow><mml:mi>U</mml:mi><mml:mo>∼</mml:mo><mml:mi mathvariant="script">U</mml:mi><mml:mo>(</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>]</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, we have

            <disp-formula id="Ch1.E41" content-type="numbered"><label>41</label><mml:math id="M593" display="block"><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="double-struck">P</mml:mi><mml:mfenced close="}" open="{"><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>≤</mml:mo><mml:mo>|</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>|</mml:mo><mml:mo>≤</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mi mathvariant="double-struck">P</mml:mi><mml:mfenced close="}" open="{"><mml:mrow><mml:msubsup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>≤</mml:mo><mml:msup><mml:mi>W</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mi>U</mml:mi><mml:mo>≤</mml:mo><mml:msubsup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mi mathvariant="double-struck">P</mml:mi><mml:mfenced close="}" open="{"><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msup><mml:mi>W</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>≤</mml:mo><mml:mi>U</mml:mi><mml:mo>≤</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msup><mml:mi>W</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msup><mml:mi>W</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          which is the fraction of the <inline-formula><mml:math id="M594" display="inline"><mml:mi>W</mml:mi></mml:math></inline-formula>-disc between the radii <inline-formula><mml:math id="M595" display="inline"><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M596" display="inline"><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. The phase <inline-formula><mml:math id="M597" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:mi>V</mml:mi></mml:mrow></mml:math></inline-formula> is clearly <inline-formula><mml:math id="M598" display="inline"><mml:mrow><mml:mi mathvariant="script">U</mml:mi><mml:mo>(</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:mo>]</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. If <inline-formula><mml:math id="M599" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M600" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula> were independent random variables, we would immediately conclude <inline-formula><mml:math id="M601" display="inline"><mml:mi mathvariant="italic">ω</mml:mi></mml:math></inline-formula> is uniformly distributed over the <inline-formula><mml:math id="M602" display="inline"><mml:mi>W</mml:mi></mml:math></inline-formula>-disc; in QMC they are not independent, but the conclusion still holds true <xref ref-type="bibr" rid="bib1.bibx37" id="paren.91"/>. In all experiments to follow, <inline-formula><mml:math id="M603" display="inline"><mml:mrow><mml:mi>M</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">21</mml:mn></mml:mrow></mml:math></inline-formula>, corresponding to the 21 points plotted in Fig. <xref ref-type="fig" rid="F6"/>a. While other sampling rules are possible, the <monospace>LatticeRuleSampler</monospace> enjoys a distinct advantage of being extensible: sampling 12 points at first and later deciding to add 9 more gives the same result as sampling 21 in one batch.</p>
</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Sweeping over ancestors and advance split times</title>
      <p id="d2e14477">Following the procedure laid out in Sect. <xref ref-type="sec" rid="Ch1.S2"/>, we apply each perturbation <inline-formula><mml:math id="M604" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi mathvariant="italic">ω</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:msubsup><mml:mo mathvariant="italic">}</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>M</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> to a collection of ancestor events <inline-formula><mml:math id="M605" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>)</mml:mo><mml:msubsup><mml:mo mathvariant="italic">}</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> at a range of ASTs <inline-formula><mml:math id="M606" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:msubsup><mml:mo mathvariant="italic">}</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>J</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>. We set the number of ancestors, <inline-formula><mml:math id="M607" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> to whichever is smaller: the total number of cluster maxima (see Sect. <xref ref-type="sec" rid="Ch1.S3"/>) in the short DNS, or 32. Considering all latitudes, the minimum <inline-formula><mml:math id="M608" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> was 14, the median was 22, and the maximum 32 was found at four latitudes including <inline-formula><mml:math id="M609" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> which we consider in more depth. In the equal-cost comparisons to be shown later, we restrict <inline-formula><mml:math id="M610" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> to smaller values. The ASTs sampled are <inline-formula><mml:math id="M611" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:msubsup><mml:mo mathvariant="italic">}</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mi>J</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">40</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, with a two-day spacing chosen as roughly half the period of small fluctuations in <inline-formula><mml:math id="M612" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> (see Fig. <xref ref-type="fig" rid="F7"/>).</p>

      <fig id="F7"><label>Figure 7</label><caption><p id="d2e14697">Boosted ensembles of two selected events: <bold>(a)</bold> time <inline-formula><mml:math id="M613" display="inline"><mml:mrow><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">4152</mml:mn></mml:mrow></mml:math></inline-formula> at latitude <inline-formula><mml:math id="M614" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">38</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>, and <bold>(b)</bold> time <inline-formula><mml:math id="M615" display="inline"><mml:mrow><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2702</mml:mn></mml:mrow></mml:math></inline-formula> at latitude <inline-formula><mml:math id="M616" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>. These are times when the intensity function <inline-formula><mml:math id="M617" display="inline"><mml:mrow><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> from the short DNS (dashed black curves) achieved a peak value (horizontal dashed black lines) above the threshold <inline-formula><mml:math id="M618" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:msup><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula> (horizontal gray lines). For each AST <inline-formula><mml:math id="M619" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>∈</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mn mathvariant="normal">40</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, an ensemble of perturbed events (descendants) is launched at <inline-formula><mml:math id="M620" display="inline"><mml:mrow><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>-</mml:mo><mml:mi>A</mml:mi></mml:mrow></mml:math></inline-formula>, indexed by <inline-formula><mml:math id="M621" display="inline"><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, …, 21. For three selected ASTs <inline-formula><mml:math id="M622" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">16</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">32</mml:mn></mml:mrow></mml:math></inline-formula>, the full timeseries <inline-formula><mml:math id="M623" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:msubsup><mml:mo mathvariant="italic">}</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">21</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> are shown in <bold>(a, b).i</bold>. The red-to-blue color scale indicates short-to-long ASTs. Each descendant achieves a different severity <inline-formula><mml:math id="M624" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mi>m</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> (peak intensity), indicated by circles in <bold>(a, b).i</bold> at (<inline-formula><mml:math id="M625" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mi>A</mml:mi><mml:mo>,</mml:mo><mml:msubsup><mml:mi>R</mml:mi><mml:mi>m</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>) for all values of <inline-formula><mml:math id="M626" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>. The peaks also occur at different times <inline-formula><mml:math id="M627" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>m</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, indicated in <bold>(a, b).ii</bold> by stars at (<inline-formula><mml:math id="M628" display="inline"><mml:mrow><mml:msubsup><mml:mi>t</mml:mi><mml:mi>m</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>,</mml:mo><mml:msubsup><mml:mi>R</mml:mi><mml:mi>m</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>), again for all <inline-formula><mml:math id="M629" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> and colored accordingly.</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f07.png"/>

        </fig>

</sec>
</sec>
<sec id="Ch1.S5">
  <label>5</label><title>Results: conditional severity distributions</title>
      <p id="d2e15043">In this section we present some case studies of conditional perturbed ensembles (from individual ancestors) and corresponding dispersion measures to be subsequently used in the MoCTail and PoPTail estimation. The results will add context and motivation to the protocols laid out above, and set the stage for the aggregation of results across ancestors.</p>
<sec id="Ch1.S5.SS1">
  <label>5.1</label><title>Perturbed ensembles: case studies</title>
      <p id="d2e15053">Figure <xref ref-type="fig" rid="F7"/> displays a small but representative sample of boosted ensembles at two target latitudes at the inner edges of the two eastward jets: (a) <inline-formula><mml:math id="M630" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">38</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> and (b) <inline-formula><mml:math id="M631" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula>. The ancestors' intensity (black dashed curves) reach their respective peaks at times <inline-formula><mml:math id="M632" display="inline"><mml:mrow><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">4152</mml:mn></mml:mrow></mml:math></inline-formula> for (a) and 2702 for (b). Note the differences in peak value and peak shape: the upper latitude has long-lasting, flat maxima and the lower latitude has brief, spiky maxima. The statistical properties at these two locations, both in Figs. <xref ref-type="fig" rid="F7"/> and <xref ref-type="fig" rid="F3"/>, are approximately equivalent after reflection about <inline-formula><mml:math id="M633" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> (<inline-formula><mml:math id="M634" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>→</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:math></inline-formula>), meaning the upper tail of one resembles the lower tail of the other. This can be understood by the approximate north-south symmetry of the tracer's dynamics imposed by Dirichlet boundary conditions.</p>
      <p id="d2e15145">We show the perturbed intensities launched from three ASTs <inline-formula><mml:math id="M635" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>∈</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">16</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">32</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, colored (red, orange, blue) respectively. Following the split time, the ensemble members spread apart from the parent and from each other, achieving their own peak values (severities) that differ in both amplitude and timing from the ancestor, the discrepancies increasing with <inline-formula><mml:math id="M636" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>. The red curves (<inline-formula><mml:math id="M637" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula>) replicate the ancestral peak very closely; the orange curves (<inline-formula><mml:math id="M638" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">16</mml:mn></mml:mrow></mml:math></inline-formula>) peak at substantially higher or lower levels, and up to <inline-formula><mml:math id="M639" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula> d earlier or later. Still, the orange peaks are clearly dynamically related to the ancestral peaks. This is no longer true for the blue curves (<inline-formula><mml:math id="M640" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">32</mml:mn></mml:mrow></mml:math></inline-formula>), whose intensity peaks are widely scattered in time and systematically lower than the ancestors' peaks.</p>
      <p id="d2e15226">Besides these three selected ASTs, each descendant is charted in  Fig. <xref ref-type="fig" rid="F7"/>a and b.i as a circle color-coded by AST, positioned vertically at its severity value and horizontally at its launch time. A corresponding star is plotted in Fig. <xref ref-type="fig" rid="F7"/>a and b.ii, positioned vertically at its severity value (on a zoomed-in scale) and horizontally at its peak timing (constrained by the “argmax drift” parameter <inline-formula><mml:math id="M641" display="inline"><mml:mrow><mml:mi mathvariant="italic">δ</mml:mi><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> d <inline-formula><mml:math id="M642" display="inline"><mml:mo>≈</mml:mo></mml:math></inline-formula> half of an eddy turnover timescale, as explained in Sect. <xref ref-type="sec" rid="Ch1.S2.SS1"/>). We can see the transition of the <inline-formula><mml:math id="M643" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> ensemble from tightly clustered (for short AST) to roughly independent and climatologically distributed (for long AST), and in between there is a golden window of opportunity where severities can be both large and diverse. The optimal AST must balance these two objectives, a task akin to the exploitation-exploration tradeoff in Bayesian optimization and reinforcement learning <xref ref-type="bibr" rid="bib1.bibx81" id="paren.92"><named-content content-type="pre">e.g.,</named-content></xref>. In this light, the two functionals defined in Eqs. (<xref ref-type="disp-formula" rid="Ch1.E27"/>) and (<xref ref-type="disp-formula" rid="Ch1.E28"/>) are candidate <italic>acquisition functions</italic>.</p>
</sec>
<sec id="Ch1.S5.SS2">
  <label>5.2</label><title>Relating severities to impulses: case studies</title>
      <p id="d2e15291">We now construct “severity response functions” <inline-formula><mml:math id="M644" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">θ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> mapping impulses <inline-formula><mml:math id="M645" display="inline"><mml:mrow><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>∈</mml:mo><mml:mi mathvariant="double-struck">C</mml:mi></mml:mrow></mml:math></inline-formula> to severities <inline-formula><mml:math id="M646" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, approximating the action of the flow map using some empirical parameters <inline-formula><mml:math id="M647" display="inline"><mml:mi mathvariant="italic">θ</mml:mi></mml:math></inline-formula>. This will be needed to estimate conditional and unconditional probabilities through the MoCTail and PoPTail estimators (see Eq. <xref ref-type="disp-formula" rid="Ch1.E5"/>), and will also help to understand the joint dependence between impulses <inline-formula><mml:math id="M648" display="inline"><mml:mrow><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>∈</mml:mo><mml:mi mathvariant="double-struck">C</mml:mi></mml:mrow></mml:math></inline-formula> and the times <inline-formula><mml:math id="M649" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi>t</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> at which they are applied.</p>

      <fig id="F8" specific-use="star"><label>Figure 8</label><caption><p id="d2e15396">The response of an extreme event to perturbations: magnitude, phase, and timing. The event is the same as in Fig. <xref ref-type="fig" rid="F7"/>b. Row <bold>(a)</bold> represents impulses as in Fig. <xref ref-type="fig" rid="F6"/>, but additionally shows the responses to them separately at six sampled ASTs (2, 10, 18, 24, 32, and 40 d marked with vertical gray lines in <bold>c</bold>–<bold>e</bold>), which increase from right to left (launch time <inline-formula><mml:math id="M650" display="inline"><mml:mrow><mml:msup><mml:mi>t</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>-</mml:mo><mml:mi>A</mml:mi></mml:mrow></mml:math></inline-formula> increases left to right). Horizontal and vertical scales are equal. At the shortest AST shown, <inline-formula><mml:math id="M651" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula>, the response function is clearly linear: the impulses above and left of center are marked by <inline-formula><mml:math id="M652" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>, representing an increased severity, and those below and right of center are marked by <inline-formula><mml:math id="M653" display="inline"><mml:mo>•</mml:mo></mml:math></inline-formula>, representing decreased severity, with marker sizes representing the magnitude of the change. Colored curves represent level sets of the fitted linear (blue) and quadratic (red) models, with (solid, dashed, dotted) contours to differentiate (positive, zero, negative) changes to <inline-formula><mml:math id="M654" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>. Row <bold>(b)</bold> displays the quality of these models by plotting true vs. fit responses (again, horizontal and vertical scales are equal). As AST increases, the impulses causing higher and lower severities become more intertwined and less linearly separable, as the red contours progressively bend and separate from the blue contours. Accordingly, the modeled linear response ceases to correlate with the true response. The modeled quadratic response has a slightly longer range of good quality, but also fails for AST <inline-formula><mml:math id="M655" display="inline"><mml:mrow><mml:mo>≳</mml:mo><mml:mn mathvariant="normal">26</mml:mn></mml:mrow></mml:math></inline-formula> d. Row <bold>(c)</bold> shows that the linear components <inline-formula><mml:math id="M656" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M657" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> are estimated similarly (at least in magnitude) regardless of whether quadratic terms are also included. Row <bold>(d)</bold> shows that the quadratic model implies a local maximum (both eigenvalues nonpositive) for most of the range <inline-formula><mml:math id="M658" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">26</mml:mn></mml:mrow></mml:math></inline-formula>, beyond which the landscape starts looking less like a hilltop and more like a saddle. Row <bold>(e)</bold> displays the coefficients of determination, conventionally denoted <inline-formula><mml:math id="M659" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> but not here so as to avoid confusion with intensity <inline-formula><mml:math id="M660" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>.</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f08.png"/>

        </fig>

      <p id="d2e15547">How should the response functions be parameterized? The simplest choice would be a linear model, often used in numerical weather prediction to optimize ensemble spread by perturbing in the most-effective directions, so-called singular vectors <xref ref-type="bibr" rid="bib1.bibx13" id="paren.93"/>. However, linear models are strictly valid only for infinitesimal perturbations, hence short lead times. Similar logic should apply when optimizing for severity instead of ensemble spread, and indeed we demonstrate below that the COAST tends to lie beyond the range where a linear model <inline-formula><mml:math id="M661" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is valid. We therefore construct a quadratic model as well, and it turns out that this minor upgrade is sufficient. Future work with more complex dynamics and objectives may call for more elaborate response functions (orthogonal polynomials, Gaussian processes, and neural networks for example), but we adhere to quadratic models in this study as a proof of concept that is easy to construct and interpret, which we do in the following two figures.</p>
      <p id="d2e15568">The linear and quadratic response functions take the form

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M662" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E42"><mml:mtd><mml:mtext>42</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo>;</mml:mo><mml:mi mathvariant="italic">θ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo mathvariant="italic">}</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi>I</mml:mi><mml:mi>m</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">fitted</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">for</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">both</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">linear</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">and</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">quadratic</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">models</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E43"><mml:mtd><mml:mtext>43</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mo>+</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:msup><mml:mo mathvariant="italic">}</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo mathvariant="italic">}</mml:mo><mml:mi>I</mml:mi><mml:mi>m</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:mo mathvariant="italic">}</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:msub><mml:mi>I</mml:mi><mml:mi>m</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="italic">ω</mml:mi><mml:msup><mml:mo mathvariant="italic">}</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">fitted</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">for</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">quadratic</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">model</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">only</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          We use ordinary least squares regression on the <inline-formula><mml:math id="M663" display="inline"><mml:mrow><mml:mi>M</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">21</mml:mn></mml:mrow></mml:math></inline-formula> sampled impulses <inline-formula><mml:math id="M664" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi mathvariant="italic">ω</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:msubsup><mml:mo mathvariant="italic">}</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>M</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and associated severities <inline-formula><mml:math id="M665" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, in addition to the non-perturbed ancestor (<inline-formula><mml:math id="M666" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ω</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>:=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>) with severity <inline-formula><mml:math id="M667" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow><mml:mo>*</mml:mo></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mi>R</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>. A different set of coefficients is calculated separately for each ancestor <inline-formula><mml:math id="M668" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> and AST <inline-formula><mml:math id="M669" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. The response functions for the same ancestor event as in Fig. <xref ref-type="fig" rid="F7"/>b are visualized in Fig. <xref ref-type="fig" rid="F8"/>, using (a) the two-dimensional response surfaces, (b) the true vs. fitted response values, (c) the overall slope, measured by the linear coefficient magnitudes, (d) the overall curvature, measured by the eigenvalues of the Hessian of the quadratic fit, and (e) the overall linear and quadratic skills, measured by the coefficient of determination. The response surface gradually transforms from a linear plane, to a curved hilltop, to a saddle, to a jagged landscape, as AST increases. Accordingly, the linear and then the quadratic model lose their skill. The quadratic model is slightly better than the linear model for this particular event, but substantially better when averaged across all events (see the forthcoming Fig. <xref ref-type="fig" rid="F9"/>c.i), and so we will use quadratic models only as <inline-formula><mml:math id="M670" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> in the tail estimators.</p>

      <fig id="F9" specific-use="star"><label>Figure 9</label><caption><p id="d2e15969">Severities and their conditional distributions for the same case study as Fig. <xref ref-type="fig" rid="F7"/>b. For six ASTs (same as Fig. <xref ref-type="fig" rid="F8"/>, decreasing from left to right), perturbed severities are displayed as dark red circles along a vertical line, and the unperturbed (ancestral) severity is marked with a horizontal black line. Colored curves and stars show the severity PDFs above <inline-formula><mml:math id="M671" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.52</mml:mn></mml:mrow></mml:math></inline-formula>  as inferred from the quadratic regression, for a range of scales <inline-formula><mml:math id="M672" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula> from 0.06 (red) to 0.9 (blue). Note that the longest ASTs (40 and 32 d) show a substantial probability mass beyond the most-extreme sample. This is a sign of poor quadratic fit, which is consistent with Fig. <xref ref-type="fig" rid="F8"/>e, and fortunately does not affect the later analysis since optimal ASTs are well short of 32 d. Black curves with stars represent the climatological tail PDF, as inferred from the long DNS, which we will seek to estimate by combining conditional distributions over many ancestors (not just the single ancestor considered here).</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f09.png"/>

        </fig>

</sec>
<sec id="Ch1.S5.SS3">
  <label>5.3</label><title>Conditional severity PDFs: case studies</title>
      <p id="d2e16011">Equipped with response functions approximated by quadratic models, we can now construct conditional severity PDFs using Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>), which are displayed in Fig. <xref ref-type="fig" rid="F9"/>. For the same ancestor as in Fig. <xref ref-type="fig" rid="F8"/> and the same six ASTs, we can see the relationship between actually sampled perturbed severities (red circles and lines), fitted severity PDFs (colored curves, one color for each input scale <inline-formula><mml:math id="M673" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula>) evaluated at the bins with lower boundaries <inline-formula><mml:math id="M674" display="inline"><mml:mrow><mml:mfenced close="}" open="{"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:msup><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mi>k</mml:mi></mml:msup></mml:mrow></mml:mfenced><mml:mo>:</mml:mo><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">14</mml:mn></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, and the climatological PDF (black curves). As AST increases from right to left, the severity PDFs morph from narrow spikes centered at the ancestor severity to long, extended lumps reaching far beyond the ancestor severity, and then recede below the threshold <inline-formula><mml:math id="M675" display="inline"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:msup><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>. The PDF's motion resembles a wave crashing onto a shallow beach, blanketing the sand, and then retreating, hitting the true COAST somewhere in the middle stages. But this general behavior is strongly modulated by the choice of scale <inline-formula><mml:math id="M676" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula>: red PDFs, representing the smallest scale <inline-formula><mml:math id="M677" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.06</mml:mn></mml:mrow></mml:math></inline-formula>, are narrower and located closer to the ancestral severity (horizontal black line) for all ASTs, whereas blue PDFs, representing the largest scale <inline-formula><mml:math id="M678" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula>, spread out further as a result of giving more weight to bigger impulses. This underscores our claim that the input distribution, an arbitrary choice, merits sensitivity analysis, and so we carry it through the remaining steps.</p>
</sec>
<sec id="Ch1.S5.SS4">
  <label>5.4</label><title>AST selection criteria: case studies</title>
      <p id="d2e16129">Figure <xref ref-type="fig" rid="F10"/> display the criteria proposed in Sect. <xref ref-type="sec" rid="Ch1.S2.SS4"/> that might help determine in which stage of “wave breaking” the severity PDF finds the COAST. The EI and TE criteria shown in Fig. <xref ref-type="fig" rid="F10"/>a and b both exhibit non-monotonic behavior by design, maximizing at COASTs denoted <inline-formula><mml:math id="M679" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">EI</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M680" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">TE</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> (see Sect. <xref ref-type="sec" rid="Ch1.S2.SS4"/>). The AST dependence can be heuristically understood in light of the PDFs in Fig. <xref ref-type="fig" rid="F9"/>:</p>

      <fig id="F10" specific-use="star"><label>Figure 10</label><caption><p id="d2e16167">Ensemble dispersion indicators as a function of AST, again for the same case study as Fig. <xref ref-type="fig" rid="F7"/>b: <bold>(a)</bold> expected improvement EI, <bold>(b)</bold> thresholded entropy TE, <bold>(c)</bold> local and <bold>(d)</bold> global correlations. Colors indicate input scales <inline-formula><mml:math id="M681" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula>, from small (red: <inline-formula><mml:math id="M682" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.06</mml:mn></mml:mrow></mml:math></inline-formula>) to large (blue: <inline-formula><mml:math id="M683" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula>). In <bold>(a, b)</bold>, vertical bars mark the respective optimal ASTs, which may depend on the scale. In <bold>(c, d)</bold>, horizontal dashed lines are positioned at <inline-formula><mml:math id="M684" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mfenced close=")" open="("><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, corresponding to the rule of thumb from <xref ref-type="bibr" rid="bib1.bibx17" id="text.94"/>.</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f10.png"/>

        </fig>

      <p id="d2e16252"><list list-type="bullet">
            <list-item>

      <p id="d2e16258">At small AST, the narrow PDFs have a relatively high <italic>probability</italic> of improvement over the ancestor <inline-formula><mml:math id="M685" display="inline"><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:mo>∼</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, but only by small amounts, hence a small EI. By a similar token, the TE terms in Eq. (<xref ref-type="disp-formula" rid="Ch1.E28"/>) are almost all positive because the PDF is situated well above <inline-formula><mml:math id="M686" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>, but being concentrated in a small number of bins makes its information content low.</p>
            </list-item>
            <list-item>

      <p id="d2e16292">At intermediate ASTs of 10–20 d, the PDFs remain roughly centered at the ancestor's severity, meaning that improvements remain highly probable, but are larger when they happen thanks to the long upper tails, contributing to a large EI. Meanwhile, both upper and lower tails contribute to a large TE, which does not directly favor exceptionally high severities but rather <italic>diverse</italic>  severities that are <italic>high enough</italic> to exceed <inline-formula><mml:math id="M687" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>.</p>
            </list-item>
            <list-item>

      <p id="d2e16311">At large AST past <inline-formula><mml:math id="M688" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">25</mml:mn></mml:mrow></mml:math></inline-formula> d, the PDFs have diminishing mass above <inline-formula><mml:math id="M689" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>, let alone above the ancestor severity <inline-formula><mml:math id="M690" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mi>n</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, which zeros out most of the contributions to both EI and TE.</p>
            </list-item>
          </list>The COAST can change with the scale <inline-formula><mml:math id="M691" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula>: even though the overall shapes of TE and EI do not change very much, the location of their maxima might. Fortunately, we will find changes in scale for <inline-formula><mml:math id="M692" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>≳</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula> to have negligible impact.</p>
      <p id="d2e16367">Figure <xref ref-type="fig" rid="F10"/>c and d displays two versions of pattern correlation <inline-formula><mml:math id="M693" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula>, defined in Sect. <xref ref-type="sec" rid="Ch1.S2.SS4"/> for an arbitrary field <inline-formula><mml:math id="M694" display="inline"><mml:mi>F</mml:mi></mml:math></inline-formula>: the “global correlation” <inline-formula><mml:math id="M695" display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>[</mml:mo><mml:mi>c</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> uses the whole two-dimensional upper-layer concentration field <inline-formula><mml:math id="M696" display="inline"><mml:mrow><mml:mi>F</mml:mi><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, and the “local correlation” <inline-formula><mml:math id="M697" display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>[</mml:mo><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> uses only the single-latitude transect <inline-formula><mml:math id="M698" display="inline"><mml:mrow><mml:mi>F</mml:mi><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> at the target latitude <inline-formula><mml:math id="M699" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. Both drop off steadily with AST, although local correlation fluctuates more due to averaging a smaller spatial region. The influence of perturbation scale <inline-formula><mml:math id="M700" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula> enters at the ensemble-averaging step, where the <inline-formula><mml:math id="M701" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula>th member's pattern correlation <inline-formula><mml:math id="M702" display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>[</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> is weighted by <inline-formula><mml:math id="M703" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">ω</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mi>W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. Since smaller perturbations take longer to grow, smaller input scales lead to slower dropoff of <inline-formula><mml:math id="M704" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> with <inline-formula><mml:math id="M705" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> – but only at short lead times, where errors are still tiny. Beyond <inline-formula><mml:math id="M706" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:math></inline-formula> and 10 d for global and local correlations respectively, decorrelation proceeds at a similar rate with respect to increasing AST for all scales. The nominal threshold <inline-formula><mml:math id="M707" display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mfenced close=")" open="("><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:mrow></mml:math></inline-formula> is marked in both, and gives a similar AST for local and global correlations but generally longer than implied by EI or TE.</p>
</sec>
<sec id="Ch1.S5.SS5">
  <label>5.5</label><title>AST selection criteria: aggregate results</title>
      <p id="d2e16635">Figure <xref ref-type="fig" rid="F11"/> goes beyond the case study to show dispersion indicators averaged across all ancestors. The coefficients of determination for linear and quadratic models (Fig. <xref ref-type="fig" rid="F11"/>a) are farther apart on average than they are for the case study (see Fig. <xref ref-type="fig" rid="F8"/>e), the quadratic model enjoying much higher skill especially during the pivotal 10–20 d range when EI and TE tend to maximize (Fig. <xref ref-type="fig" rid="F11"/>b and c). This validates our choice to use the quadratic model. Overall, the EI, TE, global and local correlations (Fig. <xref ref-type="fig" rid="F11"/>b–e) are similar on average to the case study, but smoother.</p>

      <fig id="F11" specific-use="star"><label>Figure 11</label><caption><p id="d2e16650">Ensemble dispersion metrics averaged across ancestors at <inline-formula><mml:math id="M708" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">26</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">64</mml:mn><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>. <bold>(a)</bold> Coefficients of determination for linear (cyan) and quadratic (orange) regressions, averaged across ancestors. <bold>(b–e)</bold> Same quantities as in Fig. <xref ref-type="fig" rid="F10"/>a–d but averaged across ancestors, with only the largest and smallest scales shown (red: <inline-formula><mml:math id="M709" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.06</mml:mn></mml:mrow></mml:math></inline-formula>, blue: <inline-formula><mml:math id="M710" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula>). Shaded regions indicate variation across ancestors, which we quantify using <italic>truncated upper- and lower-means</italic>. For example, the upper truncated mean for correlation <inline-formula><mml:math id="M711" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> is the mean of <inline-formula><mml:math id="M712" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> across ancestors with above-average <inline-formula><mml:math id="M713" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula>: <inline-formula><mml:math id="M714" display="inline"><mml:mrow><mml:mi mathvariant="double-struck">E</mml:mi><mml:mo>[</mml:mo><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>|</mml:mo><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>&gt;</mml:mo><mml:mi mathvariant="double-struck">E</mml:mi><mml:mo>[</mml:mo><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>]</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>, separately at each AST. We choose truncated means to avoid the awkward properties of more standard measures of spread: interquartile ranges would be erratic for the relatively small sample size of ancestors, whereas standard deviation envelopes can misleadingly fall outside the bounds [0, ] to which <inline-formula><mml:math id="M715" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> is constrained.</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f11.png"/>

        </fig>

      <p id="d2e16773">Note, however, that these averaged dispersion indicators are never used directly in AST selection: the COASTs are chosen separately for each ancestor as the maximizer of its own EI or TE, or at the longest AST such that global or local correlation is above <inline-formula><mml:math id="M716" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>. This nuance is further illustrated in Fig. <xref ref-type="fig" rid="F12"/>a and b, where (EI, TE) are plotted as joint functions of AST and input scale. Whereas the heatmaps are averages over ancestors of EI and TE just like Fig. <xref ref-type="fig" rid="F9"/>c.ii and iii, the red circles indicate the fraction of ancestors whose EI or TE is maximized at a particular AST for each particular scale. We call the red circle sizes “COAST frequencies”. For example, at <inline-formula><mml:math id="M717" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula>, the mean EI maximizes at <inline-formula><mml:math id="M718" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">14</mml:mn></mml:mrow></mml:math></inline-formula> d, and that same AST is the most frequent COAST. However, the second-largest circle indicates that <inline-formula><mml:math id="M719" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> d is a close second-most frequent COAST according to EI. At the same scale, the most frequent COASTs according to TE are <inline-formula><mml:math id="M720" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">18</mml:mn></mml:mrow></mml:math></inline-formula> and 20. In general, we gather two patterns from Fig. <xref ref-type="fig" rid="F12"/>a and b: the average EI and TE values (i) are well-correlated with their corresponding COAST frequencies, and (ii) both change rapidly at small scales but stabilize above <inline-formula><mml:math id="M721" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula>, at which point the input distributions are close enough to uniform over the <inline-formula><mml:math id="M722" display="inline"><mml:mi>W</mml:mi></mml:math></inline-formula>-disc. This relative stability is reassuring, but we generally prefer smaller noise which disturbs the model dynamics less. To balance these considerations, we select <inline-formula><mml:math id="M723" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula> as the nominal scale to examine more closely going forward.</p>

      <fig id="F12" specific-use="star"><label>Figure 12</label><caption><p id="d2e16876">Three optimization landscapes as joint functions of AST and input scale for <inline-formula><mml:math id="M724" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">26</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">64</mml:mn><mml:mo>)</mml:mo><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>: <bold>(a)</bold> expected improvement (EI), <bold>(b)</bold> thresholded entropy (TE), and <bold>(c)</bold> <inline-formula><mml:math id="M725" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> divergence between the MoCTail and ground truth. Lighter gray indicates better performance – smaller <inline-formula><mml:math id="M726" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> divergence or larger EI and TE – and the corresponding “best” ASTs consistently fall in the <italic>interior</italic> of the domain, across all scales. Contours of local correlation <inline-formula><mml:math id="M727" display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>[</mml:mo><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> are overlaid in <bold>(c)</bold>, giving a rough map of correspondence between correlation levels and AST. The size of red circles in <bold>(a, b)</bold> indicate the “COAST frequency”: the fraction of ancestors whose (EI, TE) is maximized at the corresponding AST while holding the scale fixed. Note the multiple local maxima in mean EI and TE (as indicated by the lightness of the gray color in <bold>a, b</bold>), each of which is the global maximum for some significant set of ancestors.</p></caption>
          <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f12.png"/>

        </fig>

</sec>
</sec>
<sec id="Ch1.S6">
  <label>6</label><title>Results: climatological severity distributions</title>
      <p id="d2e16990">Having explained the construction of conditional distributions, we now aggregate across ancestors using MoCTail and PoPTail estimators to obtain our estimates of the climatological severity distribution from the boosted ensembles. We evaluate the skill of each AST selection rule by the <inline-formula><mml:math id="M728" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> divergence of the resulting climatological distribution from ground truth as obtained from the long DNS. We first restrict attention to extremes at <inline-formula><mml:math id="M729" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> and then assess a broader swath of latitudes.</p>
      <p id="d2e17025">First, consider the simplest AST selection rule <inline-formula><mml:math id="M730" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, a uniform AST over all ancestors. We have no a priori principle for <inline-formula><mml:math id="M731" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, so we search through all possible values from 2 to 40 d. Figure <xref ref-type="fig" rid="F12"/>c displays the resulting <inline-formula><mml:math id="M732" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> divergence between the MoCTail and ground truth, as a function of <inline-formula><mml:math id="M733" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> and input scale. A clear optimum emerges at <inline-formula><mml:math id="M734" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">14</mml:mn></mml:mrow></mml:math></inline-formula> d and persists for all scales <inline-formula><mml:math id="M735" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>≳</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula>, after rapid changes across smaller scales. Red contours also indicate the local correlation, averaged across ancestors to give a smooth and monotonic function of AST. In terms of correlation, the COAST <inline-formula><mml:math id="M736" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">14</mml:mn></mml:mrow></mml:math></inline-formula> d corresponds to <inline-formula><mml:math id="M737" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">0.92</mml:mn></mml:mrow></mml:math></inline-formula> depending on the scale, which is slightly above the nominal value <inline-formula><mml:math id="M738" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mfenced close=")" open="("><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.86</mml:mn></mml:mrow></mml:math></inline-formula>, meaning one should split a little bit closer to the event than the rule of thumb implies.</p>
      <p id="d2e17160">Overall, the <inline-formula><mml:math id="M739" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> landscape (inverted) roughly aligns with the EI and TE landscapes, as do their respective optima. This is remarkable and encouraging: allowing each ancestor to determine its own COAST independently, with no knowledge of the ground truth or even other ancestors' COASTs, leads to a similar solution as the policy of synchronizing them all. Boosting based on EI and TE, therefore, is more parallelizable (optimizations are decoupled across ancestors), extensible (new ancestors can be added without changing the optimal split times for pre-existing ancestors), and interpretable (one can see the optimum clearly based on a case study, without complicated averaging procedures across initial conditions).</p>

      <fig id="F13" specific-use="star"><label>Figure 13</label><caption><p id="d2e17177">CCDF approximations by various mixing criteria and associated errors, at the latitude <inline-formula><mml:math id="M740" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula> and input scale choice <inline-formula><mml:math id="M741" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula>. <bold>(a.i–v)</bold> Tail CCDFs by various estimates using only <inline-formula><mml:math id="M742" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">11</mml:mn></mml:mrow></mml:math></inline-formula> ancestors, with lines showing medians and bands showing interquartile ranges across many size-11 subsamples of the total set of 32 ancestors. Dotted lines with open circles are PoPTails, while solid lines with crosses are MoCTails. Dashed black lines show the ground truth estimate. Panel <bold>(a.i)</bold> shows the tail approximation using a single uniform AST indicated at the top: 14 d for MoCTail and 8 d (parenthesized) for PoPTail. Panels <bold>(a.ii, iii)</bold> show the tail approximations using thresholds of (local, global) correlations as AST selection criteria. Panels <bold>(a.iv,v)</bold> show the tail approximations obtained by maximizing (EI, TE), which unlike the other criteria do not rely on knowing the ground truth to select ancestor-wise ASTs, either directly or through threshold choice. <bold>(a.vi)</bold> also shows estimates from DNS with equal cost to boosting on 11 ancestors (black stars, gray envelope) and DNS from only <inline-formula><mml:math id="M743" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">11</mml:mn></mml:mrow></mml:math></inline-formula> peaks (brown circles and envelope), in both cases estimating uncertainty by longitudinal rotation. The GPD fit to ground truth is shown as a gray curve.  In <bold>(a.i–iii)</bold>, the thresholds shown at the top (PoPTail thresholds parenthesized) are obtained by using all 32 ancestors, but the CCDFs displayed each choose an AST to minimize <inline-formula><mml:math id="M744" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> divergence from ground truth, separately for each subsample. Because this requires ground truth knowledge, the <inline-formula><mml:math id="M745" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> divergences must be interpreted as practical lower bounds. The 90 % error bar applies to the MoCTail estimator only, and comes from bootstrapping on entire “families” or in other words mixture components (not individual descendants) and choosing the best AST (by <inline-formula><mml:math id="M746" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> divergence) for each particular subsample. The error bar widths, too, must then represent lower bounds. <bold>(b)</bold> <inline-formula><mml:math id="M747" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values for the estimator directly above in each case as a function of <inline-formula><mml:math id="M748" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>, and compared with DNS at equal cost and equal <inline-formula><mml:math id="M749" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>. DNS does not run long enough to equal the total cost accrued by boosting 32 ancestors, so the black curve stops before the others.</p></caption>
        <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f13.png"/>

      </fig>

      <p id="d2e17324">Figure <xref ref-type="fig" rid="F13"/> makes a tail-to-tail comparison between all the AST selection rules (a.i–v: <inline-formula><mml:math id="M750" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M751" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> local and global, <inline-formula><mml:math id="M752" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">EI</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M753" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">TE</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>), fixing the scale to <inline-formula><mml:math id="M754" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula> and (in the case of <inline-formula><mml:math id="M755" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M756" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>) selecting post-hoc the best-performing threshold to set the COASTs. We used subsets of only 11 of the 32 ancestors, resampling such subsets 64 times to obtain medians (solid) and interquartile ranges (shading) on CCDFs. The numerical values of optimal AST and <inline-formula><mml:math id="M757" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> reported above a.i–iii, with PoPTail optima parenthesized, are the optima obtained from <inline-formula><mml:math id="M758" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">32</mml:mn></mml:mrow></mml:math></inline-formula>, i.e., the best estimates of the true optima; they do not necessarily correspond to the values used for plotting with <inline-formula><mml:math id="M759" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">11</mml:mn></mml:mrow></mml:math></inline-formula>, which are optimized separately for each resampling.  The brown CCDF in panel a.vi is the estimate from the unboosted acestors alone (“equal-<inline-formula><mml:math id="M760" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>”), and the black is the estimate from a larger number of ancestors to equal the cost of boosting. The curves underneath in panel (b) show the rate of improvement of <inline-formula><mml:math id="M761" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> with <inline-formula><mml:math id="M762" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>.</p>
      <p id="d2e17465">In terms of quantitative improvements in <inline-formula><mml:math id="M763" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> for a fixed cost (vertical differences between curves), all the rules considered (<inline-formula><mml:math id="M764" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M765" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M766" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">EI</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M767" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">TE</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>) improve substantially upon an equal-<inline-formula><mml:math id="M768" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> DNS and modestly upon an equal-cost DNS. The size of the advantage varies with <inline-formula><mml:math id="M769" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> in the way that we expect from boosting: substantial improvements in <inline-formula><mml:math id="M770" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> with moderate <inline-formula><mml:math id="M771" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>, (<inline-formula><mml:math id="M772" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula>–10) when the DNS has sampled the attractor broadly but sparsely and extremes are within reach by perturbation. The advantage might diminish if <inline-formula><mml:math id="M773" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> increases enough for DNS to see those extremes without perturbation, but we haven't reached that regime yet. MoCTail and PoPTail performances are similar, but not identical: PoPTail seems more suited for threshold-based rules (<inline-formula><mml:math id="M774" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M775" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> local and global in b.i–iii), whereas MoCTail seems more suited for optimization-based rules (<inline-formula><mml:math id="M776" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">EI</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M777" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">TE</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> in b.iv and v). Another way to measure boosting advantage is by “speedup”: given a prescribed <inline-formula><mml:math id="M778" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> error, how much extra simulation is needed with DNS relative to boosting to achieve it. These are the horizontal distances between curves. Across all AST criteria, speedup varies from <inline-formula><mml:math id="M779" display="inline"><mml:mrow><mml:mn mathvariant="normal">1.5</mml:mn><mml:mo>×</mml:mo></mml:mrow></mml:math></inline-formula> to <inline-formula><mml:math id="M780" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>×</mml:mo></mml:mrow></mml:math></inline-formula>, and accelerates sharply as the DNS curve flattens around <inline-formula><mml:math id="M781" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>∼</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> while the boosting curves continue to decrease linearly. These are modest speedups compared to other published rare event algorithms, which report between one and four orders of magnitude speedup depending on the event definition and the algorithm <xref ref-type="bibr" rid="bib1.bibx62 bib1.bibx18" id="paren.95"><named-content content-type="pre">e.g.,</named-content></xref>, but again, we stress that the computational savings here are only incidental to our main goal of characterizing the COAST. Substantial improvements should be possible by targeted optimization and, potentially, repeated rounds of boosting.</p>

      <fig id="F14" specific-use="star"><label>Figure 14</label><caption><p id="d2e17678">Performance of all AST selection criteria, measured by <inline-formula><mml:math id="M782" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> divergence, across all latitudes for <inline-formula><mml:math id="M783" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M784" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> or 11, whichever is nearest to 1/3 the number of ancestors found for the latitude in question (sometimes less than 32). Black line and gray envelope represent the error from the short DNS and its 90 % error bar according to quantiles across longitudes. Panels <bold>(a)</bold>–<bold>(e)</bold> parallel Fig. <xref ref-type="fig" rid="F13"/>a.ii–vi. Solid lines and crosses represent the MoCTail estimator, while dotted lines with open circles represent the PoPTail estimator.</p></caption>
        <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f14.png"/>

      </fig>

      <p id="d2e17731">We selected <inline-formula><mml:math id="M785" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">11</mml:mn></mml:mrow></mml:math></inline-formula> to display the full CCDFs in Fig. <xref ref-type="fig" rid="F13"/>a as the middle range of values tried, and where enough equal-size ancestor subsets are available for uncertainty quantification by bootstrapping. When comparing with DNS CCDFs, all five rules successfully extend the short, equal-<inline-formula><mml:math id="M786" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> DNS tail into a longer tail that tracks closer to the ground truth farther into the extreme severity range. They also all find a larger maximum than even the equal-cost DNS found. However, the threshold-based rules exhibit apparent bias, systematically underestimating probabilities for <inline-formula><mml:math id="M787" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>≳</mml:mo><mml:mn mathvariant="normal">0.64</mml:mn></mml:mrow></mml:math></inline-formula>, whereas the optimization-based rules are both more accurate and more confident. Our hypothesis for this behavior is that each ancestor has its own predictability timescale, physically linked to the frequency of the wave responsible for that particular event, and that these ancestor-varying timescales cannot all be respected at once by a single, globally imposed time like <inline-formula><mml:math id="M788" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, or even a globally imposed correlation threshold to dictate <inline-formula><mml:math id="M789" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>. The optimization-based criteria <inline-formula><mml:math id="M790" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">EI</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M791" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">TE</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> are tailored to the ancestor, and might in fact be choosing those predictability timescales implicitly. This is only speculation, however, and must be validated with more detailed analysis than fits in our present scope.</p>
      <p id="d2e17816">The COASTs identified by all rules lie strictly between the shortest and longest ASTs considered. For example, <inline-formula><mml:math id="M792" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">14</mml:mn></mml:mrow></mml:math></inline-formula> according to the MoCTail estimator (using all <inline-formula><mml:math id="M793" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">32</mml:mn></mml:mrow></mml:math></inline-formula> ancestors). By comparing with Fig. <xref ref-type="fig" rid="F12"/>c, we recognize 14 as the minimum of the <inline-formula><mml:math id="M794" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> landscape for <inline-formula><mml:math id="M795" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula> (and larger scales), with an approximate local-correlation equivalent of 0.98.</p>
      <p id="d2e17871">Similar patterns hold across target latitudes, but with some notable caveats. The <inline-formula><mml:math id="M796" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> divergences of each selection rule are plotted in Fig. <xref ref-type="fig" rid="F14"/>, of which Fig. <xref ref-type="fig" rid="F13"/>c is one slice. The most obvious and important point holds: perturbed ensembles improve upon the DNS equal-<inline-formula><mml:math id="M797" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> estimate, for almost all latitudes and AST selection rules, and they also improve on the equal cost estimate in many cases. But <inline-formula><mml:math id="M798" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">EI</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> is less reliable; its favorable performance noted above in Fig. <xref ref-type="fig" rid="F13"/> is peculiar to the latitude <inline-formula><mml:math id="M799" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>. At some other latitudes, it is similar or worse in skill than equal-<inline-formula><mml:math id="M800" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> and even equal-cost DNS. Even so, it tends to fail by <italic>overestimating</italic> severities, which we have confirmed by examining the corresponding CCDFs (not shown), and thus it may serve as a useful upper bound. The MoCTail and PoPTail estimators are similar in quality across latitudes, but as observed in Fig. <xref ref-type="fig" rid="F13"/>, PoPTail has an advantage with threshold-based rules (<inline-formula><mml:math id="M801" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M802" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> local and global) whereas MoCTail performs better with optimization-based rules (<inline-formula><mml:math id="M803" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">EI</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M804" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">TE</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>).</p>

      <fig id="F15" specific-use="star"><label>Figure 15</label><caption><p id="d2e17990">Optimization landscapes and optimal ASTs across latitudes, again fixing the input scale to <inline-formula><mml:math id="M805" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula>. <bold>(a)</bold> Frequencies of <italic>conditionally</italic> optimal ASTs (COASTs), in the maximum-thresholded entropy sense, at each latitude, with whiter shading indicating higher frequency. E.g., at <inline-formula><mml:math id="M806" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>/</mml:mo><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">26</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">64</mml:mn></mml:mrow></mml:math></inline-formula>, the two adjacent bright pixels at AST <inline-formula><mml:math id="M807" display="inline"><mml:mrow><mml:mo>=</mml:mo><mml:mn mathvariant="normal">18</mml:mn></mml:mrow></mml:math></inline-formula>, 20 indicate that for a large fraction of ancestors, the highest-entropy descendant ensemble is the one launched 18 or 20 d in advance of the peak. <bold>(b)</bold> Thresholded entropy as a function of AST, normalized to the range 0–1 (black-white, so brighter is better) separately at each latitude. This landscape is smoother than <inline-formula><mml:math id="M808" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and varies less dramatically with latitude, but exhibits directionally similar trends. <bold>(c)</bold> <inline-formula><mml:math id="M809" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> divergence as a function of AST and latitude, normalized to the range 0–1 (white-black, so brighter is better) separately at each latitude so that different latitudes are visually comparable. Red crosses mark the optimal AST at each latitude. Cyan (solid, dashed) curves mark the AST at which the (global, local) correlations, averaged across ancestors, reach <inline-formula><mml:math id="M810" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mfenced close=")" open="("><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. This nominal choice is based on <xref ref-type="bibr" rid="bib1.bibx17" id="text.96"/>, and falls squarely in the middle of the latitude-dependent ASTs. <bold>(d)</bold> Contour map of local correlation, averaged over ancestors, as a function of AST and latitude. The levels range from 0.22 (left-most dotted black curve, fragmented by boundary) to 0.99 (rightmost solid black curve), evenly spaced in a stretched sigmoid scale (levels are shown only for qualitative purposes). The reference level <inline-formula><mml:math id="M811" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mfenced close=")" open="("><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> appears dashed in cyan. <bold>(e)</bold> Bottom topography for reference.</p></caption>
        <graphic xlink:href="https://npg.copernicus.org/articles/33/233/2026/npg-33-233-2026-f15.png"/>

      </fig>

      <p id="d2e18129">The various estimators and AST selection rules have differences in skill, but a more important commonality: all of them indicate that <italic>an optimal advance split time exists</italic> that is strictly positive, which is not a foregone conclusion in light of standard rare event algorithms like adaptive multilevel splitting <xref ref-type="bibr" rid="bib1.bibx38" id="paren.97"><named-content content-type="pre">AMS;</named-content></xref> without “trying early”. Figure <xref ref-type="fig" rid="F12"/> shows clear intermediate optima when targeting the single latitude <inline-formula><mml:math id="M812" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>, and Fig. <xref ref-type="fig" rid="F15"/> extends this result to all latitudes by stacking together cross-sections of the per-latitude counterparts of Fig. <xref ref-type="fig" rid="F12"/> at <inline-formula><mml:math id="M813" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.24</mml:mn></mml:mrow></mml:math></inline-formula>. The COAST frequency and mean-TE landscapes have broad ridges that meander slowly in AST space with latitude, approximately in phase with topography: smaller ASTs are favored at <inline-formula><mml:math id="M814" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>≈</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">26</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>, where topography is minimized and meridional wind shear is negative, and larger ASTs are favored at <inline-formula><mml:math id="M815" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>≈</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">38</mml:mn><mml:mn mathvariant="normal">64</mml:mn></mml:mfrac></mml:mstyle><mml:mi>L</mml:mi></mml:mrow></mml:math></inline-formula>, where topography is maximized and meridional wind shear is positive. A similar pattern, but with bigger swings, is seen in the <inline-formula><mml:math id="M816" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> landscape. All these patterns are a bit noisy, especially for the COAST frequencies and <inline-formula><mml:math id="M817" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>-COAST locations, since both come from an inherently unstable “argmax” function. Nonetheless, the detailed latitude dependence is only a secondary effect on top of the main point, which is clearly demonstrated: splitting is most effective at intermediate ASTs rather than very short or long ASTs.</p>
      <p id="d2e18244">We can also now evaluate the <inline-formula><mml:math id="M818" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> rule from <xref ref-type="bibr" rid="bib1.bibx17" id="text.98"/> in this broader multi-latitude context, though here we simplify the procedure by first averaging <inline-formula><mml:math id="M819" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> across ancestors and then calculating <inline-formula><mml:math id="M820" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> as a threshold-crossing time of that average, which we call <inline-formula><mml:math id="M821" display="inline"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">8</mml:mn></mml:mrow><mml:mi mathvariant="normal">U</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, rather than averaging times <inline-formula><mml:math id="M822" display="inline"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>n</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msubsup><mml:mo>[</mml:mo><mml:msup><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mo>(</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> across ancestors. The same conclusion holds either way. The AST values <inline-formula><mml:math id="M823" display="inline"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">8</mml:mn></mml:mrow><mml:mi mathvariant="normal">U</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> are overlaid on the <inline-formula><mml:math id="M824" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> heatmap (Fig. <xref ref-type="fig" rid="F15"/>d) as blue curves. The solid curve, representing a level set of ancestor-averaged global correlation, should be constant with latitude and varies only due to sampling errors. Likewise, the dashed curve, representing a level set of ancestor-averaged local correlation, should be approximately symmetric with respect to latitude because of the symmetric tracer boundary conditions and approximate mirror symmetry in velocities, as should all the level sets in panel c. Since the <inline-formula><mml:math id="M825" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> varies differently with latitude, exhibiting roughly odd symmetry about the midline, the <inline-formula><mml:math id="M826" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> rule cannot possibly be optimal for all latitudes simultaneously. More fundamentally, the COAST depends on more than just a generic metric for ensemble dispersion: it must also depend on the features of the tail being sampled, which in this case is the only possible source of meridional variation (see Fig. <xref ref-type="fig" rid="F4"/>).</p>
      <p id="d2e18396">However, both versions of <inline-formula><mml:math id="M827" display="inline"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">8</mml:mn></mml:mrow><mml:mi mathvariant="normal">U</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> run right through the mean position of the meandering <inline-formula><mml:math id="M828" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> valley and associated COASTs, performing about as well as any such highly-constrained synchronized <inline-formula><mml:math id="M829" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mtext>U</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> could do. Thus, the <inline-formula><mml:math id="M830" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> rule retains its relevance as a starting point for more refined optimization more tailored to the event, at least for this QG system. Whether the <inline-formula><mml:math id="M831" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> rule generalizes to more heterogeneous systems as the “optimal synchronized AST” requires further investigation. We found it provides some guidance for temperature and precipitation extremes in an idealized general circulation model, but overestimated the optimal AST in both cases <xref ref-type="bibr" rid="bib1.bibx18" id="paren.99"/>.</p>
</sec>
<sec id="Ch1.S7" sec-type="conclusions">
  <label>7</label><title>Conclusion</title>
      <p id="d2e18473">Rare event sampling is a promising strategy to study extreme weather more efficiently with computer models by repeatedly cloning, perturbing, and re-simulating the most extreme events in an ensemble while tracking statistical weights. However, sudden and transient events such as mid-latitude precipitation present a particular challenge for rare event algorithms, leaving ensembles little time to diversify before the event passes by. Ensemble boosting <xref ref-type="bibr" rid="bib1.bibx23 bib1.bibx22 bib1.bibx20 bib1.bibx4" id="paren.100"/> and “trying-early adaptive multilevel splitting” <xref ref-type="bibr" rid="bib1.bibx17 bib1.bibx18" id="paren.101"><named-content content-type="pre">TEAMS;</named-content></xref> get around this problem by perturbing events farther in advance by some <italic>advance split time</italic> (AST) to allow ensembles to spread, but this opens a pivotal question: how should we choose the AST for maximal accuracy and efficiency? If AST is too short, perturbations cannot grow enough to give useful samples, and if it is too long, they regress to climatology. To deploy advance-splitting methods at scale, we need more reliable ways to set the AST as well as other hyperparameters. The AST itself may be a property of the physical system, not of algorithmic parameters like ensemble size, which would simplify its optimization while also yielding physical insight into causal mechanisms of the event.</p>
      <p id="d2e18487">In this paper, we have pursued this hypothesis and established the <italic>conditionally optimal advance split time</italic> (COAST) as a quantity more intrinsic to the dynamical system than to the idiosyncrasies of a particular rare event algorithm by removing the confounding effect of randomly selecting ensemble members to split. The COAST also depends on the target observable of interest, the imposed distribution over perturbations, and the initial conditions which may vary in their predictability. We formulate COAST mathematically as the solution to an optimization problem, and through a systematic boosting-based sampling and estimation procedure we discern the optimization landscape in the context of an idealized physical model: a baroclinically unstable quasi-geostrophic (QG) flow, with local passive tracer fluctuations as our extreme event of interest. To facilitate more efficient rare event sampling applications, we have further proposed various parsimonious rules for finding the COAST, and evaluated these rules empirically in the QG model.</p>
      <p id="d2e18493">We have four conclusions to report: <list list-type="order"><list-item>
      <p id="d2e18498">A boosting procedure, generated with a suitable AST, can well-approximate a probability distribution's tail using either of two estimators: “MoCTail”, which we formulate here, and “PoPTail”, due to <xref ref-type="bibr" rid="bib1.bibx4" id="text.102"/>.</p></list-item><list-item>
      <p id="d2e18505">The optimal AST is strictly greater than zero and varies slowly with latitude, appearing  smaller in regions of negative meridional wind shear (e.g., the northern edges of westerly jets) and larger in regions of positive meridional wind shear (e.g., the southern edges of westerly jets).</p></list-item><list-item>
      <p id="d2e18509">Several different rules for selecting the COAST are equally effective. Beyond the simplest option of setting a single fixed AST (called <inline-formula><mml:math id="M832" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>), one can set a conditional AST (called <inline-formula><mml:math id="M833" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>) by thresholding on ensemble dispersion. Both <inline-formula><mml:math id="M834" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M835" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">PC</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> perform similarly at tail reconstruction, but both unfortunately require a threshold choice, which there is no established method for selecting. Here we selected thresholds post hoc with knowledge of the ground truth. The rule proposed in <xref ref-type="bibr" rid="bib1.bibx17" id="text.103"/>- – that <inline-formula><mml:math id="M836" display="inline"><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mi mathvariant="normal">U</mml:mi></mml:msup><mml:mo>≈</mml:mo></mml:mrow></mml:math></inline-formula> the time until ensembles disperse to <inline-formula><mml:math id="M837" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">8</mml:mn></mml:mfrac></mml:mstyle></mml:math></inline-formula> their saturation value – appears to be a good single choice, but further improvement is possible by tailoring AST to the target location and the initial condition.</p></list-item><list-item>
      <p id="d2e18585">An attractive alternative to thresholding is <italic>optimizing</italic> some functional of the ensemble severity distribution designed to favor both high extremes and wide spread. We have found a suitable functional in <italic>thresholded entropy</italic> (TE), the expected information contained in that part of the ensemble's severity distribution exceeding the pre-selected threshold. Optimization-based AST rules open the door to using Bayesian optimization strategies to home in on the COASTs adaptively during an actual rare event sampling algorithm, avoiding the exhaustive grid searches we have performed here.</p></list-item></list></p>
      <p id="d2e18594">There are many important avenues of research indicated by the present study, both methodology-oriented and science-oriented. On the algorithmic front, it remains to be seen whether thresholded entropy succeeds at matching tail statistics in general systems, but the consistency across different targets within the QG model is encouraging. We suspect that <italic>some</italic> similar objective function over distributions is broadly applicable. Furthermore, the <italic>shape</italic> of perturbations is a possibly very important lever on the potency of perturbations, acting in concert with their timing. While we limited our present study to a two-dimensional perturbation space based on linearized dynamics about a baroclinically unstable background flow, a natural extension would be to use flow-dependent singular vectors as in operational weather forecasting. By design, they effect faster ensemble spread in the small-perturbation regime; however, it must be checked if their advantages carry into the finite-amplitude regime needed for effective rare event sampling. Computational tools such as adjoints, especially in novel machine learning models, invite the use of gradient-based optimization <xref ref-type="bibr" rid="bib1.bibx77 bib1.bibx76 bib1.bibx80" id="paren.104"/>. Since exhaustive grid search over ASTs and perturbation spaces is not an option when deploying rare event algorithms in practice, we are actively pursuing efficient optimization strategies, which are important to make use of this research.</p>
      <p id="d2e18607">Intriguing dynamical questions also arise from the latitude dependence of the COAST, which can be seen as a predictability index tailored to extremes: how do the physical parameters such as topography, rotation rate, and the spatial domain affect COAST? Is the effect entirely explainable through the extreme value statistics, as we have speculated, or can two similarly shaped tails belie extremely different COAST behavior? These questions merit further parameter exploration, both within and beyond the quasigeostrophic framework. We expect to draw insight from recent theoretical advances relating extreme value theory to the geometry of chaotic attractors <xref ref-type="bibr" rid="bib1.bibx42" id="paren.105"/>.</p>
      <p id="d2e18613">In summary, our work makes empirical progress on important theoretical and algorithmic questions regarding the  probabilities of the most extreme weather events. Perturbed ensemble forecasts of individual weather events are distinct from the climatological distribution, but here we have given quantitative evidence for a relationship between the two – so long as the perturbations are well-timed – that can be exploited for efficient risk analysis via judicious perturbed simulations. Our work has elucidated what it means to be “well-timed”, and furthermore provided quantitative optimization criteria for perturbation timing. Only with this basic pre-requisite information on what to optimize, should we proceed to invest effort into optimizing efficiently.</p>
</sec>

      
      </body>
    <back><app-group>

<app id="App1.Ch1.S1">
  <label>Appendix A</label><title>Langevin model</title>
      <p id="d2e18628">The schematic in Fig. <xref ref-type="fig" rid="F1"/> comes from Langevin dynamics, consisting of a single particle moving in one dimension with position <inline-formula><mml:math id="M838" display="inline"><mml:mrow><mml:mi>X</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and momentum <inline-formula><mml:math id="M839" display="inline"><mml:mrow><mml:mi>Y</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> subject to a potential gradient force, friction, and stochastic Gaussian white-noise forcing <inline-formula><mml:math id="M840" display="inline"><mml:mrow><mml:mi>W</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>:

              <disp-formula specific-use="align" content-type="numbered"><mml:math id="M841" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="App1.Ch1.S1.E44"><mml:mtd><mml:mtext>A1</mml:mtext></mml:mtd><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi mathvariant="normal">d</mml:mi><mml:mi>X</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>m</mml:mi></mml:mfrac></mml:mstyle><mml:mi>Y</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="App1.Ch1.S1.E45"><mml:mtd><mml:mtext>A2</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi mathvariant="normal">d</mml:mi><mml:mi>Y</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:mo>-</mml:mo><mml:msup><mml:mi>V</mml:mi><mml:mo>′</mml:mo></mml:msup><mml:mo>(</mml:mo><mml:mi>X</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="italic">γ</mml:mi><mml:mi>Y</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>]</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">d</mml:mi><mml:mi>W</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

        where the potential function <inline-formula><mml:math id="M842" display="inline"><mml:mrow><mml:mi>V</mml:mi><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> has a quadratic core and logarithmic wings,

          <disp-formula id="App1.Ch1.S1.E46" content-type="numbered"><label>A3</label><mml:math id="M843" display="block"><mml:mrow><mml:mi>V</mml:mi><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mfenced close="" open="{"><mml:mtable class="array" columnalign="left left"><mml:mtr><mml:mtd><mml:mrow><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi mathvariant="italic">β</mml:mi></mml:mfrac></mml:mstyle></mml:mstyle><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>/</mml:mo><mml:mi mathvariant="italic">ϵ</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mstyle></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>|</mml:mo><mml:mi>x</mml:mi><mml:mo>|</mml:mo><mml:mo>≤</mml:mo><mml:mi mathvariant="italic">ϵ</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi mathvariant="italic">β</mml:mi></mml:mfrac></mml:mstyle></mml:mstyle><mml:mi>log⁡</mml:mi><mml:mo>|</mml:mo><mml:mi>x</mml:mi><mml:mo>|</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>|</mml:mo><mml:mi>x</mml:mi><mml:mo>|</mml:mo><mml:mo>&gt;</mml:mo><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></disp-formula>

        which leads to a heavy-tailed (in <inline-formula><mml:math id="M844" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>) steady-state probability density <inline-formula><mml:math id="M845" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo><mml:mo>∝</mml:mo><mml:mi>exp⁡</mml:mi><mml:mfenced close="]" open="["><mml:mrow><mml:mo>-</mml:mo><mml:mi mathvariant="italic">β</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>V</mml:mi><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msup><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>m</mml:mi></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>∼</mml:mo><mml:mo>|</mml:mo><mml:mi>x</mml:mi><mml:msup><mml:mo>|</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for large <inline-formula><mml:math id="M846" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mi>x</mml:mi><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula>. Constant parameters are <inline-formula><mml:math id="M847" display="inline"><mml:mrow><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula> for friction, <inline-formula><mml:math id="M848" display="inline"><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1.2</mml:mn></mml:mrow></mml:math></inline-formula> for mass, <inline-formula><mml:math id="M849" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.005</mml:mn></mml:mrow></mml:math></inline-formula> for stochastic forcing strength, <inline-formula><mml:math id="M850" display="inline"><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.25</mml:mn></mml:mrow></mml:math></inline-formula> for the extent of the quadratic core of the potential, <inline-formula><mml:math id="M851" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3.1</mml:mn></mml:mrow></mml:math></inline-formula> which sets the tail weight, and <inline-formula><mml:math id="M852" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>m</mml:mi><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>/</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> which is the inverse temperature.</p>
</app>
  </app-group><notes notes-type="codeavailability"><title>Code availability</title>

      <p id="d2e19112">The code to generate all results, including simulation, statistical analysis, and plotting, is available at the Zenodo repository COAST (<ext-link xlink:href="https://doi.org/10.5281/zenodo.17355215" ext-link-type="DOI">10.5281/zenodo.17355215</ext-link>, <xref ref-type="bibr" rid="bib1.bibx33" id="altparen.106"/>). Justin Finkel is happy to provide guidance on use and extension of the code upon request.</p>
  </notes><notes notes-type="dataavailability"><title>Data availability</title>

      <p id="d2e19124">All data used here was generated using the code we have available in the Zenodo repository, COAST, accessible at <ext-link xlink:href="https://doi.org/10.5281/zenodo.17355215" ext-link-type="DOI">10.5281/zenodo.17355215</ext-link> <xref ref-type="bibr" rid="bib1.bibx33" id="paren.107"/>.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e19136">JF formulated the initial study, carried out numerical computations, and wrote the initial draft. PO and JF both contributed to refining the methodology and substantially revising the manuscript.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e19142">The contact author has declared that neither of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e19149">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e19155">We thank Glenn Flierl, Andre Souza, and Talia Tamarin-Brodsky for helpful discussions and advice on theoretical and computational aspects of this work. This research is part of the MIT Climate Grand Challenge on Weather and Climate Extremes. Computations were performed on the MIT Engaging cluster.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e19160">This research was supported by Schmidt Sciences.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e19166">This paper was edited by Stéphane Vannitsem and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Au and Beck(2001)</label><mixed-citation>Au, S.-K. and Beck, J. L.: Estimation of small failure probabilities in high dimensions by subset simulation, Probab. Eng. Mech., 16, 263–277, <ext-link xlink:href="https://doi.org/10.1016/S0266-8920(01)00019-4" ext-link-type="DOI">10.1016/S0266-8920(01)00019-4</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Baars et al.(2021)Baars, Castellana, Wubs, and Dijkstra</label><mixed-citation>Baars, S., Castellana, D., Wubs, F., and Dijkstra, H.: Application of adaptive multilevel splitting to high-dimensional dynamical systems, J. Comput. Phys., 424, 109876, <ext-link xlink:href="https://doi.org/10.1016/j.jcp.2020.109876" ext-link-type="DOI">10.1016/j.jcp.2020.109876</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Berner et al.(2015)Berner, Fossell, Ha, Hacker, and Snyder</label><mixed-citation>Berner, J., Fossell, K. R., Ha, S.-Y., Hacker, J. P., and Snyder, C.: Increasing the Skill of Probabilistic Forecasts: Understanding Performance Improvements from Model-Error Representations, Mon. Weather Rev., 143, 1295–1320, <ext-link xlink:href="https://doi.org/10.1175/MWR-D-14-00091.1" ext-link-type="DOI">10.1175/MWR-D-14-00091.1</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Bloin-Wibe et al.(2025)Bloin-Wibe, Noyelle, Humphrey, Beyerle, Knutti, and Fischer</label><mixed-citation>Bloin-Wibe, L., Noyelle, R., Humphrey, V., Beyerle, U., Knutti, R., and Fischer, E.: Estimating return periods for extreme events in climate models through Ensemble Boosting, Weather Clim. Dynam., 6, 1147–1177, <ext-link xlink:href="https://doi.org/10.5194/wcd-6-1147-2025" ext-link-type="DOI">10.5194/wcd-6-1147-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Blonigan et al.(2019)Blonigan, Farazmand, and Sapsis</label><mixed-citation>Blonigan, P. J., Farazmand, M., and Sapsis, T. P.: Are extreme dissipation events predictable in turbulent fluid flows?, Phys. Rev. Fluids, 4, 044606, <ext-link xlink:href="https://doi.org/10.1103/PhysRevFluids.4.044606" ext-link-type="DOI">10.1103/PhysRevFluids.4.044606</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Boulaguiem et al.(2022)Boulaguiem, Zscheischler, Vignotto, van der Wiel, and Engelke</label><mixed-citation>Boulaguiem, Y., Zscheischler, J., Vignotto, E., van der Wiel, K., and Engelke, S.: Modeling and simulating spatial extremes by combining extreme value theory with generative adversarial networks, Environ. Data Sci., 1, e5, <ext-link xlink:href="https://doi.org/10.1017/eds.2022.4" ext-link-type="DOI">10.1017/eds.2022.4</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Bourlioux and Majda(2002)</label><mixed-citation>Bourlioux, A. and Majda, A. J.: Elementary models with probability distribution function intermittency for passive scalars with a mean gradient, Phys. Fluids, 14, 881–897, <ext-link xlink:href="https://doi.org/10.1063/1.1430736" ext-link-type="DOI">10.1063/1.1430736</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Breitung(2021)</label><mixed-citation>Breitung, K.: SORM, Design Points, Subset Simulation, and Markov Chain Monte Carlo, ASCE-ASME J. Risk Uncertain. Eng. Syst. A, 7, 04021052, <ext-link xlink:href="https://doi.org/10.1061/AJRUA6.0001166" ext-link-type="DOI">10.1061/AJRUA6.0001166</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Castaing et al.(1989)Castaing, Gunaratne, Heslot, Kadanoff, Libchaber, Thomae, Wu, Zaleski, and Zanetti</label><mixed-citation>Castaing, B., Gunaratne, G., Heslot, F., Kadanoff, L., Libchaber, A., Thomae, S., Wu, X.-Z., Zaleski, S., and Zanetti, G.: Scaling of hard thermal turbulence in Rayleigh-Bénard convection, J. Fluid Mech., 204, 1–30, <ext-link xlink:href="https://doi.org/10.1017/S0022112089001643" ext-link-type="DOI">10.1017/S0022112089001643</ext-link>, 1989.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Coles(2001)</label><mixed-citation>Coles, S.: An introduction to statistical modeling of extreme values, in: Springer Series in Statistics, 1st Edn., Springer, ISBN 978-1-85233-459-8, <ext-link xlink:href="https://doi.org/10.1007/978-1-4471-3675-0" ext-link-type="DOI">10.1007/978-1-4471-3675-0</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Cérou and Guyader(2007)</label><mixed-citation>Cérou, F. and Guyader, A.: Adaptive Multilevel Splitting for Rare Event Analysis, Stoch. Anal. Appl., 25, 417–443, <ext-link xlink:href="https://doi.org/10.1080/07362990601139628" ext-link-type="DOI">10.1080/07362990601139628</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Dematteis et al.(2019)Dematteis, Grafke, and Vanden-Eijnden</label><mixed-citation>Dematteis, G., Grafke, T., and Vanden-Eijnden, E.: Extreme Event Quantification in Dynamical Systems with Random Components, SIAM/ASA J. Uncertain. Quant., 7, 1029–1059, <ext-link xlink:href="https://doi.org/10.1137/18M1211003" ext-link-type="DOI">10.1137/18M1211003</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Diaconescu and Laprise(2012)</label><mixed-citation>Diaconescu, E. P. and Laprise, R.: Singular vectors in atmospheric sciences: A review, Earth-Sci. Rev., 113, 161–175, <ext-link xlink:href="https://doi.org/10.1016/j.earscirev.2012.05.005" ext-link-type="DOI">10.1016/j.earscirev.2012.05.005</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Farazmand and Sapsis(2017)</label><mixed-citation>Farazmand, M. and Sapsis, T. P.: A variational approach to probing extreme events in turbulent dynamical systems, Sci. Adv., 3, e1701533, <ext-link xlink:href="https://doi.org/10.1126/sciadv.1701533" ext-link-type="DOI">10.1126/sciadv.1701533</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Farrell and Ioannou(1996a)</label><mixed-citation>Farrell, B. F. and Ioannou, P. J.: Generalized Stability Theory. Part I: Autonomous Operators, J. Atmos. Sci., 53, 2025–2040, <ext-link xlink:href="https://doi.org/10.1175/1520-0469(1996)053&lt;2025:GSTPIA&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0469(1996)053&lt;2025:GSTPIA&gt;2.0.CO;2</ext-link>, 1996a.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Farrell and Ioannou(1996b)</label><mixed-citation>Farrell, B. F. and Ioannou, P. J.: Generalized Stability Theory. Part II: Nonautonomous Operators, J. Atmos. Sci., 53, 2041–2053, <ext-link xlink:href="https://doi.org/10.1175/1520-0469(1996)053&lt;2041:GSTPIN&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0469(1996)053&lt;2041:GSTPIN&gt;2.0.CO;2</ext-link>, 1996b.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Finkel and O'Gorman(2024)</label><mixed-citation>Finkel, J. and O'Gorman, P. A.: Bringing Statistics to Storylines: Rare Event Sampling for Sudden, Transient Extreme Events, J. Adv. Model. Earth Syst., 16, e2024MS004264, <ext-link xlink:href="https://doi.org/10.1029/2024MS004264" ext-link-type="DOI">10.1029/2024MS004264</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Finkel and O'Gorman(2026)</label><mixed-citation>Finkel, J. and O'Gorman, P. A.: Rare Event Sampling for Moving Targets: Extremes of Temperature and Daily Precipitation in a General Circulation Model, J. Adv. Model. Earth Syst., 18, e2025MS005456, <ext-link xlink:href="https://doi.org/10.1029/2025MS005456" ext-link-type="DOI">10.1029/2025MS005456</ext-link>, 2026.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Finkel et al.(2023)Finkel, Gerber, Abbot, and Weare</label><mixed-citation>Finkel, J., Gerber, E. P., Abbot, D. S., and Weare, J.: Revealing the Statistics of Extreme Events Hidden in Short Weather Forecast Data, AGU Adv., 4, e2023AV000881, <ext-link xlink:href="https://doi.org/10.1029/2023AV000881" ext-link-type="DOI">10.1029/2023AV000881</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Fischer et al.(2023)Fischer, Beyerle, Bloin-Wibe, Gessner, Humphrey, Lehner, Pendergrass, Sippel, Zeder, and Knutti</label><mixed-citation>Fischer, E. M., Beyerle, U., Bloin-Wibe, L., Gessner, C., Humphrey, V., Lehner, F., Pendergrass, A. G., Sippel, S., Zeder, J., and Knutti, R.: Storylines for unprecedented heatwaves based on ensemble boosting, Nat. Commun., 14, 4643, <ext-link xlink:href="https://doi.org/10.1038/s41467-023-40112-4" ext-link-type="DOI">10.1038/s41467-023-40112-4</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Gálfi et al.(2017)Gálfi, Bódai, and Lucarini</label><mixed-citation>Gálfi, V. M., Bódai, T., and Lucarini, V.: Convergence of Extreme Value Statistics in a Two-Layer Quasi-Geostrophic Atmospheric Model, Complexity, 2017, 5340858, <ext-link xlink:href="https://doi.org/10.1155/2017/5340858" ext-link-type="DOI">10.1155/2017/5340858</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Gessner(2022)</label><mixed-citation>Gessner, C.: Physical storylines for very rare climate extremes, PhD thesis, ETH Zurich, <uri>https://www.research-collection.ethz.ch/entities/publication/2405b8fb-a51d-41df-b4e5-e7afa0a93719</uri> (last access: 17 May 2026), 2022.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Gessner et al.(2021)Gessner, Fischer, Beyerle, and Knutti</label><mixed-citation>Gessner, C., Fischer, E. M., Beyerle, U., and Knutti, R.: Very Rare Heat Extremes: Quantifying and Understanding Using Ensemble Reinitialization, J. Climate, 34, 6619–6634, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-20-0916.1" ext-link-type="DOI">10.1175/JCLI-D-20-0916.1</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Ghil et al.(2011)Ghil, Yiou, Hallegatte, Malamud, Naveau, Soloviev, Friederichs, Keilis-Borok, Kondrashov, Kossobokov, Mestre, Nicolis, Rust, Shebalin, Vrac, Witt, and Zaliapin</label><mixed-citation>Ghil, M., Yiou, P., Hallegatte, S., Malamud, B. D., Naveau, P., Soloviev, A., Friederichs, P., Keilis-Borok, V., Kondrashov, D., Kossobokov, V., Mestre, O., Nicolis, C., Rust, H. W., Shebalin, P., Vrac, M., Witt, A., and Zaliapin, I.: Extreme events: dynamics, statistics and prediction, Nonlin. Processes Geophys., 18, 295–350, <ext-link xlink:href="https://doi.org/10.5194/npg-18-295-2011" ext-link-type="DOI">10.5194/npg-18-295-2011</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Giorgini et al.(2024)Giorgini, Deck, Bischoff, and Souza</label><mixed-citation>Giorgini, L. T., Deck, K., Bischoff, T., and Souza, A.: Response Theory via Generative Score Modeling, Phys. Rev. Lett., 133, 267302, <ext-link xlink:href="https://doi.org/10.1103/PhysRevLett.133.267302" ext-link-type="DOI">10.1103/PhysRevLett.133.267302</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Gollub et al.(1991)Gollub, Clarke, Gharib, Lane, and Mesquita</label><mixed-citation>Gollub, J. P., Clarke, J., Gharib, M., Lane, B., and Mesquita, O. N.: Fluctuations and transport in a stirred fluid with a mean gradient, Phys. Rev. Lett., 67, 3507–3510, <ext-link xlink:href="https://doi.org/10.1103/PhysRevLett.67.3507" ext-link-type="DOI">10.1103/PhysRevLett.67.3507</ext-link>, 1991.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Haidvogel and Held(1980)</label><mixed-citation>Haidvogel, D. B. and Held, I. M.: Homogeneous Quasi-Geostrophic Turbulence Driven by a Uniform Temperature Gradient, J. Atmos. Sci., 37, 2644–2660, <ext-link xlink:href="https://doi.org/10.1175/1520-0469(1980)037&lt;2644:HQGTDB&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0469(1980)037&lt;2644:HQGTDB&gt;2.0.CO;2</ext-link>, 1980.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Huang et al.(2016)Huang, Stein, McInerney, Sun, and Moyer</label><mixed-citation>Huang, W. K., Stein, M. L., McInerney, D. J., Sun, S., and Moyer, E. J.: Estimating changes in temperature extremes from millennial-scale climate simulations using generalized extreme value (GEV) distributions, Adv. Stat. Climatol. Meteorol. Oceanogr., 2, 79–103, <ext-link xlink:href="https://doi.org/10.5194/ascmo-2-79-2016" ext-link-type="DOI">10.5194/ascmo-2-79-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Huser and Wadsworth(2022)</label><mixed-citation>Huser, R. and Wadsworth, J. L.: Advances in statistical modeling of spatial extremes, WIREs Comput. Stat. 14, e1537, <ext-link xlink:href="https://doi.org/10.1002/wics.1537" ext-link-type="DOI">10.1002/wics.1537</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Huser et al.(2025)Huser, Opitz, and Wadsworth</label><mixed-citation>Huser, R., Opitz, T., and Wadsworth, J. L.: Modeling of spatial extremes in environmental data science: time to move away from max-stable processes, Environ. Data Sci., 4, e3, <ext-link xlink:href="https://doi.org/10.1017/eds.2024.54" ext-link-type="DOI">10.1017/eds.2024.54</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Jalbert et al.(2024)Jalbert, Farmer, Gobeil, and Roy</label><mixed-citation>Jalbert, J., Farmer, M., Gobeil, G., and Roy, P.: Extremes.jl: Extreme Value Analysis in Julia, J. Statist. Softw., 109, 1–35, <ext-link xlink:href="https://doi.org/10.18637/jss.v109.i06" ext-link-type="DOI">10.18637/jss.v109.i06</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>John et al.(2022)John, Douville, Ribes, and Yiou</label><mixed-citation>John, A., Douville, H., Ribes, A., and Yiou, P.: Quantifying CMIP6 model uncertainties in extreme precipitation projections, Weather Clim. Ext., 36, 100435, <ext-link xlink:href="https://doi.org/10.1016/j.wace.2022.100435" ext-link-type="DOI">10.1016/j.wace.2022.100435</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>justinfocus12(2025)</label><mixed-citation>justinfocus12: justinfocus12/COAST: Initial release for submission of BEST COAST paper to NPG, Zenodo [code and data set], <ext-link xlink:href="https://doi.org/10.5281/zenodo.17355215" ext-link-type="DOI">10.5281/zenodo.17355215</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Kabir et al.(2018)Kabir, Khosravi, Hosen, and Nahavandi</label><mixed-citation>Kabir, H. M. D., Khosravi, A., Hosen, M. A., and Nahavandi, S.: Neural Network-Based Uncertainty Quantification: A Survey of Methodologies and Applications, IEEE Access, 6, 36218–36234, <ext-link xlink:href="https://doi.org/10.1109/ACCESS.2018.2836917" ext-link-type="DOI">10.1109/ACCESS.2018.2836917</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Kahn and Harris(1951)</label><mixed-citation>Kahn, H. and Harris, T. E.: Estimation of particle transmission by random sampling, series 12, National Bureau of Standards applied mathematics 27–30, <uri>https://people.bordeaux.inria.fr/pierre.delmoral/kahn-harris.pdf</uri> (last access: 17 May 2026), 1951.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Lapeyre and Held(2004)</label><mixed-citation>Lapeyre, G. and Held, I. M.: The Role of Moisture in the Dynamics and Energetics of Turbulent Baroclinic Eddies, J. Atmos. Sci., 61, 1693–1710, <ext-link xlink:href="https://doi.org/10.1175/1520-0469(2004)061&lt;1693:TROMIT&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0469(2004)061&lt;1693:TROMIT&gt;2.0.CO;2</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Leobacher and Pillichshammer(2014)</label><mixed-citation>Leobacher, G. and Pillichshammer, F.: Introduction to quasi-Monte Carlo integration and applications, Springer, ISBN 978-3-032-05446-3, <ext-link xlink:href="https://doi.org/10.1007/978-3-032-05446-3" ext-link-type="DOI">10.1007/978-3-032-05446-3</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Lestang et al.(2018)Lestang, Ragone, Bréhier, Herbert, and Bouchet</label><mixed-citation>Lestang, T., Ragone, F., Bréhier, C.-E., Herbert, C., and Bouchet, F.: Computing return times or return periods with rare event algorithms, J. Stat. Mech.: Theory Exp., 2018, 043213, <ext-link xlink:href="https://doi.org/10.1088/1742-5468/aab856" ext-link-type="DOI">10.1088/1742-5468/aab856</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Linz et al.(2020)Linz, Chen, Zhang, and Zhang</label><mixed-citation>Linz, M., Chen, G., Zhang, B., and Zhang, P.: A Framework for Understanding How Dynamics Shape Temperature Distributions, Geophys. Res. Lett., 47, e2019GL085684, <ext-link xlink:href="https://doi.org/10.1029/2019GL085684" ext-link-type="DOI">10.1029/2019GL085684</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Lorenz and Emanuel(1998)</label><mixed-citation>Lorenz, E. N. and Emanuel, K. A.: Optimal Sites for Supplementary Weather Observations: Simulation with a Small Model, J. Atmos. Sci., 55, 399–414, <ext-link xlink:href="https://doi.org/10.1175/1520-0469(1998)055&lt;0399:OSFSWO&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0469(1998)055&lt;0399:OSFSWO&gt;2.0.CO;2</ext-link>, 1998.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Lucarini and Gritsun(2020)</label><mixed-citation>Lucarini, V. and Gritsun, A.: A new mathematical framework for atmospheric blocking events, Clim. Dynam., 54, 575–598, <ext-link xlink:href="https://doi.org/10.1007/s00382-019-05018-2" ext-link-type="DOI">10.1007/s00382-019-05018-2</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Lucarini et al.(2016)Lucarini, Faranda, de Freitas, Holland, Kuna, Nicol, Todd, Vaienti et al.</label><mixed-citation>Lucarini, V., Faranda, D., Moreira Freitas, A. C., Freitas, J. M., Kuna, T., Holland, M., Nicol, M., Todd, M., and Vaienti, S.: Extremes and recurrence in dynamical systems, John Wiley &amp; Sons, <ext-link xlink:href="https://doi.org/10.1002/9781118632321" ext-link-type="DOI">10.1002/9781118632321</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Lucente et al.(2022)Lucente, Rolland, Herbert, and Bouchet</label><mixed-citation>Lucente, D., Rolland, J., Herbert, C., and Bouchet, F.: Coupling rare event algorithms with data-based learned committor functions using the analogue Markov chain, J. Stat. Mech.: Theory Exp., 2022, 083201, <ext-link xlink:href="https://doi.org/10.1088/1742-5468/ac7aa7" ext-link-type="DOI">10.1088/1742-5468/ac7aa7</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Mahesh et al.(2024a)Mahesh, Collins, Bonev, Brenowitz, Cohen, Elms, Harrington, Kashinath, Kurth, North, OBrien, Pritchard, Pruitt, Risser, Subramanian, and Willard</label><mixed-citation>Mahesh, A., Collins, W., Bonev, B., Brenowitz, N., Cohen, Y., Elms, J., Harrington, P., Kashinath, K., Kurth, T., North, J., OBrien, T., Pritchard, M., Pruitt, D., Risser, M., Subramanian, S., and Willard, J.: Huge Ensembles Part I: Design of Ensemble Weather Forecasts using Spherical Fourier Neural Operators, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2408.03100" ext-link-type="DOI">10.48550/arXiv.2408.03100</ext-link>, 2024a.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Mahesh et al.(2024b)Mahesh, Collins, Bonev, Brenowitz, Cohen, Harrington, Kashinath, Kurth, North, OBrien, Pritchard, Pruitt, Risser, Subramanian, and Willard</label><mixed-citation>Mahesh, A., Collins, W., Bonev, B., Brenowitz, N., Cohen, Y., Harrington, P., Kashinath, K., Kurth, T., North, J., OBrien, T., Pritchard, M., Pruitt, D., Risser, M., Subramanian, S., and Willard, J.: Huge Ensembles Part II: Properties of a Huge Ensemble of Hindcasts Generated with Spherical Fourier Neural Operators, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2408.01581" ext-link-type="DOI">10.48550/arXiv.2408.01581</ext-link>, 2024b.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Maiocchi et al.(2024)Maiocchi, Lucarini, Gritsun, and Sato</label><mixed-citation>Maiocchi, C. C., Lucarini, V., Gritsun, A., and Sato, Y.: Heterogeneity of the attractor of the Lorenz'96 model: Lyapunov analysis, unstable periodic orbits, and shadowing properties, Physica D, 457, 133970, <ext-link xlink:href="https://doi.org/10.1016/j.physd.2023.133970" ext-link-type="DOI">10.1016/j.physd.2023.133970</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Mohamad and Sapsis(2018)</label><mixed-citation>Mohamad, M. A. and Sapsis, T. P.: Sequential sampling strategy for extreme event statistics in nonlinear dynamical systems, P. Natl. Acad. Sci. USA, 115, 11138–11143, <ext-link xlink:href="https://doi.org/10.1073/pnas.1813263115" ext-link-type="DOI">10.1073/pnas.1813263115</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Neelin et al.(2010)Neelin, Lintner, Tian, Li, Zhang, Patra, Chahine, and Stechmann</label><mixed-citation>Neelin, J. D., Lintner, B. R., Tian, B., Li, Q., Zhang, L., Patra, P. K., Chahine, M. T., and Stechmann, S. N.: Long tails in deep columns of natural and anthropogenic tropospheric tracers, Geophys. Res. Lett., 37, <ext-link xlink:href="https://doi.org/10.1029/2009GL041726" ext-link-type="DOI">10.1029/2009GL041726</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Norwood et al.(2013)Norwood, Kalnay, Ide, Yang, and Wolfe</label><mixed-citation>Norwood, A., Kalnay, E., Ide, K., Yang, S.-C., and Wolfe, C.: Lyapunov, singular and bred vectors in a multi-scale system: an empirical exploration of vectors related to instabilities, J. Phys. A, 46, 254021, <ext-link xlink:href="https://doi.org/10.1088/1751-8113/46/25/254021" ext-link-type="DOI">10.1088/1751-8113/46/25/254021</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>Noyelle(2024)</label><mixed-citation>Noyelle, R.: Statistical and dynamical aspects of extreme heatwaves in the mid-latitudes, Theses, Université Paris-Saclay, <uri>https://hal.science/tel-04632646</uri> (last access: 17 May 2026), 2024.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>O'Gorman and Schneider(2009)</label><mixed-citation>O'Gorman, P. A. and Schneider, T.: Scaling of Precipitation Extremes over a Wide Range of Climates Simulated with an Idealized GCM, J. Climate, 22, 5676–5685, <ext-link xlink:href="https://doi.org/10.1175/2009JCLI2701.1" ext-link-type="DOI">10.1175/2009JCLI2701.1</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>Panetta(1993)</label><mixed-citation>Panetta, R. L.: Zonal Jets in Wide Baroclinically Unstable Regions: Persistence and Scale Selection, J. Atmos. Sci., 50, 2073–2106, <ext-link xlink:href="https://doi.org/10.1175/1520-0469(1993)050&lt;2073:ZJIWBU&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0469(1993)050&lt;2073:ZJIWBU&gt;2.0.CO;2</ext-link>, 1993.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>Pavliotis(2014)</label><mixed-citation>Pavliotis, G. A.: Stochastic processes and applications: diffusion processes, the Fokker-Planck and Langevin equations, in: vol. 60, Springer, <ext-link xlink:href="https://doi.org/10.1007/978-1-4939-1323-7" ext-link-type="DOI">10.1007/978-1-4939-1323-7</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Penland and Magorian(1993)</label><mixed-citation>Penland, C. and Magorian, T.: Prediction of Niño 3 Sea Surface Temperatures Using Linear Inverse Modeling, J. Climate, 6, 1067–1076, <ext-link xlink:href="https://doi.org/10.1175/1520-0442(1993)006&lt;1067:PONSST&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0442(1993)006&lt;1067:PONSST&gt;2.0.CO;2</ext-link>, 1993.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>Phillips(1956)</label><mixed-citation>Phillips, N. A.: The general circulation of the atmosphere: A numerical experiment, Q. J. Roy. Meteorol. Soc., 82, 123–164, <ext-link xlink:href="https://doi.org/10.1002/qj.49708235202" ext-link-type="DOI">10.1002/qj.49708235202</ext-link>, 1956.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Pickering et al.(2022)Pickering, Guth, Karniadakis, and Sapsis</label><mixed-citation>Pickering, E., Guth, S., Karniadakis, G. E., and Sapsis, T. P.: Discovering and forecasting extreme events via active learning in neural operators, Na. Comput. Sci., 2, 823–833, <ext-link xlink:href="https://doi.org/10.1038/s43588-022-00376-0" ext-link-type="DOI">10.1038/s43588-022-00376-0</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx57"><label>Pons et al.(2024)Pons, Yiou, Jzquel, and Messori</label><mixed-citation>Pons, F. M. E., Yiou, P., Jézéquel, A., and Messori, G.: Simulating the Western North America heatwave of 2021 with analogue importance sampling, Weather Clim. Ext., 43, 100651, <ext-link xlink:href="https://doi.org/10.1016/j.wace.2024.100651" ext-link-type="DOI">10.1016/j.wace.2024.100651</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx58"><label>Pumir et al.(1991)Pumir, Shraiman, and Siggia</label><mixed-citation>Pumir, A., Shraiman, B. I., and Siggia, E. D.: Exponential tails and random advection, Phys. Rev. Lett., 66, 2984–2987, <ext-link xlink:href="https://doi.org/10.1103/PhysRevLett.66.2984" ext-link-type="DOI">10.1103/PhysRevLett.66.2984</ext-link>, 1991.</mixed-citation></ref>
      <ref id="bib1.bibx59"><label>Qi and Majda(2016)</label><mixed-citation> Qi, D. and Majda, A. J.: Predicting fat-tailed intermittent probability distributions in passive scalar turbulence with imperfect models through empirical information theory, Commun. Math. Sci., 14, 1687–1722, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx60"><label>Qi and Majda(2018)</label><mixed-citation> Qi, D. and Majda, A. J.: Predicting extreme events for passive scalar turbulence in two-layer baroclinic flows through reduced-order stochastic models, Commun. Math. Sci., 16, 17–51, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx61"><label>Rackauckas(2023)</label><mixed-citation>Rackauckas, C.: QuasiMonteCarlo.jl, GitHub [code], <uri>https://github.com/SciML/QuasiMonteCarlo.jl</uri> (last access: 9 May 2025), 2023.</mixed-citation></ref>
      <ref id="bib1.bibx62"><label>Ragone and Bouchet(2021)</label><mixed-citation>Ragone, F. and Bouchet, F.: Rare Event Algorithm Study of Extreme Warm Summers and Heatwaves Over Europe, Geophys. Res. Lett., 48, e2020GL091197, <ext-link xlink:href="https://doi.org/10.1029/2020GL091197" ext-link-type="DOI">10.1029/2020GL091197</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx63"><label>Ragone et al.(2018)Ragone, Wouters, and Bouchet</label><mixed-citation>Ragone, F., Wouters, J., and Bouchet, F.: Computation of extreme heat waves in climate models using a large deviation algorithm, P. Natl. Acad. Sci. USA, 115, 24–29, <ext-link xlink:href="https://doi.org/10.1073/pnas.1712645115" ext-link-type="DOI">10.1073/pnas.1712645115</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx64"><label>Rampal et al.(2025)Rampal, Gibson, Sherwood, Abramowitz, and Hobeichi</label><mixed-citation>Rampal, N., Gibson, P. B., Sherwood, S., Abramowitz, G., and Hobeichi, S.: A Reliable Generative Adversarial Network Approach for Climate Downscaling and Weather Generation, J. Adv. Model. Earth Syst., 17, e2024MS004668, <ext-link xlink:href="https://doi.org/10.1029/2024MS004668" ext-link-type="DOI">10.1029/2024MS004668</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx65"><label>Rolland(2022)</label><mixed-citation>Rolland, J.: Collapse of transitional wall turbulence captured using a rare events algorithm, J. Fluid Mech., 931, A22, <ext-link xlink:href="https://doi.org/10.1017/jfm.2021.957" ext-link-type="DOI">10.1017/jfm.2021.957</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx66"><label>Saha and Ravela(2024)</label><mixed-citation>Saha, A. and Ravela, S.: Statistical-Physical Adversarial Learning From Data and Models for Downscaling Rainfall Extremes, J. Adv. Model. Earth Syst., 16, e2023MS003860, <ext-link xlink:href="https://doi.org/10.1029/2023MS003860" ext-link-type="DOI">10.1029/2023MS003860</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx67"><label>Sapsis(2020)</label><mixed-citation>Sapsis, T. P.: Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples, P. Roy. Soc. A:, 476, 20190834, <ext-link xlink:href="https://doi.org/10.1098/rspa.2019.0834" ext-link-type="DOI">10.1098/rspa.2019.0834</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx68"><label>Sundar et al.(2024)Sundar, Parashar, Blanchard, and Dodov</label><mixed-citation>Sundar, R., Parashar, N., Blanchard, A., and Dodov, B.: TAUDiff: Improving statistical downscaling for extreme weather events using generative diffusion models, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2412.13627" ext-link-type="DOI">10.48550/arXiv.2412.13627</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx69"><label>Tebaldi et al.(2020)Tebaldi, Armbruster, Engler, and Link</label><mixed-citation>Tebaldi, C., Armbruster, A., Engler, H. P., and Link, R.: Emulating climate extreme indices, Environ. Res. Lett., 15, 074006, <ext-link xlink:href="https://doi.org/10.1088/1748-9326/ab8332" ext-link-type="DOI">10.1088/1748-9326/ab8332</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx70"><label>Thompson(2010)</label><mixed-citation>Thompson, A. F.: Jet Formation and Evolution in Baroclinic Turbulence with Simple Topography, J. Phys. Oceanogr., 40, 257–278, <ext-link xlink:href="https://doi.org/10.1175/2009JPO4218.1" ext-link-type="DOI">10.1175/2009JPO4218.1</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx71"><label>Thompson et al.(2017)Thompson, Dunstone, Scaife, Smith, Slingo, Brown, and Belcher</label><mixed-citation>Thompson, V., Dunstone, N. J., Scaife, A. A., Smith, D. M., Slingo, J. M., Brown, S., and Belcher, S. E.: High risk of unprecedented UK rainfall in the current climate, Nat. Commun., 8, 107, <ext-link xlink:href="https://doi.org/10.1038/s41467-017-00275-3" ext-link-type="DOI">10.1038/s41467-017-00275-3</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx72"><label>Tong et al.(2021)Tong, Vanden-Eijnden, and Stadler</label><mixed-citation> Tong, S., Vanden-Eijnden, E., and Stadler, G.: Extreme event probability estimation using PDE-constrained optimization and large deviation theory, with application to tsunamis, Commun. Appl. Math. Comput. Sci., 16, 181–225, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx73"><label>Vandal et al.(2017)Vandal, Kodra, Ganguly, Michaelis, Nemani, and Ganguly</label><mixed-citation>Vandal, T., Kodra, E., Ganguly, S., Michaelis, A., Nemani, R., and Ganguly, A. R.: DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution, in: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '17, Association for Computing Machinery, New York, NY, USA, 1663–1672, ISBN 9781450348874, <ext-link xlink:href="https://doi.org/10.1145/3097983.3098004" ext-link-type="DOI">10.1145/3097983.3098004</ext-link>, 2017. </mixed-citation></ref>
      <ref id="bib1.bibx74"><label>van den Dool(1989)</label><mixed-citation>van den Dool, H. M.: A New Look at Weather Forecasting through Analogues, Mon. Weather Rev., 117, 2230–2247, <ext-link xlink:href="https://doi.org/10.1175/1520-0493(1989)117&lt;2230:ANLAWF&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0493(1989)117&lt;2230:ANLAWF&gt;2.0.CO;2</ext-link>, 1989.</mixed-citation></ref>
      <ref id="bib1.bibx75"><label>van Kekem and Sterk(2018)</label><mixed-citation>van Kekem, D. L. and Sterk, A. E.: Wave propagation in the Lorenz-96 model, Nonlin. Processes Geophys., 25, 301–314, <ext-link xlink:href="https://doi.org/10.5194/npg-25-301-2018" ext-link-type="DOI">10.5194/npg-25-301-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx76"><label>Vonich and Hakim(2024)</label><mixed-citation>Vonich, P. T. and Hakim, G. J.: Predictability Limit of the 2021 Pacific Northwest Heatwave From Deep-Learning Sensitivity Analysis, Geophys. Res. Lett., 51, e2024GL110651, <ext-link xlink:href="https://doi.org/10.1029/2024GL110651" ext-link-type="DOI">10.1029/2024GL110651</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx77"><label>Wang et al.(2020)Wang, Mu, and Sun</label><mixed-citation>Wang, Q., Mu, M., and Sun, G.: A useful approach to sensitivity and predictability studies in geophysical fluid dynamics: conditional non-linear optimal perturbation, Natl. Sci. Rev., 7, 214–223, <ext-link xlink:href="https://doi.org/10.1093/nsr/nwz039" ext-link-type="DOI">10.1093/nsr/nwz039</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx78"><label>Watt and Mansfield(2024)</label><mixed-citation>Watt, R. A. and Mansfield, L. A.: Generative Diffusion-based Downscaling for Climate, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2404.17752" ext-link-type="DOI">10.48550/arXiv.2404.17752</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx79"><label>Webber et al.(2019)Webber, Plotkin, ONeill, Abbot, and Weare</label><mixed-citation>Webber, R. J., Plotkin, D. A., O'Neill, M. E., Abbot, D. S., and Weare, J.: Practical rare event sampling for extreme mesoscale weather, Chaos, 29, 053109, <ext-link xlink:href="https://doi.org/10.1063/1.5081461" ext-link-type="DOI">10.1063/1.5081461</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx80"><label>Whittaker and Luca(2025)</label><mixed-citation>Whittaker, T. and Luca, A. D.: Constructing Extreme Heatwave Storylines with Differentiable Climate Models, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2506.10660" ext-link-type="DOI">10.48550/arXiv.2506.10660</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx81"><label>Yang et al.(2022)Yang, Blanchard, Sapsis, and Perdikaris</label><mixed-citation>Yang, Y., Blanchard, A., Sapsis, T., and Perdikaris, P.: Output-weighted sampling for multi-armed bandits with extreme payoffs, P. Roy. Soc. A, 478, 20210781, <ext-link xlink:href="https://doi.org/10.1098/rspa.2021.0781" ext-link-type="DOI">10.1098/rspa.2021.0781</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx82"><label>Yiou and Jézéquel(2020)</label><mixed-citation>Yiou, P. and Jézéquel, A.: Simulation of extreme heat waves with empirical importance sampling, Geosci. Model Dev., 13, 763–781, <ext-link xlink:href="https://doi.org/10.5194/gmd-13-763-2020" ext-link-type="DOI">10.5194/gmd-13-763-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx83"><label>Zuckerman and Chong(2017)</label><mixed-citation>Zuckerman, D. M. and Chong, L. T.: Weighted Ensemble Simulation: Review of Methodology, Applications, and Software, Annu. Rev. Biophys., 46, 43–57, <ext-link xlink:href="https://doi.org/10.1146/annurev-biophys-070816-033834" ext-link-type="DOI">10.1146/annurev-biophys-070816-033834</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx84"><label>Zuev(2015)</label><mixed-citation>Zuev, K.: Subset Simulation Method for Rare Event Estimation: An Introduction, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1505.03506" ext-link-type="DOI">10.48550/arXiv.1505.03506</ext-link>, 2015.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Boosting ensembles for statistics of tails at  conditionally optimal advance split times</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Au and Beck(2001)</label><mixed-citation>
      
Au, S.-K. and Beck, J. L.: Estimation of small failure probabilities in high
dimensions by subset simulation, Probab. Eng. Mech., 16, 263–277, <a href="https://doi.org/10.1016/S0266-8920(01)00019-4" target="_blank">https://doi.org/10.1016/S0266-8920(01)00019-4</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Baars et al.(2021)Baars, Castellana, Wubs, and
Dijkstra</label><mixed-citation>
      
Baars, S., Castellana, D., Wubs, F., and Dijkstra, H.: Application of adaptive multilevel splitting to high-dimensional dynamical systems, J. Comput. Phys., 424, 109876, <a href="https://doi.org/10.1016/j.jcp.2020.109876" target="_blank">https://doi.org/10.1016/j.jcp.2020.109876</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Berner et al.(2015)Berner, Fossell, Ha, Hacker, and
Snyder</label><mixed-citation>
      
Berner, J., Fossell, K. R., Ha, S.-Y., Hacker, J. P., and Snyder, C.:
Increasing the Skill of Probabilistic Forecasts: Understanding Performance
Improvements from Model-Error Representations, Mon. Weather Rev., 143, 1295–1320, <a href="https://doi.org/10.1175/MWR-D-14-00091.1" target="_blank">https://doi.org/10.1175/MWR-D-14-00091.1</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Bloin-Wibe et al.(2025)Bloin-Wibe, Noyelle, Humphrey, Beyerle,
Knutti, and Fischer</label><mixed-citation>
      
Bloin-Wibe, L., Noyelle, R., Humphrey, V., Beyerle, U., Knutti, R., and Fischer, E.: Estimating return periods for extreme events in climate models through Ensemble Boosting, Weather Clim. Dynam., 6, 1147–1177, <a href="https://doi.org/10.5194/wcd-6-1147-2025" target="_blank">https://doi.org/10.5194/wcd-6-1147-2025</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Blonigan et al.(2019)Blonigan, Farazmand, and
Sapsis</label><mixed-citation>
      
Blonigan, P. J., Farazmand, M., and Sapsis, T. P.: Are extreme dissipation
events predictable in turbulent fluid flows?, Phys. Rev. Fluids, 4, 044606,
<a href="https://doi.org/10.1103/PhysRevFluids.4.044606" target="_blank">https://doi.org/10.1103/PhysRevFluids.4.044606</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Boulaguiem et al.(2022)Boulaguiem, Zscheischler, Vignotto, van der
Wiel, and Engelke</label><mixed-citation>
      
Boulaguiem, Y., Zscheischler, J., Vignotto, E., van der Wiel, K., and Engelke, S.: Modeling and simulating spatial extremes by combining extreme value theory with generative adversarial networks, Environ. Data Sci., 1, e5, <a href="https://doi.org/10.1017/eds.2022.4" target="_blank">https://doi.org/10.1017/eds.2022.4</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Bourlioux and Majda(2002)</label><mixed-citation>
      
Bourlioux, A. and Majda, A. J.: Elementary models with probability distribution function intermittency for passive scalars with a mean gradient, Phys. Fluids, 14, 881–897, <a href="https://doi.org/10.1063/1.1430736" target="_blank">https://doi.org/10.1063/1.1430736</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Breitung(2021)</label><mixed-citation>
      
Breitung, K.: SORM, Design Points, Subset Simulation, and Markov Chain Monte
Carlo, ASCE-ASME J. Risk Uncertain. Eng. Syst. A, 7, 04021052, <a href="https://doi.org/10.1061/AJRUA6.0001166" target="_blank">https://doi.org/10.1061/AJRUA6.0001166</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Castaing et al.(1989)Castaing, Gunaratne, Heslot, Kadanoff,
Libchaber, Thomae, Wu, Zaleski, and Zanetti</label><mixed-citation>
      
Castaing, B., Gunaratne, G., Heslot, F., Kadanoff, L., Libchaber, A., Thomae,
S., Wu, X.-Z., Zaleski, S., and Zanetti, G.: Scaling of hard thermal
turbulence in Rayleigh-Bénard convection, J. Fluid Mech., 204, 1–30, <a href="https://doi.org/10.1017/S0022112089001643" target="_blank">https://doi.org/10.1017/S0022112089001643</a>, 1989.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Coles(2001)</label><mixed-citation>
      
Coles, S.: An introduction to statistical modeling of extreme values, in: Springer Series in Statistics, 1st Edn., Springer, ISBN 978-1-85233-459-8,
<a href="https://doi.org/10.1007/978-1-4471-3675-0" target="_blank">https://doi.org/10.1007/978-1-4471-3675-0</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Cérou and Guyader(2007)</label><mixed-citation>
      
Cérou, F. and Guyader, A.: Adaptive Multilevel Splitting for Rare Event
Analysis, Stoch. Anal. Appl., 25, 417–443, <a href="https://doi.org/10.1080/07362990601139628" target="_blank">https://doi.org/10.1080/07362990601139628</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Dematteis et al.(2019)Dematteis, Grafke, and
Vanden-Eijnden</label><mixed-citation>
      
Dematteis, G., Grafke, T., and Vanden-Eijnden, E.: Extreme Event Quantification in Dynamical Systems with Random Components, SIAM/ASA J. Uncertain. Quant., 7, 1029–1059, <a href="https://doi.org/10.1137/18M1211003" target="_blank">https://doi.org/10.1137/18M1211003</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Diaconescu and Laprise(2012)</label><mixed-citation>
      
Diaconescu, E. P. and Laprise, R.: Singular vectors in atmospheric sciences: A review, Earth-Sci. Rev., 113, 161–175, <a href="https://doi.org/10.1016/j.earscirev.2012.05.005" target="_blank">https://doi.org/10.1016/j.earscirev.2012.05.005</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Farazmand and Sapsis(2017)</label><mixed-citation>
      
Farazmand, M. and Sapsis, T. P.: A variational approach to probing extreme
events in turbulent dynamical systems, Sci. Adv., 3, e1701533,
<a href="https://doi.org/10.1126/sciadv.1701533" target="_blank">https://doi.org/10.1126/sciadv.1701533</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Farrell and Ioannou(1996a)</label><mixed-citation>
      
Farrell, B. F. and Ioannou, P. J.: Generalized Stability Theory. Part I:
Autonomous Operators, J. Atmos. Sci., 53, 2025–2040,
<a href="https://doi.org/10.1175/1520-0469(1996)053&lt;2025:GSTPIA&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0469(1996)053&lt;2025:GSTPIA&gt;2.0.CO;2</a>, 1996a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Farrell and Ioannou(1996b)</label><mixed-citation>
      
Farrell, B. F. and Ioannou, P. J.: Generalized Stability Theory. Part II:
Nonautonomous Operators, J. Atmos. Sci., 53, 2041–2053,
<a href="https://doi.org/10.1175/1520-0469(1996)053&lt;2041:GSTPIN&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0469(1996)053&lt;2041:GSTPIN&gt;2.0.CO;2</a>, 1996b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Finkel and O'Gorman(2024)</label><mixed-citation>
      
Finkel, J. and O'Gorman, P. A.: Bringing Statistics to Storylines: Rare Event
Sampling for Sudden, Transient Extreme Events, J. Adv. Model. Earth Syst., 16, e2024MS004264, <a href="https://doi.org/10.1029/2024MS004264" target="_blank">https://doi.org/10.1029/2024MS004264</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Finkel and O'Gorman(2026)</label><mixed-citation>
      
Finkel, J. and O'Gorman, P. A.: Rare Event Sampling for Moving Targets:
Extremes of Temperature and Daily Precipitation in a General Circulation
Model, J. Adv. Model. Earth Syst., 18, e2025MS005456, <a href="https://doi.org/10.1029/2025MS005456" target="_blank">https://doi.org/10.1029/2025MS005456</a>, 2026.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Finkel et al.(2023)Finkel, Gerber, Abbot, and
Weare</label><mixed-citation>
      
Finkel, J., Gerber, E. P., Abbot, D. S., and Weare, J.: Revealing the
Statistics of Extreme Events Hidden in Short Weather Forecast Data, AGU Adv., 4, e2023AV000881, <a href="https://doi.org/10.1029/2023AV000881" target="_blank">https://doi.org/10.1029/2023AV000881</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Fischer et al.(2023)Fischer, Beyerle, Bloin-Wibe, Gessner, Humphrey, Lehner, Pendergrass, Sippel, Zeder, and Knutti</label><mixed-citation>
      
Fischer, E. M., Beyerle, U., Bloin-Wibe, L., Gessner, C., Humphrey, V., Lehner, F., Pendergrass, A. G., Sippel, S., Zeder, J., and Knutti, R.: Storylines for unprecedented heatwaves based on ensemble boosting, Nat. Commun., 14, 4643, <a href="https://doi.org/10.1038/s41467-023-40112-4" target="_blank">https://doi.org/10.1038/s41467-023-40112-4</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Gálfi et al.(2017)Gálfi, Bódai, and
Lucarini</label><mixed-citation>
      
Gálfi, V. M., Bódai, T., and Lucarini, V.: Convergence of Extreme Value Statistics in a Two-Layer Quasi-Geostrophic Atmospheric Model, Complexity, 2017, 5340858, <a href="https://doi.org/10.1155/2017/5340858" target="_blank">https://doi.org/10.1155/2017/5340858</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Gessner(2022)</label><mixed-citation>
      
Gessner, C.: Physical storylines for very rare climate extremes, PhD thesis, ETH Zurich, <a href="https://www.research-collection.ethz.ch/entities/publication/2405b8fb-a51d-41df-b4e5-e7afa0a93719" target="_blank"/> (last access: 17 May 2026), 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Gessner et al.(2021)Gessner, Fischer, Beyerle, and
Knutti</label><mixed-citation>
      
Gessner, C., Fischer, E. M., Beyerle, U., and Knutti, R.: Very Rare Heat
Extremes: Quantifying and Understanding Using Ensemble Reinitialization, J. Climate, 34, 6619–6634, <a href="https://doi.org/10.1175/JCLI-D-20-0916.1" target="_blank">https://doi.org/10.1175/JCLI-D-20-0916.1</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Ghil et al.(2011)Ghil, Yiou, Hallegatte, Malamud, Naveau, Soloviev,
Friederichs, Keilis-Borok, Kondrashov, Kossobokov, Mestre, Nicolis, Rust,
Shebalin, Vrac, Witt, and Zaliapin</label><mixed-citation>
      
Ghil, M., Yiou, P., Hallegatte, S., Malamud, B. D., Naveau, P., Soloviev, A.,
Friederichs, P., Keilis-Borok, V., Kondrashov, D., Kossobokov, V., Mestre, O., Nicolis, C., Rust, H. W., Shebalin, P., Vrac, M., Witt, A., and Zaliapin,
I.: Extreme events: dynamics, statistics and prediction, Nonlin. Processes
Geophys., 18, 295–350, <a href="https://doi.org/10.5194/npg-18-295-2011" target="_blank">https://doi.org/10.5194/npg-18-295-2011</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Giorgini et al.(2024)Giorgini, Deck, Bischoff, and
Souza</label><mixed-citation>
      
Giorgini, L. T., Deck, K., Bischoff, T., and Souza, A.: Response Theory via
Generative Score Modeling, Phys. Rev. Lett., 133, 267302,
<a href="https://doi.org/10.1103/PhysRevLett.133.267302" target="_blank">https://doi.org/10.1103/PhysRevLett.133.267302</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Gollub et al.(1991)Gollub, Clarke, Gharib, Lane, and
Mesquita</label><mixed-citation>
      
Gollub, J. P., Clarke, J., Gharib, M., Lane, B., and Mesquita, O. N.:
Fluctuations and transport in a stirred fluid with a mean gradient, Phys.
Rev. Lett., 67, 3507–3510, <a href="https://doi.org/10.1103/PhysRevLett.67.3507" target="_blank">https://doi.org/10.1103/PhysRevLett.67.3507</a>, 1991.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Haidvogel and Held(1980)</label><mixed-citation>
      
Haidvogel, D. B. and Held, I. M.: Homogeneous Quasi-Geostrophic Turbulence
Driven by a Uniform Temperature Gradient, J. Atmos. Sci., 37, 2644–2660, <a href="https://doi.org/10.1175/1520-0469(1980)037&lt;2644:HQGTDB&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0469(1980)037&lt;2644:HQGTDB&gt;2.0.CO;2</a>, 1980.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Huang et al.(2016)Huang, Stein, McInerney, Sun, and
Moyer</label><mixed-citation>
      
Huang, W. K., Stein, M. L., McInerney, D. J., Sun, S., and Moyer, E. J.:
Estimating changes in temperature extremes from millennial-scale climate
simulations using generalized extreme value (GEV) distributions, Adv. Stat. Climatol. Meteorol. Oceanogr., 2, 79–103, <a href="https://doi.org/10.5194/ascmo-2-79-2016" target="_blank">https://doi.org/10.5194/ascmo-2-79-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Huser and Wadsworth(2022)</label><mixed-citation>
      
Huser, R. and Wadsworth, J. L.: Advances in statistical modeling of spatial
extremes, WIREs Comput. Stat. 14, e1537, <a href="https://doi.org/10.1002/wics.1537" target="_blank">https://doi.org/10.1002/wics.1537</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Huser et al.(2025)Huser, Opitz, and Wadsworth</label><mixed-citation>
      
Huser, R., Opitz, T., and Wadsworth, J. L.: Modeling of spatial extremes in
environmental data science: time to move away from max-stable processes,
Environ. Data Sci., 4, e3, <a href="https://doi.org/10.1017/eds.2024.54" target="_blank">https://doi.org/10.1017/eds.2024.54</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Jalbert et al.(2024)Jalbert, Farmer, Gobeil, and
Roy</label><mixed-citation>
      
Jalbert, J., Farmer, M., Gobeil, G., and Roy, P.: Extremes.jl: Extreme Value
Analysis in Julia, J. Statist. Softw., 109, 1–35, <a href="https://doi.org/10.18637/jss.v109.i06" target="_blank">https://doi.org/10.18637/jss.v109.i06</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>John et al.(2022)John, Douville, Ribes, and
Yiou</label><mixed-citation>
      
John, A., Douville, H., Ribes, A., and Yiou, P.: Quantifying CMIP6 model
uncertainties in extreme precipitation projections, Weather Clim. Ext., 36, 100435, <a href="https://doi.org/10.1016/j.wace.2022.100435" target="_blank">https://doi.org/10.1016/j.wace.2022.100435</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>justinfocus12(2025)</label><mixed-citation>
      
justinfocus12: justinfocus12/COAST: Initial release for submission of BEST
COAST paper to NPG, Zenodo [code and data set], <a href="https://doi.org/10.5281/zenodo.17355215" target="_blank">https://doi.org/10.5281/zenodo.17355215</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Kabir et al.(2018)Kabir, Khosravi, Hosen, and
Nahavandi</label><mixed-citation>
      
Kabir, H. M. D., Khosravi, A., Hosen, M. A., and Nahavandi, S.: Neural
Network-Based Uncertainty Quantification: A Survey of Methodologies and
Applications, IEEE Access, 6, 36218–36234, <a href="https://doi.org/10.1109/ACCESS.2018.2836917" target="_blank">https://doi.org/10.1109/ACCESS.2018.2836917</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Kahn and Harris(1951)</label><mixed-citation>
      
Kahn, H. and Harris, T. E.: Estimation of particle transmission by random
sampling, series 12, National Bureau of Standards applied mathematics
27–30, <a href="https://people.bordeaux.inria.fr/pierre.delmoral/kahn-harris.pdf" target="_blank"/> (last access: 17 May 2026), 1951.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Lapeyre and Held(2004)</label><mixed-citation>
      
Lapeyre, G. and Held, I. M.: The Role of Moisture in the Dynamics and
Energetics of Turbulent Baroclinic Eddies, J. Atmos. Sci., 61, 1693–1710,
<a href="https://doi.org/10.1175/1520-0469(2004)061&lt;1693:TROMIT&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0469(2004)061&lt;1693:TROMIT&gt;2.0.CO;2</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Leobacher and Pillichshammer(2014)</label><mixed-citation>
      
Leobacher, G. and Pillichshammer, F.: Introduction to quasi-Monte Carlo
integration and applications, Springer, ISBN 978-3-032-05446-3, <a href="https://doi.org/10.1007/978-3-032-05446-3" target="_blank">https://doi.org/10.1007/978-3-032-05446-3</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Lestang et al.(2018)Lestang, Ragone, Bréhier, Herbert, and
Bouchet</label><mixed-citation>
      
Lestang, T., Ragone, F., Bréhier, C.-E., Herbert, C., and Bouchet, F.:
Computing return times or return periods with rare event algorithms, J. Stat. Mech.: Theory Exp., 2018, 043213, <a href="https://doi.org/10.1088/1742-5468/aab856" target="_blank">https://doi.org/10.1088/1742-5468/aab856</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Linz et al.(2020)Linz, Chen, Zhang, and Zhang</label><mixed-citation>
      
Linz, M., Chen, G., Zhang, B., and Zhang, P.: A Framework for Understanding How Dynamics Shape Temperature Distributions, Geophys. Res. Lett., 47,
e2019GL085684, <a href="https://doi.org/10.1029/2019GL085684" target="_blank">https://doi.org/10.1029/2019GL085684</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Lorenz and Emanuel(1998)</label><mixed-citation>
      
Lorenz, E. N. and Emanuel, K. A.: Optimal Sites for Supplementary Weather
Observations: Simulation with a Small Model, J. Atmos. Sci., 55, 399–414,
<a href="https://doi.org/10.1175/1520-0469(1998)055&lt;0399:OSFSWO&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0469(1998)055&lt;0399:OSFSWO&gt;2.0.CO;2</a>, 1998.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Lucarini and Gritsun(2020)</label><mixed-citation>
      
Lucarini, V. and Gritsun, A.: A new mathematical framework for atmospheric
blocking events, Clim. Dynam., 54, 575–598, <a href="https://doi.org/10.1007/s00382-019-05018-2" target="_blank">https://doi.org/10.1007/s00382-019-05018-2</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Lucarini et al.(2016)Lucarini, Faranda, de Freitas, Holland, Kuna,
Nicol, Todd, Vaienti et al.</label><mixed-citation>
      
Lucarini, V., Faranda, D., Moreira Freitas, A. C., Freitas, J. M., Kuna, T., Holland, M., Nicol, M., Todd, M., and Vaienti, S.: Extremes and recurrence in dynamical systems, John Wiley &amp; Sons, <a href="https://doi.org/10.1002/9781118632321" target="_blank">https://doi.org/10.1002/9781118632321</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Lucente et al.(2022)Lucente, Rolland, Herbert, and
Bouchet</label><mixed-citation>
      
Lucente, D., Rolland, J., Herbert, C., and Bouchet, F.: Coupling rare event
algorithms with data-based learned committor functions using the analogue
Markov chain, J. Stat. Mech.: Theory Exp., 2022, 083201, <a href="https://doi.org/10.1088/1742-5468/ac7aa7" target="_blank">https://doi.org/10.1088/1742-5468/ac7aa7</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Mahesh et al.(2024a)Mahesh, Collins, Bonev, Brenowitz,
Cohen, Elms, Harrington, Kashinath, Kurth, North, OBrien, Pritchard, Pruitt, Risser, Subramanian, and Willard</label><mixed-citation>
      
Mahesh, A., Collins, W., Bonev, B., Brenowitz, N., Cohen, Y., Elms, J.,
Harrington, P., Kashinath, K., Kurth, T., North, J., OBrien, T., Pritchard,
M., Pruitt, D., Risser, M., Subramanian, S., and Willard, J.: Huge Ensembles
Part I: Design of Ensemble Weather Forecasts using Spherical Fourier Neural
Operators, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.2408.03100" target="_blank">https://doi.org/10.48550/arXiv.2408.03100</a>, 2024a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Mahesh et al.(2024b)Mahesh, Collins, Bonev, Brenowitz,
Cohen, Harrington, Kashinath, Kurth, North, OBrien, Pritchard, Pruitt,
Risser, Subramanian, and Willard</label><mixed-citation>
      
Mahesh, A., Collins, W., Bonev, B., Brenowitz, N., Cohen, Y., Harrington, P.,
Kashinath, K., Kurth, T., North, J., OBrien, T., Pritchard, M., Pruitt, D.,
Risser, M., Subramanian, S., and Willard, J.: Huge Ensembles Part II:
Properties of a Huge Ensemble of Hindcasts Generated with Spherical Fourier
Neural Operators, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.2408.01581" target="_blank">https://doi.org/10.48550/arXiv.2408.01581</a>, 2024b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Maiocchi et al.(2024)Maiocchi, Lucarini, Gritsun, and
Sato</label><mixed-citation>
      
Maiocchi, C. C., Lucarini, V., Gritsun, A., and Sato, Y.: Heterogeneity of the attractor of the Lorenz'96 model: Lyapunov analysis, unstable periodic
orbits, and shadowing properties, Physica D, 457, 133970, <a href="https://doi.org/10.1016/j.physd.2023.133970" target="_blank">https://doi.org/10.1016/j.physd.2023.133970</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Mohamad and Sapsis(2018)</label><mixed-citation>
      
Mohamad, M. A. and Sapsis, T. P.: Sequential sampling strategy for extreme
event statistics in nonlinear dynamical systems, P. Natl. Acad. Sci. USA, 115, 11138–11143, <a href="https://doi.org/10.1073/pnas.1813263115" target="_blank">https://doi.org/10.1073/pnas.1813263115</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Neelin et al.(2010)Neelin, Lintner, Tian, Li, Zhang, Patra, Chahine, and Stechmann</label><mixed-citation>
      
Neelin, J. D., Lintner, B. R., Tian, B., Li, Q., Zhang, L., Patra, P. K.,
Chahine, M. T., and Stechmann, S. N.: Long tails in deep columns of natural
and anthropogenic tropospheric tracers, Geophys. Res. Lett., 37,
<a href="https://doi.org/10.1029/2009GL041726" target="_blank">https://doi.org/10.1029/2009GL041726</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Norwood et al.(2013)Norwood, Kalnay, Ide, Yang, and
Wolfe</label><mixed-citation>
      
Norwood, A., Kalnay, E., Ide, K., Yang, S.-C., and Wolfe, C.: Lyapunov,
singular and bred vectors in a multi-scale system: an empirical exploration
of vectors related to instabilities, J. Phys. A, 46, 254021, <a href="https://doi.org/10.1088/1751-8113/46/25/254021" target="_blank">https://doi.org/10.1088/1751-8113/46/25/254021</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Noyelle(2024)</label><mixed-citation>
      
Noyelle, R.: Statistical and dynamical aspects of extreme heatwaves in the
mid-latitudes, Theses, Université Paris-Saclay, <a href="https://hal.science/tel-04632646" target="_blank"/> (last access: 17 May 2026), 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>O'Gorman and Schneider(2009)</label><mixed-citation>
      
O'Gorman, P. A. and Schneider, T.: Scaling of Precipitation Extremes over a
Wide Range of Climates Simulated with an Idealized GCM, J. Climate, 22, 5676–5685, <a href="https://doi.org/10.1175/2009JCLI2701.1" target="_blank">https://doi.org/10.1175/2009JCLI2701.1</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Panetta(1993)</label><mixed-citation>
      
Panetta, R. L.: Zonal Jets in Wide Baroclinically Unstable Regions: Persistence and Scale Selection, J. Atmos. Sci., 50, 2073–2106,
<a href="https://doi.org/10.1175/1520-0469(1993)050&lt;2073:ZJIWBU&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0469(1993)050&lt;2073:ZJIWBU&gt;2.0.CO;2</a>, 1993.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Pavliotis(2014)</label><mixed-citation>
      
Pavliotis, G. A.: Stochastic processes and applications: diffusion processes,
the Fokker-Planck and Langevin equations, in: vol. 60, Springer, <a href="https://doi.org/10.1007/978-1-4939-1323-7" target="_blank">https://doi.org/10.1007/978-1-4939-1323-7</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Penland and Magorian(1993)</label><mixed-citation>
      
Penland, C. and Magorian, T.: Prediction of Niño 3 Sea Surface Temperatures Using Linear Inverse Modeling, J. Climate, 6, 1067–1076,
<a href="https://doi.org/10.1175/1520-0442(1993)006&lt;1067:PONSST&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0442(1993)006&lt;1067:PONSST&gt;2.0.CO;2</a>, 1993.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Phillips(1956)</label><mixed-citation>
      
Phillips, N. A.: The general circulation of the atmosphere: A numerical
experiment, Q. J. Roy. Meteorol. Soc., 82, 123–164, <a href="https://doi.org/10.1002/qj.49708235202" target="_blank">https://doi.org/10.1002/qj.49708235202</a>, 1956.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Pickering et al.(2022)Pickering, Guth, Karniadakis, and
Sapsis</label><mixed-citation>
      
Pickering, E., Guth, S., Karniadakis, G. E., and Sapsis, T. P.: Discovering and forecasting extreme events via active learning in neural operators, Na.
Comput. Sci., 2, 823–833, <a href="https://doi.org/10.1038/s43588-022-00376-0" target="_blank">https://doi.org/10.1038/s43588-022-00376-0</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Pons et al.(2024)Pons, Yiou, Jzquel, and
Messori</label><mixed-citation>
      
Pons, F. M. E., Yiou, P., Jézéquel, A., and Messori, G.: Simulating the Western North America heatwave of 2021 with analogue importance sampling, Weather Clim. Ext., 43, 100651, <a href="https://doi.org/10.1016/j.wace.2024.100651" target="_blank">https://doi.org/10.1016/j.wace.2024.100651</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Pumir et al.(1991)Pumir, Shraiman, and Siggia</label><mixed-citation>
      
Pumir, A., Shraiman, B. I., and Siggia, E. D.: Exponential tails and random
advection, Phys. Rev. Lett., 66, 2984–2987, <a href="https://doi.org/10.1103/PhysRevLett.66.2984" target="_blank">https://doi.org/10.1103/PhysRevLett.66.2984</a>, 1991.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Qi and Majda(2016)</label><mixed-citation>
      
Qi, D. and Majda, A. J.: Predicting fat-tailed intermittent probability
distributions in passive scalar turbulence with imperfect models through
empirical information theory, Commun. Math. Sci., 14, 1687–1722, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Qi and Majda(2018)</label><mixed-citation>
      
Qi, D. and Majda, A. J.: Predicting extreme events for passive scalar
turbulence in two-layer baroclinic flows through reduced-order stochastic
models, Commun. Math. Sci., 16, 17–51, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Rackauckas(2023)</label><mixed-citation>
      
Rackauckas, C.: QuasiMonteCarlo.jl, GitHub [code], <a href="https://github.com/SciML/QuasiMonteCarlo.jl" target="_blank"/> (last access: 9 May 2025), 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Ragone and Bouchet(2021)</label><mixed-citation>
      
Ragone, F. and Bouchet, F.: Rare Event Algorithm Study of Extreme Warm Summers and Heatwaves Over Europe, Geophys. Res. Lett., 48, e2020GL091197,
<a href="https://doi.org/10.1029/2020GL091197" target="_blank">https://doi.org/10.1029/2020GL091197</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>Ragone et al.(2018)Ragone, Wouters, and
Bouchet</label><mixed-citation>
      
Ragone, F., Wouters, J., and Bouchet, F.: Computation of extreme heat waves in climate models using a large deviation algorithm, P. Natl. Acad. Sci. USA, 115, 24–29, <a href="https://doi.org/10.1073/pnas.1712645115" target="_blank">https://doi.org/10.1073/pnas.1712645115</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Rampal et al.(2025)Rampal, Gibson, Sherwood, Abramowitz, and
Hobeichi</label><mixed-citation>
      
Rampal, N., Gibson, P. B., Sherwood, S., Abramowitz, G., and Hobeichi, S.: A
Reliable Generative Adversarial Network Approach for Climate Downscaling and
Weather Generation, J. Adv. Model. Earth Syst., 17, e2024MS004668, <a href="https://doi.org/10.1029/2024MS004668" target="_blank">https://doi.org/10.1029/2024MS004668</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>Rolland(2022)</label><mixed-citation>
      
Rolland, J.: Collapse of transitional wall turbulence captured using a rare
events algorithm, J. Fluid Mech., 931, A22, <a href="https://doi.org/10.1017/jfm.2021.957" target="_blank">https://doi.org/10.1017/jfm.2021.957</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Saha and Ravela(2024)</label><mixed-citation>
      
Saha, A. and Ravela, S.: Statistical-Physical Adversarial Learning From Data
and Models for Downscaling Rainfall Extremes, J. Adv. Model. Earth Syst., 16, e2023MS003860, <a href="https://doi.org/10.1029/2023MS003860" target="_blank">https://doi.org/10.1029/2023MS003860</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>Sapsis(2020)</label><mixed-citation>
      
Sapsis, T. P.: Output-weighted optimal sampling for Bayesian regression and
rare event statistics using few samples, P. Roy. Soc. A:, 476, 20190834,
<a href="https://doi.org/10.1098/rspa.2019.0834" target="_blank">https://doi.org/10.1098/rspa.2019.0834</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>Sundar et al.(2024)Sundar, Parashar, Blanchard, and
Dodov</label><mixed-citation>
      
Sundar, R., Parashar, N., Blanchard, A., and Dodov, B.: TAUDiff: Improving
statistical downscaling for extreme weather events using generative diffusion
models, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.2412.13627" target="_blank">https://doi.org/10.48550/arXiv.2412.13627</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>Tebaldi et al.(2020)Tebaldi, Armbruster, Engler, and
Link</label><mixed-citation>
      
Tebaldi, C., Armbruster, A., Engler, H. P., and Link, R.: Emulating climate
extreme indices, Environ. Res. Lett., 15, 074006, <a href="https://doi.org/10.1088/1748-9326/ab8332" target="_blank">https://doi.org/10.1088/1748-9326/ab8332</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>Thompson(2010)</label><mixed-citation>
      
Thompson, A. F.: Jet Formation and Evolution in Baroclinic Turbulence with
Simple Topography, J. Phys. Oceanogr., 40, 257–278, <a href="https://doi.org/10.1175/2009JPO4218.1" target="_blank">https://doi.org/10.1175/2009JPO4218.1</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>Thompson et al.(2017)Thompson, Dunstone, Scaife, Smith, Slingo,
Brown, and Belcher</label><mixed-citation>
      
Thompson, V., Dunstone, N. J., Scaife, A. A., Smith, D. M., Slingo, J. M.,
Brown, S., and Belcher, S. E.: High risk of unprecedented UK rainfall in the
current climate, Nat. Commun., 8, 107, <a href="https://doi.org/10.1038/s41467-017-00275-3" target="_blank">https://doi.org/10.1038/s41467-017-00275-3</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>Tong et al.(2021)Tong, Vanden-Eijnden, and Stadler</label><mixed-citation>
      
Tong, S., Vanden-Eijnden, E., and Stadler, G.: Extreme event probability
estimation using PDE-constrained optimization and large deviation theory,
with application to tsunamis, Commun. Appl. Math. Comput. Sci., 16, 181–225, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>Vandal et al.(2017)Vandal, Kodra, Ganguly, Michaelis, Nemani, and
Ganguly</label><mixed-citation>
      
Vandal, T., Kodra, E., Ganguly, S., Michaelis, A., Nemani, R., and Ganguly,
A. R.: DeepSD: Generating High Resolution Climate Change Projections through
Single Image Super-Resolution, in: Proceedings of the 23rd ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining, KDD '17,
Association for Computing Machinery, New York, NY, USA, 1663–1672, ISBN 9781450348874, <a href="https://doi.org/10.1145/3097983.3098004" target="_blank">https://doi.org/10.1145/3097983.3098004</a>, 2017.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>van den Dool(1989)</label><mixed-citation>
      
van den Dool, H. M.: A New Look at Weather Forecasting through Analogues,
Mon. Weather Rev., 117, 2230–2247, <a href="https://doi.org/10.1175/1520-0493(1989)117&lt;2230:ANLAWF&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0493(1989)117&lt;2230:ANLAWF&gt;2.0.CO;2</a>, 1989.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>van Kekem and Sterk(2018)</label><mixed-citation>
      
van Kekem, D. L. and Sterk, A. E.: Wave propagation in the Lorenz-96 model,
Nonlin. Processes Geophys., 25, 301–314, <a href="https://doi.org/10.5194/npg-25-301-2018" target="_blank">https://doi.org/10.5194/npg-25-301-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib76"><label>Vonich and Hakim(2024)</label><mixed-citation>
      
Vonich, P. T. and Hakim, G. J.: Predictability Limit of the 2021 Pacific
Northwest Heatwave From Deep-Learning Sensitivity Analysis, Geophys. Res. Lett., 51, e2024GL110651, <a href="https://doi.org/10.1029/2024GL110651" target="_blank">https://doi.org/10.1029/2024GL110651</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib77"><label>Wang et al.(2020)Wang, Mu, and Sun</label><mixed-citation>
      
Wang, Q., Mu, M., and Sun, G.: A useful approach to sensitivity and
predictability studies in geophysical fluid dynamics: conditional non-linear
optimal perturbation, Natl. Sci. Rev., 7, 214–223, <a href="https://doi.org/10.1093/nsr/nwz039" target="_blank">https://doi.org/10.1093/nsr/nwz039</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib78"><label>Watt and Mansfield(2024)</label><mixed-citation>
      
Watt, R. A. and Mansfield, L. A.: Generative Diffusion-based Downscaling for
Climate, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.2404.17752" target="_blank">https://doi.org/10.48550/arXiv.2404.17752</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib79"><label>Webber et al.(2019)Webber, Plotkin, ONeill, Abbot, and
Weare</label><mixed-citation>
      
Webber, R. J., Plotkin, D. A., O'Neill, M. E., Abbot, D. S., and Weare, J.:
Practical rare event sampling for extreme mesoscale weather, Chaos, 29, 053109, <a href="https://doi.org/10.1063/1.5081461" target="_blank">https://doi.org/10.1063/1.5081461</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib80"><label>Whittaker and Luca(2025)</label><mixed-citation>
      
Whittaker, T. and Luca, A. D.: Constructing Extreme Heatwave Storylines with
Differentiable Climate Models, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.2506.10660" target="_blank">https://doi.org/10.48550/arXiv.2506.10660</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib81"><label>Yang et al.(2022)Yang, Blanchard, Sapsis, and
Perdikaris</label><mixed-citation>
      
Yang, Y., Blanchard, A., Sapsis, T., and Perdikaris, P.: Output-weighted
sampling for multi-armed bandits with extreme payoffs, P. Roy. Soc. A, 478,
20210781, <a href="https://doi.org/10.1098/rspa.2021.0781" target="_blank">https://doi.org/10.1098/rspa.2021.0781</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib82"><label>Yiou and Jézéquel(2020)</label><mixed-citation>
      
Yiou, P. and Jézéquel, A.: Simulation of extreme heat waves with empirical importance sampling, Geosci. Model Dev., 13, 763–781,
<a href="https://doi.org/10.5194/gmd-13-763-2020" target="_blank">https://doi.org/10.5194/gmd-13-763-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib83"><label>Zuckerman and Chong(2017)</label><mixed-citation>
      
Zuckerman, D. M. and Chong, L. T.: Weighted Ensemble Simulation: Review of
Methodology, Applications, and Software, Annu. Rev. Biophys., 46, 43–57, <a href="https://doi.org/10.1146/annurev-biophys-070816-033834" target="_blank">https://doi.org/10.1146/annurev-biophys-070816-033834</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib84"><label>Zuev(2015)</label><mixed-citation>
      
Zuev, K.: Subset Simulation Method for Rare Event Estimation: An Introduction, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.1505.03506" target="_blank">https://doi.org/10.48550/arXiv.1505.03506</a>, 2015.

    </mixed-citation></ref-html>--></article>
