<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article"><?xmltex \bartext{Research article}?><?xmltex \hack{\allowdisplaybreaks}?>
  <front>
    <journal-meta><journal-id journal-id-type="publisher">NPG</journal-id><journal-title-group>
    <journal-title>Nonlinear Processes in Geophysics</journal-title>
    <abbrev-journal-title abbrev-type="publisher">NPG</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Nonlin. Processes Geophys.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1607-7946</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/npg-30-129-2023</article-id><title-group><article-title>Data-driven reconstruction of partially observed dynamical systems</article-title><alt-title>Data-driven reconstruction of partially observed dynamical systems</alt-title>
      </title-group><?xmltex \runningtitle{Data-driven reconstruction of partially observed dynamical systems}?><?xmltex \runningauthor{P. Tandeo et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1 aff2 aff3">
          <name><surname>Tandeo</surname><given-names>Pierre</given-names></name>
          <email>pierre.tandeo@imt-atlantique.fr</email>
        <ext-link>https://orcid.org/0000-0003-1647-8239</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff4">
          <name><surname>Ailliot</surname><given-names>Pierre</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff5 aff2">
          <name><surname>Sévellec</surname><given-names>Florian</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>IMT Atlantique, Lab-STICC, UMR CNRS 6285, 29238, Brest, France</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Odyssey, Inria/IMT/CNRS, Rennes, France</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>RIKEN Center for Computational Science, Kobe, 650-0047, Japan</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>Laboratoire de Mathematiques de Bretagne Atlantique, Univ Brest, UMR CNRS 6205, Brest, France</institution>
        </aff>
        <aff id="aff5"><label>5</label><institution>Laboratoire d’Océanographie Physique et Spatiale, Univ Brest CNRS IRD Ifremer, Brest, France</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Pierre Tandeo (pierre.tandeo@imt-atlantique.fr)</corresp></author-notes><pub-date><day>9</day><month>June</month><year>2023</year></pub-date>
      
      <volume>30</volume>
      <issue>2</issue>
      <fpage>129</fpage><lpage>137</lpage>
      <history>
        <date date-type="received"><day>22</day><month>November</month><year>2022</year></date>
           <date date-type="rev-request"><day>29</day><month>November</month><year>2022</year></date>
           <date date-type="rev-recd"><day>14</day><month>April</month><year>2023</year></date>
           <date date-type="accepted"><day>2</day><month>May</month><year>2023</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2023 Pierre Tandeo et al.</copyright-statement>
        <copyright-year>2023</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://npg.copernicus.org/articles/30/129/2023/npg-30-129-2023.html">This article is available from https://npg.copernicus.org/articles/30/129/2023/npg-30-129-2023.html</self-uri><self-uri xlink:href="https://npg.copernicus.org/articles/30/129/2023/npg-30-129-2023.pdf">The full text article is available as a PDF file from https://npg.copernicus.org/articles/30/129/2023/npg-30-129-2023.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d1e129">The state of the atmosphere, or of the ocean, cannot be exhaustively observed. Crucial parts might remain out of reach of proper monitoring. Also, defining the exact set of equations driving the atmosphere and ocean is virtually impossible because of their complexity. The goal of this paper is to obtain predictions of a partially observed dynamical system without knowing the model equations. In this data-driven context, the article focuses on the Lorenz-63 system, where only the second and third components are observed and access to the equations is not allowed. To account for those strong constraints, a combination of machine learning and data assimilation techniques is proposed. The key aspects are the following: the introduction of latent variables, a linear approximation of the dynamics and a database that is updated iteratively, maximizing the likelihood. We find that the latent variables inferred by the procedure are related to the successive derivatives of the observed components of the dynamical system. The method is also able to reconstruct accurately the local dynamics of the partially observed system. Overall, the proposed methodology is simple, is easy to code and gives promising results, even in the case of small numbers of observations.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e141">In geophysics, even if one has perfect knowledge of the studied dynamical system, it remains difficult to predict because of the existence of nonlinear processes <xref ref-type="bibr" rid="bib1.bibx13" id="paren.1"/>. Beyond this important difficulty, achieving this perfect knowledge of the system is often impossible. Consequently, the governing differential equations are often not known in full because of their complexity, in particular regarding scale interactions (e.g., turbulent closures are often assumed rather than “known” per se). On top of these two major difficulties, the state of the system is not and cannot be exhaustively observed. Potentially crucial components are and might remain partly or fully out of reach of proper monitoring (e.g., deep ocean or small-scale features). Predicting a partially observed and partially known system is therefore a key issue in current geophysics and in particular for ocean, climate and atmospheric sciences.</p>
      <p id="d1e147">A typical example of such a framework is the use of climate indices (e.g., global mean temperature, Niño 3.4 index, North Atlantic Oscillation index) and the study of their links and their dynamics. In this context, the direct relationship between those indices is unknown, even if their more indirect and complex relations exist, through full knowledge of the climate dynamics. Also, it is highly possible that climate indices are dependent on components of the climate that are not currently considered key indices and so are not fully monitored. However, these key indices could be sufficient to describe the most important aspect of climate, leading to accurate and reliable predictions and enabling cost-effective adaptation and mitigation.</p>
      <?pagebreak page130?><p id="d1e150">Hence, an alternative to physics-based models is to use available observations of the system and statistical approaches to discover equations and then make predictions. This has been introduced in several papers using combinations and polynomials of observed variables as well as sparse regressions or model selection strategies <xref ref-type="bibr" rid="bib1.bibx4 bib1.bibx20 bib1.bibx14" id="paren.2"/>. Those methods have then been extended to the case of noisy and irregular observation sampling, using a Bayesian framework as in data assimilation <xref ref-type="bibr" rid="bib1.bibx1 bib1.bibx15" id="paren.3"/>. Alternatively, some authors used data assimilation and local linear regressions based on analogs <xref ref-type="bibr" rid="bib1.bibx26 bib1.bibx12" id="paren.4"/> or iterative data assimilation coupled with neural networks <xref ref-type="bibr" rid="bib1.bibx2 bib1.bibx8 bib1.bibx3" id="paren.5"/> to make data-driven predictions without discovering equations.</p>
      <p id="d1e165">However, many approaches cited above assume that the full state of the system is observed, which is a strong assumption. Indeed, in a lot of applications in geophysics, important components of the system are never or only partially observed, such as the deep ocean (see, e.g., <xref ref-type="bibr" rid="bib1.bibx9" id="altparen.6"/>), and data-driven methods fail to make good predictions. To deal with those strong constraints, i.e., when the model is unknown and when some components of the system are never observed, combination of data assimilation and machine learning shows potential (see, e.g., <xref ref-type="bibr" rid="bib1.bibx28" id="altparen.7"/>). Additionally, an option is to use time-delay embedding of the available components of the system <xref ref-type="bibr" rid="bib1.bibx24 bib1.bibx5" id="paren.8"/>, whereas another option is to find latent representations of the dynamical system (see, e.g., <xref ref-type="bibr" rid="bib1.bibx25 bib1.bibx16" id="altparen.9"/>). In this study, we will show that there are strong relationships between those two approaches.</p>
      <p id="d1e181">Here, we propose a simple algorithm using linear and Gaussian assumptions based on a state-space formulation. This classic Bayesian framework, used in data assimilation, is able to deal with a dynamical model (physics- or data-driven) and observations (partial and noisy). Three main ideas are used: (i) augmented state formulation <xref ref-type="bibr" rid="bib1.bibx10" id="paren.10"/>, (ii) global linear approximation of the dynamical system <xref ref-type="bibr" rid="bib1.bibx11" id="paren.11"/> and (iii) estimation of the dynamical parameters using an iterative algorithm combined with Kalman recursions <xref ref-type="bibr" rid="bib1.bibx23" id="paren.12"/>. The current paper is thus an extension of <xref ref-type="bibr" rid="bib1.bibx23" id="text.13"/> to never-observed components of a dynamical system, using a state-augmentation strategy. The proposed framework is probabilistic, where the state of the system is approximated using a Gaussian distribution (with a mean vector and a covariance matrix). The algorithm is iterative, where a catalog is updated at each iteration and used to learn a linear dynamical model. The final estimate of this catalog corresponds to a new system of variables, including latent ones.</p>
      <p id="d1e196">The proposed methodology is based on an important assumption: the surrogate model is linear. Although it can be considered a disadvantage compared to nonlinear models, this linear assumption also has interesting properties. Indeed, nonlinear models combined with state augmentation are a very broad family of models and may lead to identifiability issues. Using linear dynamics already leads to a very flexible family of models since the latent variable may describe nonlinearities and include, for example, any transformation of the observed or non-observed components of a dynamical model. Furthermore, it allows rigorous estimation of the parameters using well-established statistical algorithms which can be run at a low computational cost. The proposed methodology is evaluated on a low-dimensional and weakly nonlinear chaotic model. As this paper is a proof of concept, a linear surrogate model is certainly well suited for this situation.</p>
      <p id="d1e199">The paper is organized as follows. Firstly, the methodology is explained in Sect. <xref ref-type="sec" rid="Ch1.S2"/>. Secondly, Sect. <xref ref-type="sec" rid="Ch1.S3"/> describes the experiment using the Lorenz-63 system. Thirdly, the results are reported in Sect. <xref ref-type="sec" rid="Ch1.S4"/>. The conclusions and perspectives are given in Sect. <xref ref-type="sec" rid="Ch1.S5"/>.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methods</title>
      <p id="d1e218">The methodology proposed in this paper is borrowed from data assimilation, machine learning and dynamical systems. It is summarized in Fig. <xref ref-type="fig" rid="Ch1.F1"/> and explained below.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><?xmltex \def\figurename{Figure}?><label>Figure 1</label><caption><p id="d1e225">Schematic of the proposed methodology, illustrated using the Lorenz-63 system. The algorithm is initialized with a Gaussian random noise for the hidden component (i.e., <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) and with partial observations of the system (i.e., <inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M3" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>). Then, an iterative procedure is applied with a linear regression, a covariance computation, the Kalman recursions and a random sampling. This algorithm iteratively maximizes the likelihood of the observations denoted <inline-formula><mml:math id="M4" display="inline"><mml:mi mathvariant="script">L</mml:mi></mml:math></inline-formula>. After convergence of the algorithm, a hidden component <inline-formula><mml:math id="M5" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is stabilized and represented by a Gaussian distribution represented by the mean <inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mi mathvariant="normal">s</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and variance <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:msubsup><mml:mi>P</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mi mathvariant="normal">s</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>.</p></caption>
        <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://npg.copernicus.org/articles/30/129/2023/npg-30-129-2023-f01.png"/>

      </fig>

      <p id="d1e312">In data assimilation, the goal is to estimate, from partial and noisy observations <inline-formula><mml:math id="M8" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula>, the full state of a system <inline-formula><mml:math id="M9" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula>. When the dynamical model used to propagate <inline-formula><mml:math id="M10" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> in time is available (i.e., when model equations are given), classic data assimilation techniques are used to retrieve unobserved components of the system. For instance, in the Lorenz-63 system <xref ref-type="bibr" rid="bib1.bibx13" id="paren.14"/>, if only two variables (<inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> in the example defined below) are observed, knowing the Lorenz equations (system of three ordinary differential equations), it is possible to retrieve the unobserved one (<inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> in our example below). However, this estimation requires good estimates of model and observation error statistics (see, e.g., <xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx19" id="altparen.15"/>).</p>
      <p id="d1e377">Now, if the model equations are not known and observations of the system are available over a sufficient period of time, it is possible to use data-driven methods to mathematically approximate the system dynamics. In this paper, a linear approximation is used to model the relationship of the state vector <inline-formula><mml:math id="M14" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> between two time steps. It is parameterized with the matrix <inline-formula><mml:math id="M15" display="inline"><mml:mi mathvariant="bold">M</mml:mi></mml:math></inline-formula>, whose dimension is equal to the square of the state space. Moreover, a linear observation operator is introduced to relate the partial observations <inline-formula><mml:math id="M16" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> and the state <inline-formula><mml:math id="M17" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula>. It is written using a matrix <inline-formula><mml:math id="M18" display="inline"><mml:mi mathvariant="bold">H</mml:mi></mml:math></inline-formula>, with its dimension equal to the observation-space times the state-space dimensions. Nonlinear and adaptive operators and noisy observations could be taken into account but, for the sake of simplicity, only the linear and non-noisy case is considered in this paper.</p>
      <p id="d1e415">Mathematically, matrices (<inline-formula><mml:math id="M19" display="inline"><mml:mi mathvariant="bold">M</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M20" display="inline"><mml:mi mathvariant="bold">H</mml:mi></mml:math></inline-formula>) and vectors (<inline-formula><mml:math id="M21" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M22" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula>) are linked using a Gaussian and linear state-space model such that<?xmltex \setcounter{equation}{0}?>

              <disp-formula id="Ch1.E1" specific-use="align" content-type="subnumberedsingle"><mml:math id="M23" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E1.2"><mml:mtd><mml:mtext>1a</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mo>=</mml:mo><mml:mi mathvariant="bold">M</mml:mi><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">η</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E1.3"><mml:mtd><mml:mtext>1b</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mo>=</mml:mo><mml:mi mathvariant="bold">H</mml:mi><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M24" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> is the time index and <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">η</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are unbiased Gaussian vectors, representing the model and observation errors, respectively. Their error covariance matrices are denoted <inline-formula><mml:math id="M27" display="inline"><mml:mi mathvariant="bold">Q</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M28" display="inline"><mml:mi mathvariant="bold">R</mml:mi></mml:math></inline-formula>, respectively. Those matrices indirectly control the respective weight given to the model and to the observations. It constitutes an important tuning part of the state-space models (see <xref ref-type="bibr" rid="bib1.bibx27" id="altparen.16"/>, for a more in-depth discussion).</p>
      <p id="d1e573">In such a data-driven problem where only a part of the system is observed, a first natural step is to consider that the state <inline-formula><mml:math id="M29" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> is directly related to the observations <inline-formula><mml:math id="M30" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula>. For instance, in the example of the Lorenz-63 system introduced previously, observations correspond to the second and third components of the system (i.e., <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, formally defined later).</p>
      <p id="d1e612">In this paper, we propose introducing a hidden vector denoted <inline-formula><mml:math id="M33" display="inline"><mml:mi mathvariant="bold-italic">z</mml:mi></mml:math></inline-formula>, corresponding to one or more hidden components that are not observed. For this purpose, the state is augmented using this hidden component <inline-formula><mml:math id="M34" display="inline"><mml:mi mathvariant="bold-italic">z</mml:mi></mml:math></inline-formula>, the observation vector <inline-formula><mml:math id="M35" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> does not change, and the operator <inline-formula><mml:math id="M36" display="inline"><mml:mi mathvariant="bold">H</mml:mi></mml:math></inline-formula> is a truncated identity matrix. The use of augmented state space is classic in data assimilation and mostly refers to the estimation of unknown parameters of the dynamical model (see <xref ref-type="bibr" rid="bib1.bibx21" id="altparen.17"/>, for further details).</p>
      <p id="d1e646">The hidden vector <inline-formula><mml:math id="M37" display="inline"><mml:mi mathvariant="bold-italic">z</mml:mi></mml:math></inline-formula> is now accounted in the linear model <inline-formula><mml:math id="M38" display="inline"><mml:mi mathvariant="bold">M</mml:mi></mml:math></inline-formula> given in Eq. (<xref ref-type="disp-formula" rid="Ch1.E1.2"/>) whose dimension has increased. The hidden components are completely unknown and thus randomly initialized using Gaussian white noises and are parameterized by <inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, their level of variance. The next step is to infer <inline-formula><mml:math id="M40" display="inline"><mml:mi mathvariant="bold-italic">z</mml:mi></mml:math></inline-formula> using a statistical estimation method. Starting from the random initialization, an iterative procedure is proposed based on the maximization of the likelihood.</p>
      <p id="d1e683">The proposed approach is based on a linear and Gaussian state-space model given in Eq. (1) and thus uses the classic Kalman filter and smoother equations. The Kalman filter (forward in time) is used to get the information of the likelihood, whereas the Kalman smoother (forward and backward in time) is used to get the best estimate of the state. The proposed approach is inspired by the expectation-maximization algorithm (denoted EM; see <xref ref-type="bibr" rid="bib1.bibx23" id="altparen.18"/>) and is able to iteratively estimate the matrices <inline-formula><mml:math id="M41" display="inline"><mml:mi mathvariant="bold">M</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M42" display="inline"><mml:mi mathvariant="bold">Q</mml:mi></mml:math></inline-formula>. In this paper, <inline-formula><mml:math id="M43" display="inline"><mml:mi mathvariant="bold">R</mml:mi></mml:math></inline-formula> is assumed to be known and negligible. The criterion used to update those matrices is based on the innovations defined by the difference between the observations <inline-formula><mml:math id="M44" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> and the forecast of the model <inline-formula><mml:math id="M45" display="inline"><mml:mi mathvariant="bold">M</mml:mi></mml:math></inline-formula>, denoted <inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>. The likelihood of the innovations, denoted <inline-formula><mml:math id="M47" display="inline"><mml:mi mathvariant="script">L</mml:mi></mml:math></inline-formula>, is computed using <inline-formula><mml:math id="M48" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> time steps such that
          <disp-formula id="Ch1.E4" content-type="numbered"><label>2</label><mml:math id="M49" display="block"><mml:mrow><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="script">L</mml:mi></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>≜</mml:mo><mml:mi>p</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>T</mml:mi></mml:msub><mml:mo>|</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msubsup><mml:mi mathvariant="bold">x</mml:mi><mml:mi>T</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>∝</mml:mo><mml:munderover><mml:mo movablelimits="false">∏</mml:mo><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>T</mml:mi></mml:munderover><mml:mi>exp⁡</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mo>-</mml:mo><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msubsup><mml:mi mathvariant="bold">Hx</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>⊤</mml:mo></mml:msup><mml:msubsup><mml:mi mathvariant="bold">Σ</mml:mi><mml:mi>t</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msubsup><mml:mi mathvariant="bold">Hx</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr></mml:mtable><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
        where <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold">Σ</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="bold">HP</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:msup><mml:mi mathvariant="bold">H</mml:mi><mml:mo>⊤</mml:mo></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="bold">R</mml:mi></mml:mrow></mml:math></inline-formula>, with <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold">P</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="bold">MP</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>a</mml:mi></mml:msubsup><mml:msup><mml:mi mathvariant="bold">M</mml:mi><mml:mo>⊤</mml:mo></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="bold">Q</mml:mi></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold">P</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>a</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> corresponding to the state covariance estimated by the Kalman filter at time <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>. The innovation likelihood given in Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>) is interesting because it corresponds to the squared distance between the observations and the forecast normalized by their uncertainties, represented by the covariance <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold">Σ</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
      <?pagebreak page132?><p id="d1e989">At each iteration of the augmented Kalman procedure, the estimate of the matrix <inline-formula><mml:math id="M55" display="inline"><mml:mi mathvariant="bold">M</mml:mi></mml:math></inline-formula> is given by the least-square estimator, using a linear regression such that
          <disp-formula id="Ch1.E5" content-type="numbered"><label>3</label><mml:math id="M56" display="block"><mml:mrow><mml:msup><mml:mi mathvariant="bold">M</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow><mml:mi>T</mml:mi></mml:munderover><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:msup><mml:mo>)</mml:mo><mml:mo>⊤</mml:mo></mml:msup></mml:mrow></mml:mfenced><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>t</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:msup><mml:mo>)</mml:mo><mml:mo>⊤</mml:mo></mml:msup></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
        where <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> corresponds to the output catalog of the previous iteration (the result of a Kalman smoothing and a Gaussian sampling, explained in more detail below). Following Eq. (<xref ref-type="disp-formula" rid="Ch1.E1.2"/>), the covariance <inline-formula><mml:math id="M58" display="inline"><mml:mi mathvariant="bold">Q</mml:mi></mml:math></inline-formula> is estimated empirically using the estimate of <inline-formula><mml:math id="M59" display="inline"><mml:mi mathvariant="bold">M</mml:mi></mml:math></inline-formula> given in Eq. (<xref ref-type="disp-formula" rid="Ch1.E5"/>), such that
          <disp-formula id="Ch1.E6" content-type="numbered"><label>4</label><mml:math id="M60" display="block"><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{9.5}{9.5}\selectfont$\displaystyle}?><mml:msup><mml:mi mathvariant="bold">Q</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow><mml:mi>T</mml:mi></mml:munderover><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>t</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold">M</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>t</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold">M</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>⊤</mml:mo></mml:msup></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo><?xmltex \hack{$\egroup}?></mml:mrow></mml:math></disp-formula></p>
      <p id="d1e1329">Then, a Kalman smoother is applied using the <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold">M</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold">Q</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> matrices estimated in Eqs. (<xref ref-type="disp-formula" rid="Ch1.E5"/>) and (<xref ref-type="disp-formula" rid="Ch1.E6"/>). At each time <inline-formula><mml:math id="M63" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, it results in a Gaussian mean vector <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and a covariance matrix <inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold">P</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>. As input of the next iteration of the algorithm, the catalog <inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold">x</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> is updated using a Gaussian random sampling using <inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold">x</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold">P</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> at each time <inline-formula><mml:math id="M69" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>. This random sampling is used to exploit the linear correlations between the components of the state vector that appear in the nondiagonal terms of <inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold">P</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>. The random sampling is also used to avoid being trapped in a local maximum, as in stochastic EM procedures <xref ref-type="bibr" rid="bib1.bibx6" id="paren.19"/>.</p>
      <p id="d1e1466">The likelihood calculated at each iteration of the procedure increases until convergence. The algorithm is stopped when the likelihood difference between two iterations becomes small. The solutions of the proposed method are the last Gaussian mean vectors <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and covariance matrices <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold">P</mml:mi><mml:mi>t</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> calculated at each time <inline-formula><mml:math id="M73" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>. The component corresponding to the latent component <inline-formula><mml:math id="M74" display="inline"><mml:mi mathvariant="bold-italic">z</mml:mi></mml:math></inline-formula> is finally retrieved with information on its uncertainty.</p>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Experiment and evaluation metrics</title>
      <p id="d1e1517">The methodology is tested on the Lorenz-63 system <xref ref-type="bibr" rid="bib1.bibx13" id="paren.20"/>. This three-dimensional dynamical system models the evolution of the convection (<inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) as a function of horizontal (<inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) and vertical temperature gradients (<inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>). The evolution of the system is governed by three ordinary differential equations, i.e.,<?xmltex \setcounter{equation}{4}?>

              <disp-formula id="Ch1.E7" specific-use="align" content-type="subnumberedsingle"><mml:math id="M78" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E7.8"><mml:mtd><mml:mtext>5a</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">˙</mml:mo></mml:mover></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E7.9"><mml:mtd><mml:mtext>5b</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">˙</mml:mo></mml:mover></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mo>=</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mn mathvariant="normal">28</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E7.10"><mml:mtd><mml:mtext>5c</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">˙</mml:mo></mml:mover></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mo>=</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">8</mml:mn><mml:mn mathvariant="normal">3</mml:mn></mml:mfrac></mml:mstyle><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula></p>
      <p id="d1e1699">Runge–Kutta 4-5 is used to integrate the Lorenz-63 equations to generate <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. In this paper, it is assumed that <inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is never observed: only <inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> are observed on 10 model time units of the Lorenz-63 system every <inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:mi mathvariant="normal">d</mml:mi><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula> time steps (Fig. <xref ref-type="fig" rid="Ch1.F2"/>a). The observation vector is thus <inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>. In what follows, only those data are available, not the set of Eq. (5).</p>
      <p id="d1e1811">The methodology is applied to the Lorenz-63 system, adding sequentially a new hidden component in the state of the system as follows. At the beginning, the state is augmented such that <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is randomly initialized with a white noise, with variance <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula>. The observations are stored in the vector <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>. The observation operator is thus the <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> matrix <inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:mi mathvariant="bold">H</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>|</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>. After <inline-formula><mml:math id="M93" display="inline"><mml:mn mathvariant="normal">30</mml:mn></mml:math></inline-formula> iterations of the algorithm presented in Sect. <xref ref-type="sec" rid="Ch1.S2"/>, the hidden component <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> has converged. After that, a new white noise <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is used to augment the state such that <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>, the vector <inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> remains the same, and the iterative algorithm is applied until stabilization of <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. As long as the stabilized likelihood continues to increase with the addition of a hidden component, this state-augmentation procedure is repeated.</p>
      <p id="d1e2058">Note that several hidden components can be added all at once, with a similar performance to the sequential procedure described above (results not shown). In this all-at-once case, the interpretation of the retrieved components is not as informative, and thus we decided to retain the sequential case. Note also that the methodology has been tested with larger <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:mi mathvariant="normal">d</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:math></inline-formula> (i.e., <inline-formula><mml:math id="M100" display="inline"><mml:mn mathvariant="normal">0.01</mml:mn></mml:math></inline-formula> and <inline-formula><mml:math id="M101" display="inline"><mml:mn mathvariant="normal">0.1</mml:mn></mml:math></inline-formula>). The conclusion is that, by increasing the time delay between observations, it significantly increases the number of latent variables (results not shown). Finally, the assimilation window length corresponds to <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">4</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> time steps. By reducing this length (e.g., to <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> or <inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>), the conclusions remain the same as for <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:mi mathvariant="normal">d</mml:mi><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula>.</p>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Results</title>
      <p id="d1e2153">Using the experiment presented in Sect. <xref ref-type="sec" rid="Ch1.S3"/>, three hidden components <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> were sequentially added. They are reported in Fig. <xref ref-type="fig" rid="Ch1.F2"/> with the true Lorenz components <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. Although they do not fit the hidden variable <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> of the Lorenz system, the first two hidden components <inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> show time variations. By contrast, <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> remains close to 0, with a large confidence interval. This suggests that our method has identified that two hidden variables are enough to retrieve the dynamics of the two observed variables. This result is consistent with the effective dimension of the Lorenz-63 system, which is between two and three. Here, as the estimated dynamical model <inline-formula><mml:math id="M117" display="inline"><mml:mi mathvariant="bold">M</mml:mi></mml:math></inline-formula> is a linear approximation, the dimension of the augmented state and the observed components is higher than the effective one.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><?xmltex \currentcnt{2}?><?xmltex \def\figurename{Figure}?><label>Figure 2</label><caption><p id="d1e2281">True components of the Lorenz-63 model <bold>(a)</bold> and hidden components estimated using the iterative and augmented Kalman procedure <bold>(b)</bold>. The shaded colors correspond to the 95 % Gaussian confidence intervals.</p></caption>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://npg.copernicus.org/articles/30/129/2023/npg-30-129-2023-f02.png"/>

      </fig>

      <p id="d1e2296">This is confirmed by the evaluation of the likelihood of the observations <inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> with different linear models, obtained with or without the use of hidden components <inline-formula><mml:math id="M120" display="inline"><mml:mi mathvariant="bold-italic">z</mml:mi></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="Ch1.F3"/>). This likelihood is useful for diagnosing the optimal number of dimensions needed to emulate the dynamics of the observed components. As the proposed method is stochastic, 50 independent realizations of the likelihood are shown for each experiment. The 50 realizations vary from the random values<?pagebreak page133?> given to the added hidden variable at the beginning of the iterative procedure. In the naive case where the state of the system is <inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> (black dashed line), the likelihood is small. Then, adding successively <inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> (green lines) and <inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> (red lines), after 30 iterations of the proposed algorithm, the likelihood significantly increases. Finally, due to a significant increase in the forecast covariance <inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold">P</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> in Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>), the inclusion of <inline-formula><mml:math id="M125" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> reduces the likelihood (purple lines). This suggests that a third variable is not needed and is even detrimental to the skill of the reconstruction. Those results indicate that the best linear model for predicting the variations of the observations <inline-formula><mml:math id="M126" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is the one using two hidden components. Thus, for the rest of the paper, the focus is placed on the model with the following augmented state: <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d1e2463">The question is now the following: what is the significance of those hidden components <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> estimated using the proposed methodology? Are they correlated with the unobserved component <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> or with the observed ones <inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>? Are they somehow proxies of the unobserved component? Using symbolic regression (i.e., using basic mathematical transformations of <inline-formula><mml:math id="M134" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> as regressors to explain <inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>), it has been found that the hidden components <inline-formula><mml:math id="M138" display="inline"><mml:mi mathvariant="bold-italic">z</mml:mi></mml:math></inline-formula> correspond to linear combinations of the derivatives of the observations such that<?xmltex \setcounter{equation}{5}?>

              <disp-formula id="Ch1.E11" specific-use="align" content-type="subnumberedsingle"><mml:math id="M139" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E11.12"><mml:mtd><mml:mtext>6a</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mo>=</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">˙</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">˙</mml:mo></mml:mover><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E11.13"><mml:mtd><mml:mtext>6b</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mo>=</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">˙</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">˙</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">˙</mml:mo></mml:mover><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          When developing Eq. (<xref ref-type="disp-formula" rid="Ch1.E11.13"/>) using Eq. (<xref ref-type="disp-formula" rid="Ch1.E11.12"/>), the second hidden component is written as <inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">˙</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">˙</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">¨</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow><mml:mo mathvariant="normal">¨</mml:mo></mml:mover></mml:mrow></mml:math></inline-formula>. It shows that <inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> uses the first derivative of <inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M143" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, whereas <inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> uses the second derivatives. This result makes the link with the Taylor and Takens theorem, which shows that an unobserved component (i.e., <inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) can be replaced by the observed components (i.e., <inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) at different time lags. Note that, due to the stochastic behavior of the algorithm, the <inline-formula><mml:math id="M148" display="inline"><mml:mi>a</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M149" display="inline"><mml:mi>b</mml:mi></mml:math></inline-formula> coefficients are not fixed, and several combinations of them can reach the same performance in terms of likelihood. This is illustrated in Fig. <xref ref-type="fig" rid="Ch1.F3"/>a, with 50 independent realizations of the proposed algorithm. When considering only <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> (green lines), the algorithm converges to various solutions but is mainly restricted around two solutions (corresponding to a minimum and a maximum of likelihood). As shown in Fig. <xref ref-type="fig" rid="Ch1.F3"/>b, the minimum likelihood corresponds to <inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> and the maximum likelihood corresponds to <inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>. Thus, the likelihood of <inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:msub><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">˙</mml:mo></mml:mover><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is higher than <inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:msub><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">˙</mml:mo></mml:mover><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. This suggests that <inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">˙</mml:mo></mml:mover><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is more important than <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">˙</mml:mo></mml:mover><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> in explaining the variations of the Lorenz system (this is consistent with the investigation of <xref ref-type="bibr" rid="bib1.bibx22" id="altparen.21"/>, in a modified version of the Lorenz-63 model). Interestingly, the scatter plot between <inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> shows a circular relationship. This is also the case for  <inline-formula><mml:math id="M159" display="inline"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> (results not shown). Then, in Fig. <xref ref-type="fig" rid="Ch1.F3"/>a, when considering <inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> (red lines), the 50 independent realizations reach the same likelihood after 30 iterations. This means that if <inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> when considering only <inline-formula><mml:math id="M164" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, then <inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> when introducing <inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. In terms of forecast performance, this is similar to <inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, because the likelihoods converge to the same value (red lines after 30 iterations).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><?xmltex \def\figurename{Figure}?><label>Figure 3</label><caption><p id="d1e3162">Likelihoods as a function of the iteration of the augmented Kalman procedure <bold>(a)</bold> and estimation of the <inline-formula><mml:math id="M169" display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M170" display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> parameters <bold>(b)</bold>. Different dynamical models are considered, from none to three hidden components in <inline-formula><mml:math id="M171" display="inline"><mml:mi mathvariant="bold-italic">z</mml:mi></mml:math></inline-formula>, whereas only <inline-formula><mml:math id="M172" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M173" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> are observed in the Lorenz-63 model. The likelihoods of 50 independent realizations of the iterative and augmented Kalman procedure are shown.</p></caption>
        <?xmltex \igopts{width=455.244094pt}?><graphic xlink:href="https://npg.copernicus.org/articles/30/129/2023/npg-30-129-2023-f03.png"/>

      </fig>

      <?pagebreak page134?><p id="d1e3229">To compare the performance of the naive linear model <inline-formula><mml:math id="M174" display="inline"><mml:mi mathvariant="bold">M</mml:mi></mml:math></inline-formula> with <inline-formula><mml:math id="M175" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> and the ones with <inline-formula><mml:math id="M176" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> or <inline-formula><mml:math id="M177" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>, their forecasts are evaluated. After applying the proposed algorithm, the <inline-formula><mml:math id="M178" display="inline"><mml:mover accent="true"><mml:mi mathvariant="bold">M</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> and <inline-formula><mml:math id="M179" display="inline"><mml:mover accent="true"><mml:mi mathvariant="bold">Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> estimated matrices are used to derive probabilistic forecast, starting from the last available observation <inline-formula><mml:math id="M180" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, using<?xmltex \setcounter{equation}{6}?>

              <disp-formula id="Ch1.E14" specific-use="align" content-type="subnumberedsingle"><mml:math id="M181" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E14.15"><mml:mtd><mml:mtext>7a</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi>E</mml:mi><mml:mo>[</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mo>=</mml:mo><mml:mover accent="true"><mml:mi mathvariant="bold">M</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>E</mml:mi><mml:mo>[</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>]</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E14.16"><mml:mtd><mml:mtext>7b</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi mathvariant="normal">Cov</mml:mi><mml:mo>[</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mo>=</mml:mo><mml:mover accent="true"><mml:mi mathvariant="bold">M</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mi mathvariant="normal">Cov</mml:mi><mml:mo>[</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>]</mml:mo><mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold">M</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mi>T</mml:mi></mml:msup><mml:mo>+</mml:mo><mml:mover accent="true"><mml:mi mathvariant="bold">Q</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          with <inline-formula><mml:math id="M182" display="inline"><mml:mi>E</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M183" display="inline"><mml:mi mathvariant="normal">Cov</mml:mi></mml:math></inline-formula> the expectation and the covariance, respectively. To test the predictability of the different linear models (i.e., with or without hidden components <inline-formula><mml:math id="M184" display="inline"><mml:mi mathvariant="bold-italic">z</mml:mi></mml:math></inline-formula>), a test set has been created, starting from the end of the sequence of observations <inline-formula><mml:math id="M185" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>T</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> used in the assimilation window. This test set also corresponds to <inline-formula><mml:math id="M186" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">4</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> time steps with <inline-formula><mml:math id="M187" display="inline"><mml:mrow><mml:mi mathvariant="normal">d</mml:mi><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula>. It is used to compute two metrics, the root mean square error (RMSE) and the coverage probability at 50 %. The RMSE is used to evaluate the precision of the forecasts, comparing the true <inline-formula><mml:math id="M188" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M189" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> components to the estimated ones, whereas the coverage probability is used to evaluate the reliability of the prediction, evaluating the proportion of true trajectories falling within the 50 % prediction interval of <inline-formula><mml:math id="M190" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M191" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. Examples of predictions are given in Fig. <xref ref-type="fig" rid="Ch1.F4"/>. It shows bad linear predictions of the model with only <inline-formula><mml:math id="M192" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> (dashed black lines). As the <inline-formula><mml:math id="M193" display="inline"><mml:mi mathvariant="bold">M</mml:mi></mml:math></inline-formula> operator is not time-dependent, the predictions are quite similar, close to the persistence. Then, adding one (green) or two (red) hidden components in the <inline-formula><mml:math id="M194" display="inline"><mml:mi mathvariant="bold">M</mml:mi></mml:math></inline-formula> operators creates some nonlinearities in the forecasts.</p>
      <p id="d1e3703">In Fig. <xref ref-type="fig" rid="Ch1.F5"/>, the predictions are evaluated over the whole test dataset for different lead times. By introducing hidden components, the RMSE decreases for both <inline-formula><mml:math id="M195" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M196" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> components (panels a and b). For instance, for a lead time of <inline-formula><mml:math id="M197" display="inline"><mml:mn mathvariant="normal">0.05</mml:mn></mml:math></inline-formula>, when considering two hidden components, the RMSE is halved when it is compared to the naive linear model without hidden components. The coverage probability metric is also largely improved (panels c and d). Indeed, the results with two hidden components are close to 50 %, the optimal value.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><?xmltex \def\figurename{Figure}?><label>Figure 4</label><caption><p id="d1e3739">Example of three statistical forecasts of <inline-formula><mml:math id="M198" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(a)</bold> and <inline-formula><mml:math id="M199" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(b)</bold> with their <inline-formula><mml:math id="M200" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> % prediction interval using three different linear operators with no hidden component (dashed black), one hidden component (green) and two hidden components (red). These predictions are obtained using sequential statistical forecasts, as explained in Eq. (7), on an independent test dataset.</p></caption>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://npg.copernicus.org/articles/30/129/2023/npg-30-129-2023-f04.png"/>

      </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><?xmltex \def\figurename{Figure}?><label>Figure 5</label><caption><p id="d1e3786">Root mean square error <bold>(a, b)</bold> and 50 % coverage probability <bold>(c, d)</bold> as a function of the lead time (<inline-formula><mml:math id="M201" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis) for the reconstruction of the components <inline-formula><mml:math id="M202" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(a, c)</bold> and <inline-formula><mml:math id="M203" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(b, d)</bold>. These metrics are evaluated on an independent test dataset.</p></caption>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://npg.copernicus.org/articles/30/129/2023/npg-30-129-2023-f05.png"/>

      </fig>

      <p id="d1e3837">To evaluate where the linear model with <inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> performs better than the one with <inline-formula><mml:math id="M205" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>, the Euclidean distances between the forecasts (for a lead time of <inline-formula><mml:math id="M206" display="inline"><mml:mn mathvariant="normal">0.1</mml:mn></mml:math></inline-formula>) and the truth are computed. Those errors are evaluated at each time step of the test dataset, in the <inline-formula><mml:math id="M207" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> space. Based on those errors, Fig. <xref ref-type="fig" rid="Ch1.F6"/> shows the relative improvement between the model without and the model with hidden components. When the two models have similar performance, values are close to <inline-formula><mml:math id="M208" display="inline"><mml:mn mathvariant="normal">0</mml:mn></mml:math></inline-formula> (white), and when the model including <inline-formula><mml:math id="M209" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M210" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is better, values are close to <inline-formula><mml:math id="M211" display="inline"><mml:mn mathvariant="normal">1</mml:mn></mml:math></inline-formula> (red). Figure <xref ref-type="fig" rid="Ch1.F6"/> clearly shows that error reduction is not homogeneous in the attractor. The improvement is moderate on the outside of the wings of the attractor but important in the wing transition. This suggests that the introduction of the hidden components <inline-formula><mml:math id="M212" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M213" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> makes it possible to provide information on the position in the attractor and thus to make better predictions, especially in bifurcation regions.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><?xmltex \currentcnt{6}?><?xmltex \def\figurename{Figure}?><label>Figure 6</label><caption><p id="d1e3993">Relative forecast improvement measured as 1 minus the ratio between two Euclidean distances: the one calculated with model <inline-formula><mml:math id="M214" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> (at the numerator) and the one calculated with model <inline-formula><mml:math id="M215" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> (at the denominator). The Euclidean distances are calculated in the <inline-formula><mml:math id="M216" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> space and correspond to the error between the forecasts (for a lead time of <inline-formula><mml:math id="M217" display="inline"><mml:mn mathvariant="normal">0.1</mml:mn></mml:math></inline-formula>) and the truth, evaluated on an independent test dataset.</p></caption>
        <?xmltex \igopts{width=284.527559pt}?><graphic xlink:href="https://npg.copernicus.org/articles/30/129/2023/npg-30-129-2023-f06.png"/>

      </fig>

</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d1e4097">In this article, the goal is to retrieve hidden components of a dynamical system that is partially observed. The proposed methodology is purely data-driven, not physics-driven (i.e., without the use of any equations of the dynamical model). It is based on the combination of data assimilation and machine learning techniques. Three main ideas are used in the methodology: an augmented state strategy, a linear approximation of a dynamical system and an iterative procedure. The methodology is easy to implement using simple strategies and well-established algorithms: Kalman filter and smoother, linear regression using least squares, an iterative procedure inspired by the EM recursions and Gaussian random sampling for the stochastic aspect.</p>
      <p id="d1e4100">The methodology is tested on the Lorenz-63 system, where only two components of the system are observed in a short period of time. Several hidden components are introduced sequentially in the system. Although the hidden components are initialized randomly, only a few iterations of the proposed algorithm are necessary to retrieve relevant information. The recovered components are expressed with Gaussian distributions. The new components correspond to linear combinations of successive derivatives of the observed variables. This result is consistent with the theorems of Taylor and Takens, which show that time-delay embedding is useful for improving the forecasts of the system. In our case, this is evaluated using the likelihood, a metric that evaluates the innovation (i.e., the difference between Gaussian forecasts and Gaussian observations).</p>
      <p id="d1e4103">Using our methodology, we do not retrieve the true missing Lorenz component and need two hidden variables to<?pagebreak page135?> represent a single missing one. The reason for this mismatch is two-fold and is mainly the linear approximation of the dynamical system, which implies that (1) the true missing component, which does not have to be linear combinations of the observed variables, is impossible to retrieve in our framework and (2) two variables, using combinations of the time derivatives of the observed variables, are needed to accurately represent the complexity of the dynamics. However, it is important to note that, even if two variables are needed to replace a single one, the dynamical evolution of the system is relatively well captured, for short lead times, with our methodology. This correct representation of the evolution might ultimately be the most important (e.g., for accurate and reliable forecasting).</p>
      <?pagebreak page136?><p id="d1e4106">The proposed methodology uses a strong assumption: the linear approximation of the dynamical system is global (i.e., fixed for the whole observation period). A perspective is to use adaptive approximations of the model using local linear regressions. This strategy is computationally more expensive because a linear regression is adjusted at each time step but shows some improvements in chaotic systems (see <xref ref-type="bibr" rid="bib1.bibx17 bib1.bibx18" id="altparen.22"/>). In this context of an adaptive linear dynamical model, the proposed methodology could be easily plugged into an ensemble Kalman procedure based on analog forecasts <xref ref-type="bibr" rid="bib1.bibx12" id="paren.23"/>. In future works, we plan to compare the global and local linear approaches (i.e., a fixed or adaptive linear surrogate model). We also plan to compare them to nonlinear surrogate models, based on neural network architectures with latent information encoded in an augmented space or in hidden layers (e.g., long short-term memory – LSTM).</p>
      <p id="d1e4116">In this paper, we have demonstrated the feasibility of the method on an idealized and comprehensive problem using the Lorenz-63 system. In the future, we plan to apply the methodology to more challenging problems, like the Lorenz-96 system or a quasi-geostrophic model. For application to real data, we plan to use a database of observed climate indices and try to find latent variables that help to make data-driven predictions.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d1e4123">The Python code is available at <uri>https://github.com/ptandeo/Kalman</uri> under the GNU license and the data are
generated using the Lorenz-63 system <xref ref-type="bibr" rid="bib1.bibx13" id="paren.24"/>.</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d1e4132">The supplement related to this article is available online at: <inline-supplementary-material xlink:href="https://doi.org/10.5194/npg-30-129-2023-supplement" xlink:title="zip">https://doi.org/10.5194/npg-30-129-2023-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e4141">PT wrote the article. PT and PA developed the algorithm. FS and PA helped with the redaction of the paper.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e4147">At least one of the (co-)authors is a member of the editorial board of <italic>Nonlinear Processes in Geophysics</italic>. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d1e4156">Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e4162">This paper is the result of a project proposed in a course on “Data Assimilation” in the masters program “Ocean Data Science” at Univ Brest, ENSTA Bretagne, and IMT Atlantique, France. The authors would like to thank the students for their participation in the project: Nils Niebaum, Zackary Vanche, Benoit Presse, Dimitri Vlahopoulos, Yanis Grit and Joséphine Schmutz. The authors would like to thank Noémie Le Carrer for her proofreading of the paper and Paul Platzer, Said Ouala, Lucas Drumetz, Juan Ruiz, Manuel Pulido and Takemasa Miyoshi for their valuable comments.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e4167">This work was supported by ISblue project, Interdisciplinary graduate school for the blue planet (ANR-17-EURE-0015) and co-funded by a grant from the French government under the program “Investissements d'Avenir” embedded in France 2030. This work was also supported by LEFE program (LEFE IMAGO projects ARVOR).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e4174">This paper was edited by Natale Alberto Carrassi and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><?xmltex \def\ref@label{{Bocquet et~al.(2019)Bocquet, Brajard, Carrassi, and
Bertino}}?><label>Bocquet et al.(2019)Bocquet, Brajard, Carrassi, and
Bertino</label><?label bocquet2019data?><mixed-citation>Bocquet, M., Brajard, J., Carrassi, A., and Bertino, L.: Data assimilation as a learning tool to infer ordinary differential equation representations of dynamical models, Nonlin. Processes Geophys., 26, 143–162, <ext-link xlink:href="https://doi.org/10.5194/npg-26-143-2019" ext-link-type="DOI">10.5194/npg-26-143-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx2"><?xmltex \def\ref@label{{Brajard et~al.(2020)Brajard, Carrassi, Bocquet, and
Bertino}}?><label>Brajard et al.(2020)Brajard, Carrassi, Bocquet, and
Bertino</label><?label brajard2020combining?><mixed-citation>Brajard, J., Carrassi, A., Bocquet, M., and Bertino, L.: Combining data
assimilation and machine learning to emulate a dynamical model from sparse
and noisy observations: A case study with the Lorenz 96 model, Journal of
Computational Science, 44, 101171, <ext-link xlink:href="https://doi.org/10.1016/j.jocs.2020.101171" ext-link-type="DOI">10.1016/j.jocs.2020.101171</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx3"><?xmltex \def\ref@label{{Brajard et~al.(2021)Brajard, Carrassi, Bocquet, and
Bertino}}?><label>Brajard et al.(2021)Brajard, Carrassi, Bocquet, and
Bertino</label><?label brajard2021combining?><mixed-citation>Brajard, J., Carrassi, A., Bocquet, M., and Bertino, L.: Combining data
assimilation and machine learning to infer unresolved scale parametrization,
Philos. T. Roy. Soc. A, 379, 2194, <ext-link xlink:href="https://doi.org/10.1098/rsta.2020.0086" ext-link-type="DOI">10.1098/rsta.2020.0086</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx4"><?xmltex \def\ref@label{{Brunton et~al.(2016)Brunton, Proctor, and
Kutz}}?><label>Brunton et al.(2016)Brunton, Proctor, and
Kutz</label><?label brunton2016discovering?><mixed-citation>
Brunton, S. L., Proctor, J. L., and Kutz, J. N.: Discovering governing
equations from data by sparse identification of nonlinear dynamical systems,
P. Natl. Acad. Sci. USA, 113, 3932–3937, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx5"><?xmltex \def\ref@label{{Brunton et~al.(2017)Brunton, Brunton, Proctor, Kaiser, and
Kutz}}?><label>Brunton et al.(2017)Brunton, Brunton, Proctor, Kaiser, and
Kutz</label><?label brunton2017chaos?><mixed-citation>Brunton, S. L., Brunton, B. W., Proctor, J. L., Kaiser, E., and Kutz, J. N.:
Chaos as an intermittently forced linear system, Nat. Commun., 8, 19,
<ext-link xlink:href="https://doi.org/10.1038/s41467-017-00030-8" ext-link-type="DOI">10.1038/s41467-017-00030-8</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx6"><?xmltex \def\ref@label{{Delyon et~al.(1999)Delyon, Lavielle, and
Moulines}}?><label>Delyon et al.(1999)Delyon, Lavielle, and
Moulines</label><?label delyon1999convergence?><mixed-citation>
Delyon, B., Lavielle, M., and Moulines, E.: Convergence of a stochastic
approximation version of the EM algorithm, Ann. Stat., 27, 94–128,
1999.</mixed-citation></ref>
      <ref id="bib1.bibx7"><?xmltex \def\ref@label{{Dreano et~al.(2017)Dreano, Tandeo, Pulido, Ait-El-Fquih, Chonavel,
and Hoteit}}?><label>Dreano et al.(2017)Dreano, Tandeo, Pulido, Ait-El-Fquih, Chonavel,
and Hoteit</label><?label dreano2017estimating?><mixed-citation>
Dreano, D., Tandeo, P., Pulido, M., Ait-El-Fquih, B., Chonavel, T., and Hoteit,
I.: Estimating model-error covariances in nonlinear state-space models using
Kalman smoothing and the expectation–maximization algorithm, Q.
J. Roy. Meteor. Soc., 143, 1877–1885, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx8"><?xmltex \def\ref@label{{Fablet et~al.(2021)Fablet, Chapron, Drumetz, M{\'{e}}min, Pannekoucke,
and Rousseau}}?><label>Fablet et al.(2021)Fablet, Chapron, Drumetz, Mémin, Pannekoucke,
and Rousseau</label><?label fablet2021learning?><mixed-citation>Fablet, R., Chapron, B., Drumetz, L., Mémin, E., Pannekoucke, O., and
Rousseau, F.: Learning variational data assimilation models and solvers,
J. Adv. Model. Earth Sy., 13, e2021MS002572, <ext-link xlink:href="https://doi.org/10.1029/2021MS002572" ext-link-type="DOI">10.1029/2021MS002572</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx9"><?xmltex \def\ref@label{{Jayne et~al.(2017)Jayne, Roemmich, Zilberman, Riser, Johnson,
Johnson, and Piotrowicz}}?><label>Jayne et al.(2017)Jayne, Roemmich, Zilberman, Riser, Johnson,
Johnson, and Piotrowicz</label><?label jayne2017argo?><mixed-citation>
Jayne, S. R., Roemmich, D., Zilberman, N., Riser, S. C., Johnson, K. S.,
Johnson, G. C., and Piotrowicz, S. R.: The Argo program: present and future,
Oceanography, 30, 18–28, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx10"><?xmltex \def\ref@label{{Kitagawa(1998)}}?><label>Kitagawa(1998)</label><?label kitagawa1998self?><mixed-citation>
Kitagawa, G.: A self-organizing state-space model, J. Am.
Stat. Assoc., 93, 1203–1215, 1998.</mixed-citation></ref>
      <ref id="bib1.bibx11"><?xmltex \def\ref@label{{Korda and Mezi{\'{c}}(2018)}}?><label>Korda and Mezić(2018)</label><?label korda2018linear?><mixed-citation>
Korda, M. and Mezić, I.: Linear predictors for nonlinear dynamical systems:
Koopman operator meets model predictive control, Automatica, 93, 149–160,
2018.</mixed-citation></ref>
      <ref id="bib1.bibx12"><?xmltex \def\ref@label{{Lguensat et~al.(2017)Lguensat, Tandeo, Ailliot, Pulido, and
Fablet}}?><label>Lguensat et al.(2017)Lguensat, Tandeo, Ailliot, Pulido, and
Fablet</label><?label lguensat2017analog?><mixed-citation>
Lguensat, R., Tandeo, P., Ailliot, P., Pulido, M., and Fablet, R.: The analog
data assimilation, Mon. Weather Rev., 145, 4093–4107, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx13"><?xmltex \def\ref@label{{Lorenz(1963)}}?><label>Lorenz(1963)</label><?label lorenz1963deterministic?><mixed-citation>Lorenz, E. N.: Deterministic nonperiodic flow, J. Atmos. Sci.,
20, 130–141, <ext-link xlink:href="https://doi.org/10.1175/1520-0469(1963)020&lt;0130:DNF&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0469(1963)020&lt;0130:DNF&gt;2.0.CO;2</ext-link>, 1963 (data available at:  <uri>https://github.com/ptandeo/Kalman</uri>, last access: 26 May 2023).</mixed-citation></ref>
      <ref id="bib1.bibx14"><?xmltex \def\ref@label{{Mangiarotti and Huc(2019)}}?><label>Mangiarotti and Huc(2019)</label><?label mangiarotti2019can?><mixed-citation>Mangiarotti, S. and Huc, M.: Can the original equations of a dynamical system
be retrieved from observational time series?, Chaos: An Interdisciplinary
Journal of Nonlinear Science, 29, 023133, <ext-link xlink:href="https://doi.org/10.1063/1.5081448" ext-link-type="DOI">10.1063/1.5081448</ext-link>, 2019.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx15"><?xmltex \def\ref@label{{North et~al.(2022)North, Wikle, and Schliep}}?><label>North et al.(2022)North, Wikle, and Schliep</label><?label north2022bayesian?><mixed-citation>North, J. S., Wikle, C. K., and Schliep, E. M.: A Bayesian Approach for
Data-Driven Dynamic Equation Discovery, J. Agr. Biol.
Envir. S., 27, 728–747, <ext-link xlink:href="https://doi.org/10.1007/s13253-022-00514-1" ext-link-type="DOI">10.1007/s13253-022-00514-1</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx16"><?xmltex \def\ref@label{{Ouala et~al.(2020)Ouala, Nguyen, Drumetz, Chapron, Pascual, Collard,
Gaultier, and Fablet}}?><label>Ouala et al.(2020)Ouala, Nguyen, Drumetz, Chapron, Pascual, Collard,
Gaultier, and Fablet</label><?label ouala2020learning?><mixed-citation>Ouala, S., Nguyen, D., Drumetz, L., Chapron, B., Pascual, A., Collard, F.,
Gaultier, L., and Fablet, R.: Learning latent dynamics for partially observed
chaotic systems, Chaos: An Interdisciplinary Journal of Nonlinear Science,
30, 103121, <ext-link xlink:href="https://doi.org/10.1063/5.0019309" ext-link-type="DOI">10.1063/5.0019309</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx17"><?xmltex \def\ref@label{{Platzer et~al.(2021{\natexlab{a}})Platzer, Yiou, Naveau, Filipot,
Thi\'{e}baut, and Tandeo}}?><label>Platzer et al.(2021a)Platzer, Yiou, Naveau, Filipot,
Thiébaut, and Tandeo</label><?label platzer2021probability?><mixed-citation>
Platzer, P., Yiou, P., Naveau, P., Filipot, J.-F., Thiébaut, M., and Tandeo,
P.: Probability distributions for analog-to-target distances, J.
Atmos. Sci., 78, 3317–3335, 2021a.</mixed-citation></ref>
      <ref id="bib1.bibx18"><?xmltex \def\ref@label{{Platzer et~al.(2021{\natexlab{b}})Platzer, Yiou, Naveau, Tandeo,
Filipot, Ailliot, and Zhen}}?><label>Platzer et al.(2021b)Platzer, Yiou, Naveau, Tandeo,
Filipot, Ailliot, and Zhen</label><?label platzer2021using?><mixed-citation>
Platzer, P., Yiou, P., Naveau, P., Tandeo, P., Filipot, J.-F., Ailliot, P., and
Zhen, Y.: Using local dynamics to explain analog forecasting of chaotic
systems, J. Atmos. Sci., 78, 2117–2133,
2021b.</mixed-citation></ref>
      <ref id="bib1.bibx19"><?xmltex \def\ref@label{{Pulido et~al.(2018)Pulido, Tandeo, Bocquet, Carrassi, and
Lucini}}?><label>Pulido et al.(2018)Pulido, Tandeo, Bocquet, Carrassi, and
Lucini</label><?label pulido2018stochastic?><mixed-citation>Pulido, M., Tandeo, P., Bocquet, M., Carrassi, A., and Lucini, M.: Stochastic
parameterization identification using ensemble Kalman filtering combined with
maximum likelihood methods, Tellus A,
70, 1442099, <ext-link xlink:href="https://doi.org/10.1080/16000870.2018.1442099" ext-link-type="DOI">10.1080/16000870.2018.1442099</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx20"><?xmltex \def\ref@label{{Rudy et~al.(2017)Rudy, Brunton, Proctor, and Kutz}}?><label>Rudy et al.(2017)Rudy, Brunton, Proctor, and Kutz</label><?label rudy2017data?><mixed-citation>Rudy, S. H., Brunton, S. L., Proctor, J. L., and Kutz, J. N.: Data-driven
discovery of partial differential equations, Science Advances, 3, e1602614, <ext-link xlink:href="https://doi.org/10.1126/sciadv.1602614" ext-link-type="DOI">10.1126/sciadv.1602614</ext-link>,
2017.</mixed-citation></ref>
      <ref id="bib1.bibx21"><?xmltex \def\ref@label{{Ruiz et~al.(2013)Ruiz, Pulido, and Miyoshi}}?><label>Ruiz et al.(2013)Ruiz, Pulido, and Miyoshi</label><?label ruiz2013estimating?><mixed-citation>
Ruiz, J. J., Pulido, M., and Miyoshi, T.: Estimating model parameters with
ensemble-based data assimilation: A review, J. Meteorol.
Soc. Jpn., 91, 79–99, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx22"><?xmltex \def\ref@label{{S\'{e}vellec and Fedorov(2014)}}?><label>Sévellec and Fedorov(2014)</label><?label sevellec:14?><mixed-citation>
Sévellec, F. and Fedorov, A. V.: Millennial variability in an idealized
ocean model: predicting the AMOC regime shifts, J. Climate, 27,
3551–3564, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx23"><?xmltex \def\ref@label{{Shumway and Stoffer(1982)}}?><label>Shumway and Stoffer(1982)</label><?label shumway1982approach?><mixed-citation>
Shumway, R. H. and Stoffer, D. S.: An approach to time series smoothing and
forecasting using the EM algorithm, J. Time Ser. Anal., 3,
253–264, 1982.</mixed-citation></ref>
      <ref id="bib1.bibx24"><?xmltex \def\ref@label{{Takens(1981)}}?><label>Takens(1981)</label><?label takens1981detecting?><mixed-citation>Takens, F.: Detecting strange attractors in turbulence, in: Dynamical systems
and turbulence, Warwick 1980,  Springer, 366–381, ISBN 978-3-540-38945-3, <ext-link xlink:href="https://doi.org/10.1007/BFb0091924" ext-link-type="DOI">10.1007/BFb0091924</ext-link>, 1981.</mixed-citation></ref>
      <ref id="bib1.bibx25"><?xmltex \def\ref@label{{Talmon et~al.(2015)Talmon, Mallat, Zaveri, and
Coifman}}?><label>Talmon et al.(2015)Talmon, Mallat, Zaveri, and
Coifman</label><?label talmon2015manifold?><mixed-citation>
Talmon, R., Mallat, S., Zaveri, H., and Coifman, R. R.: Manifold learning for
latent variable inference in dynamical systems, IEEE T. Signal
Proces., 63, 3843–3856, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx26"><?xmltex \def\ref@label{{Tandeo et~al.(2015)Tandeo, Ailliot, Ruiz, Hannart, Chapron, Cuzol,
Monbet, Easton, and Fablet}}?><label>Tandeo et al.(2015)Tandeo, Ailliot, Ruiz, Hannart, Chapron, Cuzol,
Monbet, Easton, and Fablet</label><?label tandeo2015combining?><mixed-citation>Tandeo, P., Ailliot, P., Ruiz, J., Hannart, A., Chapron, B., Cuzol, A., Monbet,
V., Easton, R., and Fablet, R.: Combining analog method and ensemble data
assimilation: application to the Lorenz-63 chaotic system, in: Machine
learning and data mining approaches to climate science,  Springer, 3–12, ISBN 978-3-319-17220-0, <ext-link xlink:href="https://doi.org/10.1007/978-3-319-17220-0_1" ext-link-type="DOI">10.1007/978-3-319-17220-0_1</ext-link>,
2015.</mixed-citation></ref>
      <ref id="bib1.bibx27"><?xmltex \def\ref@label{{Tandeo et~al.(2020)Tandeo, Ailliot, Bocquet, Carrassi, Miyoshi,
Pulido, and Zhen}}?><label>Tandeo et al.(2020)Tandeo, Ailliot, Bocquet, Carrassi, Miyoshi,
Pulido, and Zhen</label><?label tandeo2020review?><mixed-citation>
Tandeo, P., Ailliot, P., Bocquet, M., Carrassi, A., Miyoshi, T., Pulido, M.,
and Zhen, Y.: A review of innovation-based methods to jointly estimate model
and observation error covariance matrices in ensemble data assimilation,
Mon. Weather Rev., 148, 3973–3994, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx28"><?xmltex \def\ref@label{{Wikner et~al.(2021)Wikner, Pathak, Hunt, Szunyogh, Girvan, and
Ott}}?><label>Wikner et al.(2021)Wikner, Pathak, Hunt, Szunyogh, Girvan, and
Ott</label><?label Wikner_2021?><mixed-citation>Wikner, A., Pathak, J., Hunt, B. R., Szunyogh, I., Girvan, M., and Ott, E.:
Using data assimilation to train a hybrid forecast system that combines
machine-learning and knowledge-based components, Chaos: An Interdisciplinary
Journal of Nonlinear Science, 31, 053114, <ext-link xlink:href="https://doi.org/10.1063/5.0048050" ext-link-type="DOI">10.1063/5.0048050</ext-link>, 2021.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Data-driven reconstruction of partially observed dynamical systems</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Bocquet et al.(2019)Bocquet, Brajard, Carrassi, and
Bertino</label><mixed-citation>
      
Bocquet, M., Brajard, J., Carrassi, A., and Bertino, L.: Data assimilation as a learning tool to infer ordinary differential equation representations of dynamical models, Nonlin. Processes Geophys., 26, 143–162, <a href="https://doi.org/10.5194/npg-26-143-2019" target="_blank">https://doi.org/10.5194/npg-26-143-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Brajard et al.(2020)Brajard, Carrassi, Bocquet, and
Bertino</label><mixed-citation>
      
Brajard, J., Carrassi, A., Bocquet, M., and Bertino, L.: Combining data
assimilation and machine learning to emulate a dynamical model from sparse
and noisy observations: A case study with the Lorenz 96 model, Journal of
Computational Science, 44, 101171, <a href="https://doi.org/10.1016/j.jocs.2020.101171" target="_blank">https://doi.org/10.1016/j.jocs.2020.101171</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Brajard et al.(2021)Brajard, Carrassi, Bocquet, and
Bertino</label><mixed-citation>
      
Brajard, J., Carrassi, A., Bocquet, M., and Bertino, L.: Combining data
assimilation and machine learning to infer unresolved scale parametrization,
Philos. T. Roy. Soc. A, 379, 2194, <a href="https://doi.org/10.1098/rsta.2020.0086" target="_blank">https://doi.org/10.1098/rsta.2020.0086</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Brunton et al.(2016)Brunton, Proctor, and
Kutz</label><mixed-citation>
      
Brunton, S. L., Proctor, J. L., and Kutz, J. N.: Discovering governing
equations from data by sparse identification of nonlinear dynamical systems,
P. Natl. Acad. Sci. USA, 113, 3932–3937, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Brunton et al.(2017)Brunton, Brunton, Proctor, Kaiser, and
Kutz</label><mixed-citation>
      
Brunton, S. L., Brunton, B. W., Proctor, J. L., Kaiser, E., and Kutz, J. N.:
Chaos as an intermittently forced linear system, Nat. Commun., 8, 19,
<a href="https://doi.org/10.1038/s41467-017-00030-8" target="_blank">https://doi.org/10.1038/s41467-017-00030-8</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Delyon et al.(1999)Delyon, Lavielle, and
Moulines</label><mixed-citation>
      
Delyon, B., Lavielle, M., and Moulines, E.: Convergence of a stochastic
approximation version of the EM algorithm, Ann. Stat., 27, 94–128,
1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Dreano et al.(2017)Dreano, Tandeo, Pulido, Ait-El-Fquih, Chonavel,
and Hoteit</label><mixed-citation>
      
Dreano, D., Tandeo, P., Pulido, M., Ait-El-Fquih, B., Chonavel, T., and Hoteit,
I.: Estimating model-error covariances in nonlinear state-space models using
Kalman smoothing and the expectation–maximization algorithm, Q.
J. Roy. Meteor. Soc., 143, 1877–1885, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Fablet et al.(2021)Fablet, Chapron, Drumetz, Mémin, Pannekoucke,
and Rousseau</label><mixed-citation>
      
Fablet, R., Chapron, B., Drumetz, L., Mémin, E., Pannekoucke, O., and
Rousseau, F.: Learning variational data assimilation models and solvers,
J. Adv. Model. Earth Sy., 13, e2021MS002572, <a href="https://doi.org/10.1029/2021MS002572" target="_blank">https://doi.org/10.1029/2021MS002572</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Jayne et al.(2017)Jayne, Roemmich, Zilberman, Riser, Johnson,
Johnson, and Piotrowicz</label><mixed-citation>
      
Jayne, S. R., Roemmich, D., Zilberman, N., Riser, S. C., Johnson, K. S.,
Johnson, G. C., and Piotrowicz, S. R.: The Argo program: present and future,
Oceanography, 30, 18–28, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Kitagawa(1998)</label><mixed-citation>
      
Kitagawa, G.: A self-organizing state-space model, J. Am.
Stat. Assoc., 93, 1203–1215, 1998.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Korda and Mezić(2018)</label><mixed-citation>
      
Korda, M. and Mezić, I.: Linear predictors for nonlinear dynamical systems:
Koopman operator meets model predictive control, Automatica, 93, 149–160,
2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Lguensat et al.(2017)Lguensat, Tandeo, Ailliot, Pulido, and
Fablet</label><mixed-citation>
      
Lguensat, R., Tandeo, P., Ailliot, P., Pulido, M., and Fablet, R.: The analog
data assimilation, Mon. Weather Rev., 145, 4093–4107, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Lorenz(1963)</label><mixed-citation>
      
Lorenz, E. N.: Deterministic nonperiodic flow, J. Atmos. Sci.,
20, 130–141, <a href="https://doi.org/10.1175/1520-0469(1963)020&lt;0130:DNF&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0469(1963)020&lt;0130:DNF&gt;2.0.CO;2</a>, 1963 (data available at:  <a href="https://github.com/ptandeo/Kalman" target="_blank"/>, last access: 26 May 2023).

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Mangiarotti and Huc(2019)</label><mixed-citation>
      
Mangiarotti, S. and Huc, M.: Can the original equations of a dynamical system
be retrieved from observational time series?, Chaos: An Interdisciplinary
Journal of Nonlinear Science, 29, 023133, <a href="https://doi.org/10.1063/1.5081448" target="_blank">https://doi.org/10.1063/1.5081448</a>, 2019.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>North et al.(2022)North, Wikle, and Schliep</label><mixed-citation>
      
North, J. S., Wikle, C. K., and Schliep, E. M.: A Bayesian Approach for
Data-Driven Dynamic Equation Discovery, J. Agr. Biol.
Envir. S., 27, 728–747, <a href="https://doi.org/10.1007/s13253-022-00514-1" target="_blank">https://doi.org/10.1007/s13253-022-00514-1</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Ouala et al.(2020)Ouala, Nguyen, Drumetz, Chapron, Pascual, Collard,
Gaultier, and Fablet</label><mixed-citation>
      
Ouala, S., Nguyen, D., Drumetz, L., Chapron, B., Pascual, A., Collard, F.,
Gaultier, L., and Fablet, R.: Learning latent dynamics for partially observed
chaotic systems, Chaos: An Interdisciplinary Journal of Nonlinear Science,
30, 103121, <a href="https://doi.org/10.1063/5.0019309" target="_blank">https://doi.org/10.1063/5.0019309</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Platzer et al.(2021a)Platzer, Yiou, Naveau, Filipot,
Thiébaut, and Tandeo</label><mixed-citation>
      
Platzer, P., Yiou, P., Naveau, P., Filipot, J.-F., Thiébaut, M., and Tandeo,
P.: Probability distributions for analog-to-target distances, J.
Atmos. Sci., 78, 3317–3335, 2021a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Platzer et al.(2021b)Platzer, Yiou, Naveau, Tandeo,
Filipot, Ailliot, and Zhen</label><mixed-citation>
      
Platzer, P., Yiou, P., Naveau, P., Tandeo, P., Filipot, J.-F., Ailliot, P., and
Zhen, Y.: Using local dynamics to explain analog forecasting of chaotic
systems, J. Atmos. Sci., 78, 2117–2133,
2021b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Pulido et al.(2018)Pulido, Tandeo, Bocquet, Carrassi, and
Lucini</label><mixed-citation>
      
Pulido, M., Tandeo, P., Bocquet, M., Carrassi, A., and Lucini, M.: Stochastic
parameterization identification using ensemble Kalman filtering combined with
maximum likelihood methods, Tellus A,
70, 1442099, <a href="https://doi.org/10.1080/16000870.2018.1442099" target="_blank">https://doi.org/10.1080/16000870.2018.1442099</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Rudy et al.(2017)Rudy, Brunton, Proctor, and Kutz</label><mixed-citation>
      
Rudy, S. H., Brunton, S. L., Proctor, J. L., and Kutz, J. N.: Data-driven
discovery of partial differential equations, Science Advances, 3, e1602614, <a href="https://doi.org/10.1126/sciadv.1602614" target="_blank">https://doi.org/10.1126/sciadv.1602614</a>,
2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Ruiz et al.(2013)Ruiz, Pulido, and Miyoshi</label><mixed-citation>
      
Ruiz, J. J., Pulido, M., and Miyoshi, T.: Estimating model parameters with
ensemble-based data assimilation: A review, J. Meteorol.
Soc. Jpn., 91, 79–99, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Sévellec and Fedorov(2014)</label><mixed-citation>
      
Sévellec, F. and Fedorov, A. V.: Millennial variability in an idealized
ocean model: predicting the AMOC regime shifts, J. Climate, 27,
3551–3564, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Shumway and Stoffer(1982)</label><mixed-citation>
      
Shumway, R. H. and Stoffer, D. S.: An approach to time series smoothing and
forecasting using the EM algorithm, J. Time Ser. Anal., 3,
253–264, 1982.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Takens(1981)</label><mixed-citation>
      
Takens, F.: Detecting strange attractors in turbulence, in: Dynamical systems
and turbulence, Warwick 1980,  Springer, 366–381, ISBN 978-3-540-38945-3, <a href="https://doi.org/10.1007/BFb0091924" target="_blank">https://doi.org/10.1007/BFb0091924</a>, 1981.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Talmon et al.(2015)Talmon, Mallat, Zaveri, and
Coifman</label><mixed-citation>
      
Talmon, R., Mallat, S., Zaveri, H., and Coifman, R. R.: Manifold learning for
latent variable inference in dynamical systems, IEEE T. Signal
Proces., 63, 3843–3856, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Tandeo et al.(2015)Tandeo, Ailliot, Ruiz, Hannart, Chapron, Cuzol,
Monbet, Easton, and Fablet</label><mixed-citation>
      
Tandeo, P., Ailliot, P., Ruiz, J., Hannart, A., Chapron, B., Cuzol, A., Monbet,
V., Easton, R., and Fablet, R.: Combining analog method and ensemble data
assimilation: application to the Lorenz-63 chaotic system, in: Machine
learning and data mining approaches to climate science,  Springer, 3–12, ISBN 978-3-319-17220-0, <a href="https://doi.org/10.1007/978-3-319-17220-0_1" target="_blank">https://doi.org/10.1007/978-3-319-17220-0_1</a>,
2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Tandeo et al.(2020)Tandeo, Ailliot, Bocquet, Carrassi, Miyoshi,
Pulido, and Zhen</label><mixed-citation>
      
Tandeo, P., Ailliot, P., Bocquet, M., Carrassi, A., Miyoshi, T., Pulido, M.,
and Zhen, Y.: A review of innovation-based methods to jointly estimate model
and observation error covariance matrices in ensemble data assimilation,
Mon. Weather Rev., 148, 3973–3994, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Wikner et al.(2021)Wikner, Pathak, Hunt, Szunyogh, Girvan, and
Ott</label><mixed-citation>
      
Wikner, A., Pathak, J., Hunt, B. R., Szunyogh, I., Girvan, M., and Ott, E.:
Using data assimilation to train a hybrid forecast system that combines
machine-learning and knowledge-based components, Chaos: An Interdisciplinary
Journal of Nonlinear Science, 31, 053114, <a href="https://doi.org/10.1063/5.0048050" target="_blank">https://doi.org/10.1063/5.0048050</a>, 2021.

    </mixed-citation></ref-html>--></article>
