Articles | Volume 31, issue 3
https://doi.org/10.5194/npg-31-409-2024
© Author(s) 2024. This work is distributed under the Creative Commons Attribution 4.0 License.
Representation learning with unconditional denoising diffusion models for dynamical systems
Download
- Final revised paper (published on 19 Sep 2024)
- Preprint (discussion started on 20 Oct 2023)
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2023-2261', Sibo Cheng, 12 Mar 2024
- AC1: 'Reply on RC1', Tobias Finn, 24 May 2024
-
RC2: 'Comment on egusphere-2023-2261', Anonymous Referee #2, 03 Apr 2024
- AC2: 'Reply on RC2', Tobias Finn, 24 May 2024
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
AR by Tobias Finn on behalf of the Authors (20 Jun 2024)
Author's response
Author's tracked changes
Manuscript
ED: Publish as is (12 Jul 2024) by Ioulia Tchiguirinskaia
AR by Tobias Finn on behalf of the Authors (16 Jul 2024)
Manuscript
This research paper presents a study on using denoising diffusion models for data-driven representation learning of dynamical systems. The research demonstrates the utility of such networks with the Lorenz 63 system, showing that the trained network can produce samples almost indistinguishable from those on the attractor, indicating the network has learned an internal representation of the system. This representation is then used for surrogate modeling and generating ensembles out of a deterministic run.
Overall I found this paper very well written and the contribution of introducing diffusion model into dynamical systems in geoscience novel and of clear contribution. Here lists my comments before I can recommend acceptance of this manuscript:
Comments:
1. If I understand correctly, the objective of this study is to explore the possibility of using diffusion model for high-dimension systems in geoscience. The numerical experiments are carried out using a three dimensional Lorenz model. To enhance the discussion, It would be beneficial if the authors could explain how generalizable their approach is to a high-dimensional spatial temporal system (e.g. by adding CNN or transformer layers for feature extractions (encoding) and decoding etc).
2. As a consequence of the small dimension, the ‘latent space’ in your diffusion model (256) is much larger the one of the physics space (3). Therefore, you have little risk in losing any information when using the denoising network for surrogate modelling. The authors may consider adding a baseline of transfer learning from an untrained (randomly initialized denoising NN) in Fig 7. The authors have shown the results of untrained NN in Tab 3 but only with a linear fine-tuning. What happens if you fine-tune with a non-linear NN of an untrained denoising NN?
Minor questions:
-Yang, X. and Wang, X., 2023. Diffusion model as representation learner. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 18938-18949).
- Mittal, S., Abstreiter, K., Bauer, S., Schölkopf, B. and Mehrjou, A., 2023, July. Diffusion based representation learning. In International Conference on Machine Learning (pp. 24963-24982). PMLR.
The authors may want to include some references and discuss the difference/similarity compared to the method used in this paper. This paper is probably the first one to propose diffusion-based representation learning in dynamical systems(?)
3. Page 9, ‘show that this representation is entangled’ why it is important for the learned features to be entangled?
4. Page 11, check the sentence ‘As we will see later, the bigger the Because of the state-dependency, the resulting distribution is implicitly represented by the ensemble and could extend beyond a Gaussian assumption’
5. Page 13, it seems that you have used a lot of training samples (1.6*E7) for your diffusion model for the Lorenz system of dimension 3. I was wondering if a standard surrogate model will require that much. That is saying maybe a standard surrogate model can outperform the diffusion-based one with less training data. I am curious to see the authors’ thought.
6. fig 5 (a) and 1(b). if I understand correctly, the x-axis is the pseudo time instead of the real time in the dynamical system. if it is the case, it would be benificial to add an x-axis label to avoid any confusion.