Articles | Volume 30, issue 3
https://doi.org/10.5194/npg-30-375-2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/npg-30-375-2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
How far can the statistical error estimation problem be closed by collocated data?
Air Quality Research Division, Environment and Climate Change Canada (ECCC), Dorval, Quebec, Canada
Rhenish Institute for Environmental Research at the University of Cologne (RIU), Cologne, Germany
Richard Ménard
Air Quality Research Division, Environment and Climate Change Canada (ECCC), Dorval, Quebec, Canada
Related authors
Annika Vogel, Richard Ménard, James Abu, and Jack Chen
EGUsphere, https://doi.org/10.5194/egusphere-2025-6386, https://doi.org/10.5194/egusphere-2025-6386, 2025
This preprint is open for discussion and under review for Nonlinear Processes in Geophysics (NPG).
Short summary
Short summary
Recent wildfire activity in Canada has been rising the public demand for smoke forecasts. This however comes with a challenge for national weather centers to provide fast, yet accurate forecasts of these strong unexpected events. This study explores the potential of a novel approach for operational air quality forecasting during wildfire smoke episodes. It not only enables an improved use of sparse observation signals, but also provides theoretical insights how uncertainties evolve over time.
Annika Vogel and Hendrik Elbern
Geosci. Model Dev., 14, 5583–5605, https://doi.org/10.5194/gmd-14-5583-2021, https://doi.org/10.5194/gmd-14-5583-2021, 2021
Short summary
Short summary
While atmospheric chemical forecasts rely on uncertain model parameters, their huge dimensions hamper an efficient uncertainty estimation. This study presents a novel approach to efficiently sample these uncertainties by extracting dominant dependencies and correlations. Applying the algorithm to biogenic emissions, their uncertainties can be estimated from a low number of dominant components. This states the capability of an efficient treatment of parameter uncertainties in atmospheric models.
Annika Vogel and Hendrik Elbern
Atmos. Chem. Phys., 21, 4039–4057, https://doi.org/10.5194/acp-21-4039-2021, https://doi.org/10.5194/acp-21-4039-2021, 2021
Short summary
Short summary
Forecasts of biogenic trace gases highly depend on the model setup and input fields. This study identifies sources of related forecast uncertainties for biogenic gases. Exceptionally high differences in both biogenic emissions and pollutant transport in the Po Valley are identified to be caused by the representation of the land surface and boundary layer dynamics. Consequently, changes in the model configuration are shown to induce significantly different local concentrations of biogenic gases.
Annika Vogel, Richard Ménard, James Abu, and Jack Chen
EGUsphere, https://doi.org/10.5194/egusphere-2025-6386, https://doi.org/10.5194/egusphere-2025-6386, 2025
This preprint is open for discussion and under review for Nonlinear Processes in Geophysics (NPG).
Short summary
Short summary
Recent wildfire activity in Canada has been rising the public demand for smoke forecasts. This however comes with a challenge for national weather centers to provide fast, yet accurate forecasts of these strong unexpected events. This study explores the potential of a novel approach for operational air quality forecasting during wildfire smoke episodes. It not only enables an improved use of sparse observation signals, but also provides theoretical insights how uncertainties evolve over time.
Annika Vogel and Hendrik Elbern
Geosci. Model Dev., 14, 5583–5605, https://doi.org/10.5194/gmd-14-5583-2021, https://doi.org/10.5194/gmd-14-5583-2021, 2021
Short summary
Short summary
While atmospheric chemical forecasts rely on uncertain model parameters, their huge dimensions hamper an efficient uncertainty estimation. This study presents a novel approach to efficiently sample these uncertainties by extracting dominant dependencies and correlations. Applying the algorithm to biogenic emissions, their uncertainties can be estimated from a low number of dominant components. This states the capability of an efficient treatment of parameter uncertainties in atmospheric models.
Annika Vogel and Hendrik Elbern
Atmos. Chem. Phys., 21, 4039–4057, https://doi.org/10.5194/acp-21-4039-2021, https://doi.org/10.5194/acp-21-4039-2021, 2021
Short summary
Short summary
Forecasts of biogenic trace gases highly depend on the model setup and input fields. This study identifies sources of related forecast uncertainties for biogenic gases. Exceptionally high differences in both biogenic emissions and pollutant transport in the Po Valley are identified to be caused by the representation of the land surface and boundary layer dynamics. Consequently, changes in the model configuration are shown to induce significantly different local concentrations of biogenic gases.
Cited articles
Anthes, R. and Rieckh, T.:
Estimating observation and model error variances using multiple data sets, Atmos. Meas. Tech., 11, 4239–4260, https://doi.org/10.5194/amt-11-4239-2018, 2018. a, b, c, d
Crow, W. T. and van den Berg, M. J.:
An improved approach for estimating observation and model error parameters in soil moisture data assimilation, Water Resour. Res., 46, W12519, https://doi.org/10.1029/2010WR009402, 2010. a, b
Crow, W. T. and Yilmaz, M. T.:
The Auto-Tuned Land Data Assimilation System (ATLAS), Water Resour. Res., 50, 371–385, https://doi.org/10.1002/2013WR014550, 2014. a
Daley, R.:
The Effect of Serially Correlated Observation and Model Error on Atmospheric Data Assimilation, Mon. Weather Rev., 120, 164–177, https://doi.org/10.1175/1520--0493(1992)120<0164:TEOSCO>2.0.CO;2, 1992a. a
Daley, R.:
The Lagged Innovation Covariance: A Performance Diagnostic for Atmospheric Data Assimilation, Mon. Weather Rev., 120, 178–196, https://doi.org/10.1175/1520--0493(1992)120<0178:TLICAP>2.0.CO;2, 1992b. a, b
Desroziers, G., Berre, L., Chapnik, B., and Poli, P.:
Diagnosis of observation, background and analysis-error statistics in observation space, Q. J. Roy. Meteorol. Soc., 131, 3385–3396, https://doi.org/10.1256/qj.05.108, 2005. a
Gray, J. and Allan, D.:
A Method for Estimating the Frequency Stability of an Individual Oscillator, in: 28th Annual Symposium on Frequency Control, Atlantic City, NJ, USA, 29–31 May 1974, 243–246, https://doi.org/10.1109/FREQ.1974.200027, 1974. a
Grubbs, F. E.:
On Estimating Precision of Measuring Instruments and Product Variability, J. Am. Stat. Assoc., 43, 243–264, https://doi.org/10.1080/01621459.1948.10483261, 1948. a
Gruber, A., Su, C.-H., Crow, W. T., Zwieback, S., Dorigo, W. A., and Wagner, W.:
Estimating error cross-correlations in soil moisture data sets using extended collocation analysis, J. Geophys. Res.-Atmos., 121, 1208–1219, https://doi.org/10.1002/2015JD024027, 2016. a, b, c
Kren, A. C. and Anthes, R. A.:
Estimating Error Variances of a Microwave Sensor and Dropsondes aboard the Global Hawk in Hurricanes Using the Three-Cornered Hat Method, J. Atmos. Ocean. Tech., 38, 197–208, https://doi.org/10.1175/JTECH-D-20-0044.1, 2021. a
Li, H., Kalnay, E., and Miyoshi, T.:
Simultaneous estimation of covariance inflation and observation errors within an ensemble Kalman filter, Q. J. Roy. Meteorol. Soc., 135, 523–533, https://doi.org/10.1002/qj.371, 2009. a
McColl, K. A., Vogelzang, J., Konings, A. G., Entekhabi, D., Piles, M., and Stoffelen, A.:
Extended triple collocation: Estimating errors and correlation coefficients with respect to an unknown target, Geophys. Res. Lett., 41, 6229–6236, https://doi.org/10.1002/2014GL061322, 2014. a
Ménard, R.:
Error covariance estimation methods based on analysis residuals: theoretical foundation and convergence properties derived from simplified observation networks, Q. J. Roy. Meteorol. Soc., 142, 257–273, https://doi.org/10.1002/qj.2650, 2016. a
Ménard, R. and Deshaies-Jacques, M.:
Evaluation of Analysis by Cross-Validation. Part I: Using Verification Metrics, Atmosphere, 9, 86, https://doi.org/10.3390/atmos9030086, 2018. a
Mitchell, H. L. and Houtekamer, P. L.:
An Adaptive Ensemble Kalman Filter, Mon. Weather Rev., 128, 416–433, https://doi.org/10.1175/1520--0493(2000)128<0416:AAEKF>2.0.CO;2, 2000. a
Nielsen, J. K., Gleisner, H., Syndergaard, S., and Lauritsen, K. B.:
Estimation of refractivity uncertainties and vertical error correlations in collocated radio occultations, radiosondes, and model forecasts, Atmos. Meas. Tech., 15, 6243–6256, https://doi.org/10.5194/amt-15-6243-2022, 2022. a, b
Pan, M., Fisher, C. K., Chaney, N. W., Zhan, W., Crow, W. T., Aires, F., Entekhabi, D., and Wood, E. F.:
Triple collocation: Beyond three estimates and separation of structural/non-structural errors, Remote Sens. Environ., 171, 299–310, https://doi.org/10.1016/j.rse.2015.10.028, 2015. a, b, c, d, e, f, g, h
Rieckh, T., Sjoberg, J. P., and Anthes, R. A.:
The Three-Cornered Hat Method for Estimating Error Variances of Three or More Atmospheric Datasets. Part II: Evaluating Radio Occultation and Radiosonde Observations, Global Model Forecasts, and Reanalyses, J. Atmos. Ocean. Tech., 38, 1777–1796, https://doi.org/10.1175/JTECH-D-20-0209.1, 2021. a, b, c
Scipal, K., Holmes, T., de Jeu, R., Naeimi, V., and Wagner, W.:
A possible solution for the problem of estimating the error structure of global soil moisture data sets, Geophys. Res. Lett., 35, https://doi.org/10.1029/2008GL035599, 2008. a
Stoffelen, A.:
Toward the true near-surface wind speed: Error modeling and calibration using triple collocation, J. Geophys. Res.-Oceans, 103, 7755–7766, https://doi.org/10.1029/97JC03180, 1998. a
Su, C.-H., Ryu, D., Crow, W. T., and Western, A. W.:
Beyond triple collocation: Applications to soil moisture monitoring, J. Geophys. Res.-Atmos., 119, 6419–6439, https://doi.org/10.1002/2013JD021043, 2014. a, b, c
Tandeo, P., Ailliot, P., Bocquet, M., Carrassi, A., Miyoshi, T., Pulido, M., and Zhen, Y.:
A Review of Innovation-Based Methods to Jointly Estimate Model and Observation Error Covariance Matrices in Ensemble Data Assimilation, Mon. Weather Rev., 148, 3973–3994, https://doi.org/10.1175/MWR-D-19--0240.1, 2020. a, b
Tangborn, A., Ménard, R., and Ortland, D.:
Bias correction and random error characterization for the assimilation of high-resolution Doppler imager line-of-sight velocity measurements, J. Geophys. Res.-Atmos., 107, ACL 5-1–ACL 5-15, https://doi.org/10.1029/2001JD000397, 2002. a
Todling, R., Semane, N., Anthes, R., and Healy, S.:
The relationship between two methods for estimating uncertainties in data assimilation, Q. J. Roy. Meteorol. Soc., 148, 2942–2954, https://doi.org/10.1002/qj.4343, 2022.
a, b, c, d
Vogel, A. and Ménard, R.: Statistical error estimation from residual statistics of multiple collocated datasets: Data from synthetic experiments, Zenodo [data set], https://doi.org/10.5281/zenodo.8263552, 2023. a, b
Voshtani, S., Ménard, R., Walker, T. W., and Hakami, A.:
Assimilation of GOSAT Methane in the Hemispheric CMAQ; Part I: Design of the Assimilation System, Remote Sens.-Basel, 14, 371, https://doi.org/10.3390/rs14020371, 2022. a
Xu, X. and Zou, X.:
Global 3D Features of Error Variances of GPS Radio Occultation and Radiosonde Observations, Remote Sens.-Basel, 13, 1, https://doi.org/10.3390/rs13010001, 2021. a
Zwieback, S., Scipal, K., Dorigo, W., and Wagner, W.:
Structural and statistical properties of the collocation technique for error characterization,
Nonlin. Processes Geophys., 19, 69–80, https://doi.org/10.5194/npg-19-69-2012, 2012. a, b, c, d
Short summary
Accurate estimation of the error statistics required for data assimilation remains an ongoing challenge, as statistical assumptions are required to solve the estimation problem. This work provides a conceptual view of the statistical error estimation problem in light of the increasing number of available datasets. We found that the total number of required assumptions increases with the number of overlapping datasets, but the relative number of error statistics that can be estimated increases.
Accurate estimation of the error statistics required for data assimilation remains an ongoing...