Articles | Volume 30, issue 4
https://doi.org/10.5194/npg-30-503-2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/npg-30-503-2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Robust weather-adaptive post-processing using model output statistics random forests
Thomas Muschinski
CORRESPONDING AUTHOR
Department of Atmospheric and Cryospheric Sciences, Universität Innsbruck, Innsbruck, Austria
Department of Economics, Statistical Methods and Econometrics, Karlsruhe Institute of Technology, Karlsruhe, Germany
Georg J. Mayr
Department of Atmospheric and Cryospheric Sciences, Universität Innsbruck, Innsbruck, Austria
Achim Zeileis
Department of Statistics, Universität Innsbruck, Innsbruck, Austria
Thorsten Simon
Department of Statistics, Universität Innsbruck, Innsbruck, Austria
Related authors
Thomas Muschinski, Moritz N. Lang, Georg J. Mayr, Jakob W. Messner, Achim Zeileis, and Thorsten Simon
Wind Energ. Sci., 7, 2393–2405, https://doi.org/10.5194/wes-7-2393-2022, https://doi.org/10.5194/wes-7-2393-2022, 2022
Short summary
Short summary
The power generated by offshore wind farms can vary greatly within a couple of hours, and failing to anticipate these ramp events can lead to costly imbalances in the electrical grid. A novel multivariate Gaussian regression model helps us to forecast not just the means and variances of the next day's hourly wind speeds, but also their corresponding correlations. This information is used to generate more realistic scenarios of power production and accurate estimates for ramp probabilities.
Helen Claire Ward, Mathias Walter Rotach, Alexander Gohm, Martin Graus, Thomas Karl, Maren Haid, Lukas Umek, and Thomas Muschinski
Atmos. Chem. Phys., 22, 6559–6593, https://doi.org/10.5194/acp-22-6559-2022, https://doi.org/10.5194/acp-22-6559-2022, 2022
Short summary
Short summary
This study examines how cities and their surroundings influence turbulent exchange processes responsible for weather and climate. Analysis of a 4-year observational dataset for the Alpine city of Innsbruck reveals several similarities with other (flat) city centre sites. However, the mountain setting leads to characteristic daily and seasonal flow patterns (valley winds) and downslope windstorms that have a marked effect on temperature, wind speed, turbulence and pollutant concentration.
Fiona Fix, Georg Johann Mayr, Achim Zeileis, Isabell Kathrin Stucke, and Reto Stauffer
EGUsphere, https://doi.org/10.5194/egusphere-2024-2143, https://doi.org/10.5194/egusphere-2024-2143, 2024
Short summary
Short summary
“Atmospheric deserts” (ADs) are air masses that are transported away from hot, dry regions. Our study introduces this new concept. ADs can suppress or boost thunderstorms, and potentially contribute to the formation of heat waves, which makes them relevant for forecasting extreme events. Using a novel detection method, we follow the AD directly from North Africa to Europe for a case in June 2022, allowing us to analyze the air mass at any time and investigate how it is modified along the way.
Gregor Ehrensperger, Thorsten Simon, Georg Johann Mayr, and Tobias Hell
EGUsphere, https://doi.org/10.48550/arXiv.2210.11529, https://doi.org/10.48550/arXiv.2210.11529, 2024
Short summary
Short summary
Lightning can cause significant damages to infrastructure and pose risks to individuals. As lightning is a short and local event it is not explicitly resolved in atmospheric models. Instead, auxiliary descriptions based on meteorological expert knowledge are used to assess lightning. We used AI that successfully discovered on its own the ingredients that experts know to be essential for lightning in the well-studied region of the Alps. Additionally, it also recognized regional differences.
Deborah Morgenstern, Isabell Stucke, Georg J. Mayr, Achim Zeileis, and Thorsten Simon
Weather Clim. Dynam., 4, 489–509, https://doi.org/10.5194/wcd-4-489-2023, https://doi.org/10.5194/wcd-4-489-2023, 2023
Short summary
Short summary
Two thunderstorm environments are described for Europe: mass-field thunderstorms, which occur mostly in summer, over land, and under similar meteorological conditions, and wind-field thunderstorms, which occur mostly in winter, over the sea, and under more diverse meteorological conditions. Our descriptions are independent of static thresholds and help to understand why thunderstorms in unfavorable seasons for lightning pose a particular risk to tall infrastructure such as wind turbines.
Thomas Muschinski, Moritz N. Lang, Georg J. Mayr, Jakob W. Messner, Achim Zeileis, and Thorsten Simon
Wind Energ. Sci., 7, 2393–2405, https://doi.org/10.5194/wes-7-2393-2022, https://doi.org/10.5194/wes-7-2393-2022, 2022
Short summary
Short summary
The power generated by offshore wind farms can vary greatly within a couple of hours, and failing to anticipate these ramp events can lead to costly imbalances in the electrical grid. A novel multivariate Gaussian regression model helps us to forecast not just the means and variances of the next day's hourly wind speeds, but also their corresponding correlations. This information is used to generate more realistic scenarios of power production and accurate estimates for ramp probabilities.
Helen Claire Ward, Mathias Walter Rotach, Alexander Gohm, Martin Graus, Thomas Karl, Maren Haid, Lukas Umek, and Thomas Muschinski
Atmos. Chem. Phys., 22, 6559–6593, https://doi.org/10.5194/acp-22-6559-2022, https://doi.org/10.5194/acp-22-6559-2022, 2022
Short summary
Short summary
This study examines how cities and their surroundings influence turbulent exchange processes responsible for weather and climate. Analysis of a 4-year observational dataset for the Alpine city of Innsbruck reveals several similarities with other (flat) city centre sites. However, the mountain setting leads to characteristic daily and seasonal flow patterns (valley winds) and downslope windstorms that have a marked effect on temperature, wind speed, turbulence and pollutant concentration.
Deborah Morgenstern, Isabell Stucke, Thorsten Simon, Georg J. Mayr, and Achim Zeileis
Weather Clim. Dynam., 3, 361–375, https://doi.org/10.5194/wcd-3-361-2022, https://doi.org/10.5194/wcd-3-361-2022, 2022
Short summary
Short summary
Wintertime lightning in central Europe is rare but has a large damage potential for tall structures such as wind turbines. We use a data-driven approach to explain why it even occurs when the meteorological processes causing thunderstorms in summer are absent. In summer, with strong solar input, thunderclouds have a large vertical extent, whereas in winter, thunderclouds are shallower in the vertical but tilted and elongated in the horizontal by strong winds that increase with altitude.
David Schoenach, Thorsten Simon, and Georg Johann Mayr
Adv. Stat. Clim. Meteorol. Oceanogr., 6, 45–60, https://doi.org/10.5194/ascmo-6-45-2020, https://doi.org/10.5194/ascmo-6-45-2020, 2020
Short summary
Short summary
State-of-the-art statistical methods are applied to postprocess an ensemble of numerical forecasts for vertical profiles of air temperature. These profiles are important tools in weather forecasting as they show the stratification and the static stability of the atmosphere. Flexible regression models combined with the multi-dimensionality of the data lead to better calibration and representation of uncertainty of the vertical profiles.
Moritz N. Lang, Sebastian Lerch, Georg J. Mayr, Thorsten Simon, Reto Stauffer, and Achim Zeileis
Nonlin. Processes Geophys., 27, 23–34, https://doi.org/10.5194/npg-27-23-2020, https://doi.org/10.5194/npg-27-23-2020, 2020
Short summary
Short summary
Statistical post-processing aims to increase the predictive skill of probabilistic ensemble weather forecasts by learning the statistical relation between historical pairs of observations and ensemble forecasts within a given training data set. This study compares four different training schemes and shows that including multiple years of data in the training set typically yields a more stable post-processing while it loses the ability to quickly adjust to temporal changes in the underlying data.
Christian Mallaun, Andreas Giez, Georg J. Mayr, and Mathias W. Rotach
Atmos. Chem. Phys., 19, 9769–9786, https://doi.org/10.5194/acp-19-9769-2019, https://doi.org/10.5194/acp-19-9769-2019, 2019
Short summary
Short summary
This study presents airborne measurements in shallow convection over land to investigate the dynamic properties of clouds focusing on possible narrow downdraughts in the surrounding of the clouds. A characteristic narrow downdraught region (
subsiding shell) is found directly outside the cloud borders for the mean vertical wind distribution. The
subsiding shellresults from the distribution of the highly variable updraughts and downdraughts in the near vicinity of the cloud.
Moritz N. Lang, Georg J. Mayr, Reto Stauffer, and Achim Zeileis
Adv. Stat. Clim. Meteorol. Oceanogr., 5, 115–132, https://doi.org/10.5194/ascmo-5-115-2019, https://doi.org/10.5194/ascmo-5-115-2019, 2019
Short summary
Short summary
Accurate wind forecasts are of great importance for decision-making processes in today's society. This work presents a novel probabilistic post-processing method for wind vector forecasts employing a bivariate Gaussian response distribution. To capture a possible mismatch between the predicted and observed wind direction caused by location-specific properties, the approach incorporates a smooth rotation of the wind direction conditional on the season and the forecasted ensemble wind direction.
Sebastian J. Dietz, Philipp Kneringer, Georg J. Mayr, and Achim Zeileis
Adv. Stat. Clim. Meteorol. Oceanogr., 5, 101–114, https://doi.org/10.5194/ascmo-5-101-2019, https://doi.org/10.5194/ascmo-5-101-2019, 2019
Short summary
Short summary
Low-visibility conditions reduce the flight capacity of airports and can lead to delays and supplemental costs for airlines and airports. In this study, the forecasting skill and most important model predictors of airport-relevant low visibility are investigated for multiple flight planning horizons with different statistical models.
Manuel Gebetsberger, Reto Stauffer, Georg J. Mayr, and Achim Zeileis
Adv. Stat. Clim. Meteorol. Oceanogr., 5, 87–100, https://doi.org/10.5194/ascmo-5-87-2019, https://doi.org/10.5194/ascmo-5-87-2019, 2019
Short summary
Short summary
This article presents a method for improving probabilistic air temperature forecasts, particularly at Alpine sites. Using a nonsymmetric forecast distribution, the probabilistic forecast quality can be improved with respect to the common symmetric Gaussian distribution used. Furthermore, a long-term training approach of 3 years is presented to ensure the stability of the regression coefficients. The research was based on a PhD project on building an automated forecast system for northern Italy.
Thorsten Simon, Georg J. Mayr, Nikolaus Umlauf, and Achim Zeileis
Adv. Stat. Clim. Meteorol. Oceanogr., 5, 1–16, https://doi.org/10.5194/ascmo-5-1-2019, https://doi.org/10.5194/ascmo-5-1-2019, 2019
Short summary
Short summary
Lightning in Alpine regions is associated with events such as thunderstorms,
extreme precipitation, high wind gusts, flash floods, and debris flows.
We present a statistical approach to predict lightning counts based on
numerical weather predictions. Lightning counts are considered on a grid
with 18 km mesh size. Skilful prediction is obtained for a forecast horizon
of 5 days over complex terrain.
Jutta Vüllers, Georg J. Mayr, Ulrich Corsmeier, and Christoph Kottmeier
Atmos. Chem. Phys., 18, 18169–18186, https://doi.org/10.5194/acp-18-18169-2018, https://doi.org/10.5194/acp-18-18169-2018, 2018
Short summary
Short summary
This paper investigates frequently occurring foehn at the Dead Sea, which strongly impacts the local climatic conditions, in particular temperature and humidity, as well as evaporation from the Dead Sea, the aerosol load, and visibility. A statistical classification exposes two types of foehn and first-time, high-resolution measurements reveal trigger mechanisms and relevant characteristics, such as wind velocities, affected air layers, and resulting phenomena such as hydraulic jumps and rotors.
Reto Stauffer, Georg J. Mayr, Jakob W. Messner, and Achim Zeileis
Adv. Stat. Clim. Meteorol. Oceanogr., 4, 65–86, https://doi.org/10.5194/ascmo-4-65-2018, https://doi.org/10.5194/ascmo-4-65-2018, 2018
Short summary
Short summary
Snowfall forecasts are important for a range of economic sectors as well as for the safety of people and infrastructure, especially in mountainous regions. This work presents a novel statistical approach to provide accurate forecasts for fresh snow amounts and the probability of snowfall combining data from various sources. The results demonstrate that the new approach is able to provide reliable high-resolution hourly snowfall forecasts for the eastern European Alps up to 3 days ahead.
Christian Pfeifer, Peter Höller, and Achim Zeileis
Nat. Hazards Earth Syst. Sci., 18, 571–582, https://doi.org/10.5194/nhess-18-571-2018, https://doi.org/10.5194/nhess-18-571-2018, 2018
Short summary
Short summary
In this article we analyzed spatial and temporal patterns of fatal Austrian avalanche accidents caused by backcountry and off-piste skiers and snowboarders within the winter periods 1967/1968–2015/2016. As a result of the trend analysis, we noticed an increasing trend of backcountry and off-piste avalanche fatalities within the winter periods 1967/1968–2015/2016. As a result of the spatial analysis, we noticed two hot spots of avalanche fatalities (
Arlberg–Silvrettaand
Sölden).
Thorsten Simon, Nikolaus Umlauf, Achim Zeileis, Georg J. Mayr, Wolfgang Schulz, and Gerhard Diendorfer
Nat. Hazards Earth Syst. Sci., 17, 305–314, https://doi.org/10.5194/nhess-17-305-2017, https://doi.org/10.5194/nhess-17-305-2017, 2017
Short summary
Short summary
The study presents a newly developed statistical method to assess the risk of thunderstorms in complex terrain. Observations of lightning serve as an indicator for thunderstorms. The application of the method is illustrated for Carinthia which is located in Austria, Europe.
F. Oesterle, S. Ostermann, R. Prodan, and G. J. Mayr
Geosci. Model Dev., 8, 2067–2078, https://doi.org/10.5194/gmd-8-2067-2015, https://doi.org/10.5194/gmd-8-2067-2015, 2015
Short summary
Short summary
Three practical meteorological applications with different characteristics highlight the core computer science aspects and applicability
of distributed computing to meteorology. Presenting cloud and grid computing this paper shows use case scenarios fitting a wide range of meteorological applications from operational to research studies. The paper concludes that distributed computing complements and extends existing high performance computing concepts.
S. Gisinger, G. J. Mayr, J. W. Messner, and R. Stauffer
Nonlin. Processes Geophys., 20, 305–310, https://doi.org/10.5194/npg-20-305-2013, https://doi.org/10.5194/npg-20-305-2013, 2013
Related subject area
Subject: Predictability, probabilistic forecasts, data assimilation, inverse problems | Topic: Climate, atmosphere, ocean, hydrology, cryosphere, biosphere | Techniques: Big data and artificial intelligence
Selecting and weighting dynamical models using data-driven approaches
A quest for precipitation attractors in weather radar archives
Guidance on how to improve vertical covariance localization based on a 1000-member ensemble
Weather pattern dynamics over western Europe under climate change: predictability, information entropy and production
Calibrated ensemble forecasts of the height of new snow using quantile regression forests and ensemble model output statistics
Enhancing geophysical flow machine learning performance via scale separation
Training a convolutional neural network to conserve mass in data assimilation
Data-driven predictions of a multiscale Lorenz 96 chaotic system using machine-learning methods: reservoir computing, artificial neural network, and long short-term memory network
From research to applications – examples of operational ensemble post-processing in France using machine learning
Pierre Le Bras, Florian Sévellec, Pierre Tandeo, Juan Ruiz, and Pierre Ailliot
Nonlin. Processes Geophys., 31, 303–317, https://doi.org/10.5194/npg-31-303-2024, https://doi.org/10.5194/npg-31-303-2024, 2024
Short summary
Short summary
The goal of this paper is to weight several dynamic models in order to improve the representativeness of a system. It is illustrated using a set of versions of an idealized model describing the Atlantic Meridional Overturning Circulation. The low-cost method is based on data-driven forecasts. It enables model performance to be evaluated on their dynamics. Taking into account both model performance and codependency, the derived weights outperform benchmarks in reconstructing a model distribution.
Loris Foresti, Bernat Puigdomènech Treserras, Daniele Nerini, Aitor Atencia, Marco Gabella, Ioannis V. Sideris, Urs Germann, and Isztar Zawadzki
Nonlin. Processes Geophys., 31, 259–286, https://doi.org/10.5194/npg-31-259-2024, https://doi.org/10.5194/npg-31-259-2024, 2024
Short summary
Short summary
We compared two ways of defining the phase space of low-dimensional attractors describing the evolution of radar precipitation fields. The first defines the phase space by the domain-scale statistics of precipitation fields, such as their mean, spatial and temporal correlations. The second uses principal component analysis to account for the spatial distribution of precipitation. To represent different climates, radar archives over the United States and the Swiss Alpine region were used.
Tobias Necker, David Hinger, Philipp Johannes Griewank, Takemasa Miyoshi, and Martin Weissmann
Nonlin. Processes Geophys., 30, 13–29, https://doi.org/10.5194/npg-30-13-2023, https://doi.org/10.5194/npg-30-13-2023, 2023
Short summary
Short summary
This study investigates vertical localization based on a convection-permitting 1000-member ensemble simulation. We derive an empirical optimal localization (EOL) that minimizes sampling error in 40-member sub-sample correlations assuming 1000-member correlations as truth. The results will provide guidance for localization in convective-scale ensemble data assimilation systems.
Stéphane Vannitsem
Nonlin. Processes Geophys., 30, 1–12, https://doi.org/10.5194/npg-30-1-2023, https://doi.org/10.5194/npg-30-1-2023, 2023
Short summary
Short summary
The impact of climate change on weather pattern dynamics over the North Atlantic is explored through the lens of information theory. These tools allow the predictability of the succession of weather patterns and the irreversible nature of the dynamics to be clarified. It is shown that the predictability is increasing in the observations, while the opposite trend is found in model projections. The irreversibility displays an overall increase in time in both the observations and the model runs.
Guillaume Evin, Matthieu Lafaysse, Maxime Taillardat, and Michaël Zamo
Nonlin. Processes Geophys., 28, 467–480, https://doi.org/10.5194/npg-28-467-2021, https://doi.org/10.5194/npg-28-467-2021, 2021
Short summary
Short summary
Forecasting the height of new snow is essential for avalanche hazard surveys, road and ski resort management, tourism attractiveness, etc. Météo-France operates a probabilistic forecasting system using a numerical weather prediction system and a snowpack model. It provides better forecasts than direct diagnostics but exhibits significant biases. Post-processing methods can be applied to provide automatic forecasting products from this system.
Davide Faranda, Mathieu Vrac, Pascal Yiou, Flavio Maria Emanuele Pons, Adnane Hamid, Giulia Carella, Cedric Ngoungue Langue, Soulivanh Thao, and Valerie Gautard
Nonlin. Processes Geophys., 28, 423–443, https://doi.org/10.5194/npg-28-423-2021, https://doi.org/10.5194/npg-28-423-2021, 2021
Short summary
Short summary
Machine learning approaches are spreading rapidly in climate sciences. They are of great help in many practical situations where using the underlying equations is difficult because of the limitation in computational power. Here we use a systematic approach to investigate the limitations of the popular echo state network algorithms used to forecast the long-term behaviour of chaotic systems, such as the weather. Our results show that noise and intermittency greatly affect the performances.
Yvonne Ruckstuhl, Tijana Janjić, and Stephan Rasp
Nonlin. Processes Geophys., 28, 111–119, https://doi.org/10.5194/npg-28-111-2021, https://doi.org/10.5194/npg-28-111-2021, 2021
Short summary
Short summary
The assimilation of observations using standard algorithms can lead to a violation of physical laws (e.g. mass conservation), which is shown to have a detrimental impact on the system's forecast. We use a neural network (NN) to correct this mass violation, using training data generated from expensive algorithms that can constrain such physical properties. We found that, in an idealized set-up, the NN can match the performance of these expensive algorithms at negligible computational costs.
Ashesh Chattopadhyay, Pedram Hassanzadeh, and Devika Subramanian
Nonlin. Processes Geophys., 27, 373–389, https://doi.org/10.5194/npg-27-373-2020, https://doi.org/10.5194/npg-27-373-2020, 2020
Short summary
Short summary
The performance of three machine-learning methods for data-driven modeling of a multiscale chaotic Lorenz 96 system is examined. One of the methods is found to be able to predict the future evolution of the chaotic system well from just knowing the past observations of the large-scale component of the multiscale state vector. Potential applications to data-driven and data-assisted surrogate modeling of complex dynamical systems such as weather and climate are discussed.
Maxime Taillardat and Olivier Mestre
Nonlin. Processes Geophys., 27, 329–347, https://doi.org/10.5194/npg-27-329-2020, https://doi.org/10.5194/npg-27-329-2020, 2020
Short summary
Short summary
Statistical post-processing of ensemble forecasts is now a well-known procedure in order to correct biased and misdispersed ensemble weather predictions. But practical application in European national weather services is in its infancy. Different applications of ensemble post-processing using machine learning at an industrial scale are presented. Forecast quality and value are improved compared to the raw ensemble, but several facilities have to be made to adjust to operational constraints.
Cited articles
Athey, S., Tibshirani, J., and Wager, S.: Generalized random forests, Ann. Stat., 47, 1148–1178, https://doi.org/10.1214/18-AOS1709, 2019. a
Baran, S. and Nemoda, D.: Censored and shifted gamma distribution based EMOS model for probabilistic quantitative precipitation forecasting, Environmetrics, 27, 280–292, https://doi.org/10.1002/env.2391, 2016. a
Bauer, P., Thorpe, A., and Brunet, G.: The Quiet Revolution of Numerical Weather Prediction, Nature, 525, 47–55, https://doi.org/10.1038/nature14956, 2015. a
Breiman, L.: Bagging Predictors, Mach. Learn., 24, 123–140, https://doi.org/10.1007/bf00058655, 1996. a
Breiman, L.: Random Forests, Mach. Learn., 45, 5–32, https://doi.org/10.1023/a:1010933404324, 2001. a, b
Bremnes, J. B.: Ensemble postprocessing using quantile function regression based on neural networks and Bernstein polynomials, Mon. Weather Rev., 148, 403–414, https://doi.org/10.1175/MWR-D-19-0227.1, 2020. a
Evin, G., Lafaysse, M., Taillardat, M., and Zamo, M.: Calibrated ensemble forecasts of the height of new snow using quantile regression forests and ensemble model output statistics, Nonlin. Processes Geophys., 28, 467–480, https://doi.org/10.5194/npg-28-467-2021, 2021. a
Glahn, H. R. and Lowry, D. A.: The Use of Model Output Statistics (MOS) in Objective Weather Forecasting, J. Appl. Meteorol., 11, 1203–1211, https://doi.org/10.1175/1520-0450(1972)011<1203:tuomos>2.0.co;2, 1972. a, b, c
Gneiting, T. and Raftery, A. E.: Strictly Proper Scoring Rules, Prediction, and Estimation, J. Am. Stat. Assoc., 102, 359–378, https://doi.org/10.1198/016214506000001437, 2007. a
Gneiting, T., Raftery, A. E., Westveld III, A. H., and Goldman, T.: Calibrated Probabilistic Forecasting Using Ensemble Model Output Statistics and Minimum CRPS Estimation, Mon. Weather Rev., 133, 1098–1118, https://doi.org/10.1175/mwr2904.1, 2005. a, b, c
Hamill, T. M., Bates, G. T., Whitaker, J. S., Murray, D. R., Fiorino, M., Galarneau, T. J., Zhu, Y., and Lapenta, W.: NOAA's second-generation global medium-range ensemble reforecast dataset, B. Am. Meteorol. Soc., 94, 1553–1565, https://doi.org/10.1175/BAMS-D-12-00014.1, 2013. a
Hothorn, T., Hornik, K., and Zeileis, A.: Unbiased Recursive Partitioning: A Conditional Inference Framework, J. Comput. Graph. Stat., 15, 651–674, https://doi.org/10.1198/106186006X133933, 2006. a
Hothorn, T., Hornik, K., Van De Wiel, M. A., and Zeileis, A.: Implementing a class of permutation tests: the coin package, J. Stat. Softw., 28, 1–23, 2008. a
Jordan, A. I., Krueger, F., Lerch, S., Allen, S., and Graeter, M.: scoringRules: Scoring Rules for Parametric and Simulated Distribution Forecasts, R package version 1.1.1, https://cran.r-project.org/web/packages/scoringRules/, 2023. a
Kneib, T., Silbersdorff, A., and Säfken, B.: Rage against the mean–a review of distributional regression approaches, Econometrics and Statistics, 26, 99–123, https://doi.org/10.1016/j.ecosta.2021.07.006, 2021. a
Lang, M. N., Lerch, S., Mayr, G. J., Simon, T., Stauffer, R., and Zeileis, A.: Remember the past: a comparison of time-adaptive training schemes for non-homogeneous regression, Nonlin. Processes Geophys., 27, 23–34, https://doi.org/10.5194/npg-27-23-2020, 2020. a
Lerch, S. and Thorarinsdottir, T. L.: Comparison of non-homogeneous regression models for probabilistic wind speed forecasting, Tellus A, 65, 21206, https://doi.org/10.3402/tellusa.v65i0.21206, 2013. a
Matheson, J. E. and Winkler, R. L.: Scoring rules for continuous probability distributions, Manage. Sci., 22, 1087–1096, 1976. a
Meinshausen, N. and Ridgeway, G.: Quantile regression forests, J. Mach. Learn. Res., 7, 983–999, 2006. a
Messner, J. W., Mayr, G. J., and Zeileis, A.: Heteroscedastic Censored and Truncated Regression with crch, R J., 8, 173–181, https://doi.org/10.32614/RJ-2016-012, 2016. a, b, c
Messner, J. W., Mayr, G. J., and Zeileis, A.: Nonhomogeneous boosting for predictor selection in ensemble postprocessing, Mon. Weather Rev., 145, 137–147, https://doi.org/10.1175/MWR-D-16-0088.1, 2017. a
Rasp, S. and Lerch, S.: Neural networks for postprocessing ensemble weather forecasts, Mon. Weather Rev., 146, 3885–3900, https://doi.org/10.1175/MWR-D-18-0187.1, 2018. a
Rigby, R. A. and Stasinopoulos, D. M.: Generalized additive models for location, scale and shape, J. Roy. Stat. Soc. C-App., 54, 507–554, https://doi.org/10.1111/j.1467-9876.2005.00510.x, 2005. a
Scheuerer, M.: Probabilistic Quantitative Precipitation Forecasting Using Ensemble Model Output Statistics, Q. J. Roy. Meteor. Soc., 140, 1086–1096, https://doi.org/10.1002/qj.2183, 2014. a
Schlosser, L., Stauffer, R., and Zeileis, A.: RainTyrol: Precipitation Observations and NWP Forecasts from GEFS, R package version 0.2-0/r2952, https://R-Forge.R-project.org/projects/partykit/ (last access: 15 November 2023), 2020. a
Schlosser, L., Lang, M. N., Hothorn, T., and Zeileis, A.: disttree: Trees and Forests for Distributional Regression, R package version 0.2-0/r3189, https://R-Forge.R-project.org/projects/partykit/ (last access: 15 November 2023), 2021. a
Schoenach, D., Simon, T., and Mayr, G. J.: Postprocessing ensemble forecasts of vertical temperature profiles, Adv. Stat. Clim. Meteorol. Oceanogr., 6, 45–60, https://doi.org/10.5194/ascmo-6-45-2020, 2020. a
Schulz, B. and Lerch, S.: Machine learning methods for postprocessing ensemble forecasts of wind gusts: A systematic comparison, Mon. Weather Rev., 150, 235–257, https://doi.org/10.1175/MWR-D-21-0150.1, 2022. a
Seibold, H., Zeileis, A., and Hothorn, T.: Individual treatment effect prediction for amyotrophic lateral sclerosis patients, Stat. Methods Med. Res., 27, 3104–3125, https://doi.org/10.1177/0962280217693034, 2018. a
Seibold, H., Zeileis, A., and Hothorn, T.: model4you: an R package for personalised treatment effect estimation, J. Open Res. Softw., 7, 17, https://doi.org/10.5334/jors.219, 2019. a, b
Simon, T., Mayr, G. J., Umlauf, N., and Zeileis, A.: NWP-based lightning prediction using flexible count data regression, Adv. Stat. Clim. Meteorol. Oceanogr., 5, 1–16, https://doi.org/10.5194/ascmo-5-1-2019, 2019. a
Stauffer, R., Mayr, G. J., Messner, J. W., Umlauf, N., and Zeileis, A.: Spatio-temporal precipitation climatology over complex terrain using a censored additive regression model, Int. J. Climatol., 37, 3264–3275, https://doi.org/10.1002/joc.4913, 2017a. a
Stauffer, R., Umlauf, N., Messner, J. W., Mayr, G. J., and Zeileis, A.: Ensemble postprocessing of daily precipitation sums over complex terrain using censored high-resolution standardized anomalies, Mon. Weather Rev., 145, 955–969, https://doi.org/10.1175/MWR-D-16-0260.1, 2017b. a
Taillardat, M., Mestre, O., Zamo, M., and Naveau, P.: Calibrated ensemble forecasts using quantile regression forests and ensemble model output statistics, Mon. Weather Rev., 144, 2375–2393, https://doi.org/10.1175/MWR-D-15-0260.1, 2016. a, b
Taillardat, M., Fougères, A.-L., Naveau, P., and Mestre, O.: Forest-based and semiparametric methods for the postprocessing of rainfall ensemble forecasting, Weather Forecast., 34, 617–634, https://doi.org/10.1175/WAF-D-18-0149.1, 2019. a
Thorarinsdottir, T. L. and Gneiting, T.: Probabilistic forecasts of wind speed: Ensemble model output statistics by using heteroscedastic censored regression, J. Roy. Stat. Soc. A Sta., 173, 371–388, https://doi.org/10.1111/j.1467-985X.2009.00616.x, 2010. a
Vannitsem, S., Bremnes, J. B., Demaeyer, J., Evans, G. R., Flowerdew, J., Hemri, S., Lerch, S., Roberts, N., Theis, S., Atencia, A., Bouallègue, Z. B., Bhend, J., Dabernig, M., De Cruz, L., Hieta, L., Mestre, O., Moret, L., Plenković, I. O., Schmeits, M., Taillardat, M., Van den Bergh, J., Van Schaeybroeck, B., Whan, K., and Ylhaisi, J.: Statistical postprocessing for weather forecasts: Review, challenges, and avenues in a big data world, B. Am. Meteorol. Soc., 102, E681–E699, https://doi.org/10.1175/BAMS-D-19-0308.1, 2021. a
Zeileis, A., Hothorn, T., and Hornik, K.: Model-based recursive partitioning, J. Computat. Graph. Stat., 17, 492–514, https://doi.org/10.1198/106186008X319331, 2008. a
Short summary
Statistical post-processing is necessary to generate probabilistic forecasts from physical numerical weather prediction models. To allow for more flexibility, there has been a shift in post-processing away from traditional parametric regression models towards modern machine learning methods. By fusing these two approaches, we developed model output statistics random forests, a new post-processing method that is highly flexible but at the same time also very robust and easy to interpret.
Statistical post-processing is necessary to generate probabilistic forecasts from physical...