Process-based diagnostics using atmospheric budget analysis and nudging technique to identify sources of model systematic errors

Matsukawa, Chihiro; Rodríguez, José M.; Milton, Sean F.

doi:10.5194/wcd-6-1539-2025

Articles | Volume 6, issue 4

https://doi.org/10.5194/wcd-6-1539-2025

Articles | Volume 6, issue 4

Research article

26 Nov 2025

Research article |

| 26 Nov 2025

Process-based diagnostics using atmospheric budget analysis and nudging technique to identify sources of model systematic errors

Chihiro Matsukawa, José M. Rodríguez, and Sean F. Milton

Abstract

Identifying sources of model systematic errors is a fundamental step to successfully reduce them in general circulation models by improving the representation of relevant physical processes. In this study, we examine model error sources in the Met Office Unified Model at numerical weather prediction timescale by the combined use of two diagnostics: (1) the relaxation or “nudging” in which wind and/or temperature fields are relaxed back towards analyses throughout the simulations, and (2) atmospheric zonal-mean zonal momentum and thermal budgets. The budget analysis quantifies resolved processes and subsequently estimates unresolved processes as a residual, corresponding to model dynamics and physics, respectively. This correspondence is demonstrated by a direct comparison between the budgets and the model tendencies. A systematic error addressed in this paper is the Northern Hemisphere mid-latitude zonal wind bias in the lower stratosphere in boreal winter, characterized by an initial easterly bias that subsequently develops as a westerly bias. The momentum and thermal budget analysis for control and nudging experiments indicates that a mechanical forcing predominantly from parametrized gravity wave drag causes the easterly error and an overly strong temperature gradient around the tropopause is one of the main sources of the westerly error through the Coriolis forcing. The relevant warm bias over the tropical tropopause is mainly attributed to the budget residual term that corresponds to a thermal forcing dominated by radiative processes. This is consistent with the experimental result that temperature nudging over the tropical tropopause significantly reduces the westerly wind bias.

Download & links

How to cite.

Received: 30 Mar 2025 – Discussion started: 17 Apr 2025 – Revised: 18 Sep 2025 – Accepted: 14 Oct 2025 – Published: 26 Nov 2025

The works published in this journal are distributed under the Creative Commons Attribution 4.0 License. This licence does not affect the Crown copyright work, which is re-usable under the Open Government Licence (OGL). The Creative Commons Attribution 4.0 License and the OGL are interoperable and do not conflict with, reduce or limit each other.

© Crown copyright 2025

1 Introduction

A model systematic error, or bias, is a deviation of the mean model states from the corresponding mean observed states, affecting not only local regions but also other regions remotely through associated teleconnections. Continuous enhancements incorporated into general circulation models (GCMs), such as increasing horizontal and vertical resolutions, improving representations of dynamical and physical processes, and incorporating new physical components, have massively contributed to a significant reduction of model systematic errors in numerical weather predictions (NWP) and climate projections (e.g., Phillips et al., 2004; Bauer et al., 2015). However, further research and developments to reduce model systematic errors are still essential. Identifying origins and sources of model biases is a key step to find out how to improve model process representations to reduce the errors. It is difficult to achieve this goal using long-timescale climate simulations because of interactions of locally generated and remotely forced errors, nonlinear interactions and feedbacks among variables and various physical processes, and possible compensating errors consisting of two or more substantial errors which cancel out each other. Previous studies show that many long-timescale errors develop within the first few days of simulations and the fastest growing errors are probably associated with the model physics (Martin et al., 2010, 2021; Ma et al., 2014). A better understanding of the initial error growth, when errors are more locally forced and models are constrained by data assimilation through initial conditions, yields insights into the relevant sources of model errors at the physical process level (Rodwell and Palmer, 2007). In addition, there has been a wide variety of diagnostic methods to evaluate model systematic errors: for instance, potential vorticity budget diagnostics (e.g., Chagnon et al., 2013; Saffin et al., 2016), semi-geostrophic balance tool (e.g., Sánchez et al., 2020), perturbed parameter ensemble technique (e.g., Sexton et al., 2019; Karmalkar et al., 2019; Williams et al., 2020), single-column models experiments (e.g., Duynkerke et al., 2004; Lenderink et al., 2004; Svensson et al., 2011), model intercomparison projects (e.g., Elvidge et al., 2019; van Niekerk et al., 2020), and WGNE conferences on model systematic errors (e.g., Frassoni et al., 2023).

The relaxation or “nudging” method is a widely used practical framework that involves the addition of artificial terms to the prognostic equations to relax some of the model variables towards given (usually observed or analysed) states throughout the integration (Jeuken et al., 1996). Nudging forcing towards analysed states is comparable to the forecast error at a particular timescale with the reversed signs. Hence, it is beneficial to evaluate the nudging forcing itself to explore the sources of the model systematic errors. In addition, a nudging technique applied to selected regions or model levels, namely “regional nudging”, has also been used for the purpose of understanding their remote influences on other domains and diagnosing origins of forecast errors (e.g., Klinker, 1990; Hoskins et al., 2012; Rodríguez and Milton, 2019).

Another well-established technique is to diagnose atmospheric momentum and thermal budgets in analyses and forecasts. The budget equations describe a time evolution of wind and temperature field and individual contributions from resolved and unresolved processes. In particular, unresolved processes including a mechanical forcing in the momentum budget and a diabatic heating in the thermal budget can alter the time evolution of the corresponding variable as a source or sink term of the prognostic equations. Better representations of these processes, which need to be parametrized in GCMs, are essential for obtaining more accurate predictions (Bauer et al., 2015). An indirect method of quantifying unresolved forcing is to estimate forcing as a residual of the budget equation using observed or analysed data (e.g., Yanai et al., 1973; Hartmann, 1976; Hamilton, 1983; Smith and Lyjak, 1985; Palmer et al., 1986; Holopainen, 1987). This method has been utilized for model diagnostics in some previous studies to identify possible deficiencies in physics parametrizations. Klinker and Sardeshmukh (1992) defined the balance requirement as the sum of all the adiabatic terms in the zonal-mean zonal flow tendency equation with the sign reversed. The comparison of the balance requirement deduced from the analysis data with parametrized tendencies can suggest possible deficiencies in the model physics to balance the sum of the adiabatic terms. Milton and Wilson (1996) applied the same diagnostic as Klinker and Sardeshmukh (1992) and indicated systematic deficiencies in parametrized tendencies. They demonstrated that the incorporation of the new parametrizations associated with subgrid-scale orography leads to better initial momentum balances as well as reduced systematic errors in the general circulation. As an analogous diagnostic, van Niekerk et al. (2016) used an angular momentum budget analysis to examine a sensitivity of the resolved and parametrized surface drag to changes in horizontal resolution and parametrization. Their approach was to use the nudging framework to constrain the contribution from the angular momentum flux convergence (AMFC), and to determine the contributions from the resolved mountain torque and parametrized surface torque by balancing the AMFC. They found that a parametrized orographic torque in their model was excessive at lower resolutions.

The present study is aimed at understanding mechanisms of forecast error growth in global Met Office Unified Model (hereafter MetUM) at NWP timescales, and finding possible sources of the model errors. We focus on the Northern Hemisphere (NH) mid-latitude zonal wind errors from the upper troposphere to the stratosphere in boreal winter. Many studies have been dedicated to understanding mechanisms of the model systematic errors in the different version of the MetUM, particularly for a temperature bias around the tropopause (Hardiman et al., 2015; Bland et al., 2021). Hardiman et al. (2015) have executed several sensitivity experiments to examine the tropical tropopause warm bias that indicate that microphysical and radiative processes influence the temperature. Bland et al. (2021) have investigated a connection between a cold bias and a moist bias in the extratropical lowermost stratosphere through long-wave radiative effects. The approach adopted in this study is to analyse the atmospheric zonal-mean zonal momentum and thermal budgets in initialized NWP hindcasts and various nudging experiments. We use the globally nudged simulation to provide a best possible estimate of the truth to validate individual components of the momentum and thermal budgets in the MetUM. The budgets are examined to identify which component has a dominant contribution to initial and subsequent error growth through their comparison between the non-nudged control experiment and the globally nudged experiment. In addition, we apply the budget analysis to the regional nudging experiments to examine the impact of regional forcing on the general circulation patterns as well as the specific model errors. A combined application of the momentum and thermal budget analysis enables us to obtain information on wind circulation and temperature interactions and possible error compensations.

This paper is organized as follows. Section 2 describes the model and nudging framework used in this study, experiments carried out, and the details of the budget analysis. Section 3 presents a general view of the model systematic errors and highlights the NH mid-latitude zonal wind errors in boreal winter. Section 4 shows zonal-mean zonal momentum and thermal budgets in control and global nudging experiments and demonstrates a correspondence between budget components and model tendencies. In Sect. 5, we address the NH mid-latitude lower-stratospheric zonal wind errors and investigate sources of the errors using the budget analysis. Section 5 also examines remote and indirect impacts of regional nudging and its momentum budget. Finally, Sect. 6 summarizes the main conclusions and discusses further potential applications of the diagnostics and possible model error sources.

2 Methodology

In this section, we describe details of the global MetUM used in this study, the nudging framework in the MetUM, the experimental design, and the zonal-mean zonal momentum and thermal budget analysis.

2.1 Model description

The MetUM is a numerical model which has been developed for use in regional and global simulations across weather to climate timescales (Cullen, 1993; Senior et al., 2011; Brown et al., 2012). Its scientific configuration used in this study is a global atmosphere and land configuration with a version of Global Atmosphere Land 9 (hereafter GAL9, in prep.; containing incremental upgrades from the former versions e.g., Walters et al., 2019), which is uncoupled with ocean and sea-ice dynamical model. The dynamical core, ENDGame, adopts a semi-implicit semi-Lagrangian formulation to solve the non-hydrostatic, fully compressible deep-atmosphere equations of motion (Wood et al., 2014). The atmospheric prognostic variables are zonal, meridional, and vertical wind velocity, dry virtual potential temperature, Exner pressure, dry density, and moist prognostic variables such as mixing ratio of moisture variables (water vapour, cloud water, and cloud ice) and cloud prognostic fields. Physical processes which are not represented or not resolved in the dynamical core, such as friction, condensation and evaporation, radiative heating and cooling, and too small-scale phenomena to be resolved at the grid scale, are accounted for by parametrizations. Parametrizations employed by the MetUM include shortwave and longwave radiation (Edwards and Slingo, 1996), microphysics (Wilson and Ballard, 1999), gravity wave drag consisting of non-orographic gravity wave (Scaife et al., 2002) and sub-grid scale orographic drag (Appendix in Vosper, 2015 for details), convection (Gregory and Rowntree, 1990), turbulent mixing represented by the boundary-layer scheme (Lock et al., 2000), and large-scale cloud (Wilson et al., 2008 a, b). Processes at the land surface and in the subsurface soil are represented by a community land surface model, the Joint UK Land Environment Simulator (JULES; Best et al., 2011). Contributions of the model dynamics and the individual physics parametrizations to the time evolution of prognostic variables can be diagnosed using increments per model time step, or tendencies that are equivalent to the increments per unit time.

Different applications of the MetUM across a wide range of temporal and spatial scales employ essentially the same model configuration, such as dynamical core and physics parametrizations. The MetUM used in this study has a horizontal resolution of N320 grid (0.5625° longitude ×0.375° latitude; approximately 40 km in the midlatitudes) with 70 vertical levels extending to 80 km altitude. The forecast model time step is 12 min. The horizontal resolution is lower than that of the Met Office's operational global deterministic NWP model (N1280 grid; approximately 10 km horizontal resolution in the midlatitudes). Under the across-scale approach, we adopt a moderate horizontal resolution to investigate large-scale model systematic errors, which provides benefits with regard to computational resources. However, since representations of the dynamical core and the physics parametrizations may have a sensitivity to model horizontal resolutions, extending our analysis to different resolutions of the MetUM may also be required.

2.2 Nudging

The nudging technique with Newtonian relaxation is a method that relaxes predicted variables of GCMs back towards given meteorological fields by adding an unphysical relaxation term to the prognostic equations (Jeuken et al., 1996). The nudging process incorporated into the MetUM at the very end of each model time step is written as follows (Telford et al., 2008; van Niekerk et al., 2016):

\begin{matrix} (1) & X_{F} = X_{M} + \frac{δ t}{τ} (X_{A} - X_{M}) \end{matrix}

where X is the prognostic model variable at the current model time step, δt is the time interval of the model integration (i.e., 12 min in this study), and τ is the relaxation timescale of nudging. Subscripts F, M, and A denote the variables after nudging, those after dynamics and physics calculations just before nudging, and those used as a nudging forcing, respectively. In this study, the 6 h operational MetUM analysis at N1280 grid is regridded onto the model resolution (i.e., N320 grid) and then used as a forcing. Since the analysis data are available every 6 h, they are linearly interpolated into each model time step. The choice of the relaxation timescale is arbitrary. The shorter the relaxation timescale is, the more strongly the model prognostic variables are relaxed towards forcing data (i.e. MetUM analysis data). Too long relaxation timescale is ineffective, and too short relaxation timescale could make the model unstable (Telford et al., 2008). We select relaxation timescale used in this study to be 6 h which corresponds to the temporal spacing of the MetUM analysis. Notice that nudging is applied throughout model integration regardless of the relaxation timescale.

Additional increments due to nudging, which are expressed by the second term of the right-hand side of Eq. (1), are calculated as a difference of the variables between forcing data X_A and the constrained model predictions X_M within the nudged run multiplied by the relaxation coefficient. Therefore, the nudging increments are comparable to the forecast errors at least those growing over the timescale of the relaxation. Since the nudging forcing alters the time evolution of nudged prognostic variables alongside the model dynamics and physics, the nudging tendencies are equivalent to the increments per unit time and are comparable with the model tendencies due to the dynamics and physics.

The prognostic model variables X which can be constrained by nudging are zonal wind, meridional wind, and potential temperature. Other prognostic variables, such as vertical wind velocity and mixing ratio of moisture variables, are allowed to evolve freely and respond to the forced variables. Nudged domains and model levels can be prescribed arbitrarily and are accompanied by interfacing transition zones and layers to ensure a smooth transition between the nudged and free-running parts of the simulation. The transition zones and layers are smoothed using the hyperbolic tangent function over 10° in the horizontal and the linear function over two model levels in the vertical.

2.3 Experimental design

To examine model error growth in boreal winter at NWP timescale, deterministic 15 d forecasts are started at 00:00 UTC between 16 November 2018 and 27 February 2019 and evaluated over the period from December 2018 to February 2019 (90 cases in total). A series of the operational global MetUM analyses produced by the data assimilation system, Hybrid-4DVar (Clayton et al., 2013) based on GA6.1 configuration (Walters et al., 2017) operational in 2018/2019, is spatially interpolated to the model resolution and used as the initial conditions. Lower boundary conditions are given by the sea surface temperature and sea ice concentration of the Operational Sea Surface Temperature and Ice Analysis (OSTIA; Donlon et al., 2012) products fixed at the field on initial dates throughout the 15 d simulations.

The momentum and thermal budgets described below in Sect. 2.4 and model tendencies due to dynamics and individual parametrizations (and nudging forcing) are calculated using the experimental data. The budgets are analysed at the same horizontal resolution as the model on 28 pressure levels (i.e., 1000, 950, 925, 850, 700, 600, 500, 400, 300, 250, 200, 150, 100, 70, 50, 30, 20, 10, 7, 5, 3, 2, 1, 0.7, 0.5, 0.3, 0.2, 0.1 hPa) with masking of the levels below the ground. On the other hand, model tendencies are evaluated at the 70 model native levels. Temporal-mean variables required in the analysis of the budgets and the model tendencies are calculated from the fields at each time step in the model.

Table 1Control and nudging experiments executed. U, V, and Θ in nudged variables indicate zonal wind velocity, meridional wind velocity, and potential temperature, respectively. N/A indicates that nudging is not applied in CNTL experiment.

Download Print Version | Download XLSX

Control and various nudging experiments executed in this study are summarized in Table 1. We apply momentum and thermal budget analysis (see details in Sect. 2.4) to the control experiment referred to as CNTL and the global nudging experiment referred to as GLN. We consider individual budget components of GLN as the best possible estimate of truth instead of analysis data which is unable to provide the temporal-mean variables required in this study. This is why the reasonably short relaxation timescale is selected in the nudging experiments. Nudging frameworks which constrain a part of the variables could bring a better understanding of feedbacks among variables and disentangle compensating errors specifically between horizontal wind and temperature (Wehrli et al., 2018). In addition, regional nudging, in which the model variables are relaxed back towards the analyses over subdomains where there might be significant systematic errors, potentially provide experimental insights into origins of the errors. We performed another global nudging experiment in which only temperature is constrained over the globe, referred to as GLNT, and nudging sensitivity experiments in which temperature is constrained over different domains at various altitudes (i.e., NHTrpT, NHTrpTrpT, NHHLT, and NHHLTrpT; see Table 1 in detail) to test how much wind biases in CNTL could be influenced by temperature biases and diagnose the remote impact of nudging. Note that the tropopause temperature nudging in NHTrpTrpT and NHHLTrpT is applied in a different range of pressure levels depending on their latitudinal positions.

2.4 Atmospheric zonal momentum and thermal budgets

The framework of the atmospheric zonal-mean momentum and thermal budgets is a well-established diagnostic tool for examining the contribution of resolved and unresolved processes to the large-scale structure of the atmosphere. Based on the primitive equations, the zonal-mean zonal momentum and thermal budget equations in spherical and pressure coordinates are derived as below (Hartmann, 1976; Andrews et al., 1987):

\begin{matrix} (2) & \begin{aligned} \overline{\frac{\partial [u]}{\partial t}} = & \underset{Mean Stationary Flow Advection}{\underset{︸}{- (\frac{[\overline{v}]}{a \cos ϕ} \frac{\partial [\overline{u}] \cos ϕ}{\partial ϕ} + [\overline{ω}] \frac{\partial [\overline{u}]}{\partial p})}} \\ \underset{Stationary Eddy Component}{\underset{︸}{- (\frac{1}{a \cos^{2} ϕ} \frac{\partial [{\overline{u}}^{*} {\overline{v}}^{*}] \cos^{2} ϕ}{\partial ϕ} + \frac{\partial [{\overline{u}}^{*} {\overline{ω}}^{*}]}{\partial p})}} \\ \underset{Transient Eddy Component}{\underset{︸}{- (\frac{1}{a \cos^{2} ϕ} \frac{\partial [\overline{u^{'} v^{'}}] \cos^{2} ϕ}{\partial ϕ} + \frac{\partial [\overline{u^{'} ω^{'}}]}{\partial p})}} \\ \underset{Coriolis}{\underset{︸}{+ f [\overline{v}]}} \underset{Residual}{\underset{︸}{+ [\overline{F_{u}}]}} \end{aligned} \\ (3) & \begin{aligned} \overline{\frac{\partial [T]}{\partial t}} = & \underset{Mean Stationary Flow Component}{\underset{︸}{- (\frac{[\overline{v}]}{a} \frac{\partial [\overline{T}]}{\partial ϕ} + [\overline{ω}] \frac{\partial [\overline{T}]}{\partial p} - \frac{R_{d}}{p c_{p}} [\overline{ω}] [\overline{T}])}} \\ \underset{Stationary Eddy Component}{\underset{︸}{- (\frac{1}{a \cos ϕ} \frac{\partial [{\overline{v}}^{*} {\overline{T}}^{*}] \cos ϕ}{\partial ϕ} + \frac{\partial [{\overline{ω}}^{*} {\overline{T}}^{*}]}{\partial p} - \frac{R_{d}}{p c_{p}} [{\overline{ω}}^{*} {\overline{T}}^{*}])}} \\ \underset{Transient Eddy Component}{\underset{︸}{- (\frac{1}{a \cos ϕ} \frac{\partial [\overline{v^{'} T^{'}}] \cos ϕ}{\partial ϕ} + \frac{\partial [\overline{ω^{'} T^{'}}]}{\partial p} - \frac{R_{d}}{p c_{p}} [\overline{ω^{'} T^{'}}])}} \\ \underset{Residual}{\underset{︸}{+ \frac{[\overline{Q}]}{c_{p}}}} \end{aligned} \end{matrix}

where u and v are zonal and meridional components of wind velocity, ω is vertical pressure velocity, a is the mean radius of Earth, ϕ is latitude, p is pressure, f is the Coriolis parameter, T is temperature, R_d is the gas constant, c_p is the specific heat of air at constant pressure, F_u is a mechanical forcing term of the zonal momentum equation, and Q is a diabatic heating rate. Overbars and primes denote the temporal mean and departure from the temporal mean, respectively, and square brackets and asterisks denote the zonal mean and departure from the zonal mean, respectively. The temporally and zonally averaged flow (e.g., $[\overline{u}]$ ) is referred to as mean stationary flow, and the departure from the temporal mean (e.g., $u^{'} = u - \overline{u}$ ) is referred to as transient eddy, and the departure of the temporally averaged variable from its zonal mean (e.g., ${\overline{u}}^{*} = \overline{u} - [\overline{u}]$ ) is referred to as stationary eddy.

The right-hand side of Eq. (2), in order, represents an advection of mean zonal wind by mean stationary flows, a convergence of stationary eddy momentum fluxes, that of transient eddy momentum fluxes, Coriolis forcing, and a residual term. The transient and stationary eddy fluxes are evaluated, for instance, as $\overline{u^{'} v^{'}} = \overline{u v} - \overline{u} \overline{v}$ and $[{\overline{u}}^{*} {\overline{v}}^{*}] = [\overline{u} \overline{v}] - [\overline{u}] [\overline{v}]$ , respectively. In Eq. (3), the right-hand side, from left to right, shows a warming/cooling due to advection of mean temperature by mean stationary flow and adiabatic heating with mean vertical motions, a convergence and an energy conversion of stationary eddy heat flux, those of transient eddy heat flux, and a residual term. The terms other than the residual term are interpreted as contributions from resolved dynamical components and are evaluated directly from the forecast data specifically u, v, ω, and T fields. As in the manner in Martineau et al. (2016), a second-order centered difference scheme is used for calculating meridional and vertical derivatives, while meridional derivatives over the poles and vertical derivatives at the top or bottom of the pressure levels require a first-order difference with their adjacent grids.

The last term of right-hand side of Eqs. (2), (3), determined as a residual of the other terms including the left-hand side and referred to as residual term, represents contributions of the processes which are not represented by the resolved processes. In the partial differential equations of Eqs. (2), (3), the residual term corresponds to a source or sink of the momentum and thermodynamic equations which consists of the frictional forcing and the diabatic heating due to radiative forcing, latent heat releases, and latent and sensible heat fluxes at the surface. On the other hand, the residual term diagnosed using spatially discretized data could include effects of subgrid processes, which are too-small scale to be resolved in a grid scale, on a grid-box mean field as well as the frictional or diabatic forcing. Ideally, the residual term should be qualitatively equal to the total parametrized forcing calculated in the model (Holopainen, 1987), although this is not necessarily the case as described in Sect. 2.5. It is noteworthy that “residual” in this study, which is equivalent to estimated unresolved tendencies, is defined differently from “residual” termed in some previous studies (e.g., Klinker and Sardeshmukh, 1992; Milton and Wilson, 1996), which is the difference between estimated unresolved tendencies in the analysis and parametrized physics tendencies.

There are several options of how to define and compute temporal-mean fields denoted by overbars in Eqs. (2), (3), and it changes what the temporal derivative term on the left-hand side means. In this study, overbars represent the temporal averaging over the given forecast length in each case, not over the 3 months of the experimental period. Thus, a temporal-mean field over the forecast length (e.g., $\overline{u}$ ) varies among experiments and also among cases. The temporal average over the forecast length is computed using the fields at each model time step, which is the reason why globally nudged simulation data instead of analysis data is used to calculate individual budget components for validations because the analysis data has only 6 h instantaneous fields. This definition of temporal mean allows the budget equation to describe a time evolution over a particular forecast length in a single case, which makes it straightforward to diagnose forecast error growth at NWP timescale. The left-hand side of the Eqs. (2), (3) indicating a total tendency of the variable is evaluated as a temporal change from initial conditions to model states at a given forecast lead time.

The momentum and thermal budget analysis is applied to each simulated case individually, and then averaged over the cases if mean budgets are examined. In this context, the terms “mean flow” and “transient eddy” in this paper should be interpreted cautiously because the methodology is different from that commonly used in climatological studies (e.g., Peixoto and Oort, 1992). The partition of the nonlinear advective term among three components including mean stationary flow, stationary eddies, and transient eddies becomes slightly more complex and critically depends on the forecast lead time in this study. For example, in the momentum budget from an initial condition to a forecast at Day 1, if a migratory cyclone or anticyclone is simulated to be quasi-stationary throughout that time period, it is identified as the stationary eddy rather than the transient eddy. Therefore, the contribution of the transient eddy flux becomes smaller at a shorter forecast lead time, and the mean stationary flow and the stationary eddy components are responsible for the remaining part. Although the Coriolis forcing term involves the mean stationary flow and constitutes a part of the resolved processes, it can be independently evaluated regardless of the forecast length because of its linearity. The partition should depend not only on the definition of the temporal mean but also on the horizontal and vertical resolution of the model (and data) and the configuration of the model dynamics and physics.

2.5 Atmospheric budgets and model tendencies

Whilst the model dynamics and physics tendencies themselves, calculated in the model online, are not used for the offline budget calculation in our framework, the diagnosed resolved processes and the residual term correspond to the model dynamics and physics tendencies, respectively. A comparison of two diagnostics, atmospheric budgets and model tendencies, is helpful in understanding their physical meaning. The budget analysis diagnosed using forecast data on pressure levels quantifies contributions to the total tendencies from the resolved processes and the unresolved processes. On the other hand, model tendencies of prognostic variables are calculated by dynamical core and physics parametrizations in numerical models at each model time step during their time integration at each model level. Individual components of both atmospheric budgets and model tendencies represent contributions of the processes in the respective framework to the total tendency of the corresponding variable. Therefore, the atmospheric budget diagnostics can be compared with the model tendencies to demonstrate a qualitative similarity between the unresolved term in the budget equation and the model tendencies of the physics parametrizations and between the resolved term and the tendencies of the model dynamics.

The correspondence between the budget diagnostics and the model tendencies is expected from the derivation of the budget equation, but they are not necessarily exactly equivalent. The residual term could contain effects from imbalances caused by nudging increments (in the case of nudging experiments) and computational errors through the interpolation of forecast fields from model levels to pressure levels and the numerical methods employed to evaluate the budgets (Martineau et al., 2016) as well as the unresolved processes. A discrepancy between the primitive equations used in the budget diagnostics and the governing equations of the MetUM could also generate a difference between the resolved processes and the model dynamics tendencies and therefore between the residual term and the parametrization tendencies to some extent. In the case of nudging experiments, forcing due to nudging, which is associated with the very short timescale model error in the MetUM, should also constitute a part of the residual term.

2.6 Validation of atmospheric budgets

To validate the budget components in CNTL and identify which components contribute to a particular forecast error growth, we use the nudging technique to create quasi-analysis data and calculate the best possible estimate of truth of atmospheric budgets. An error of a given forecast variable against analysis in a single case, defined as its difference between forecast and analysis, is proportional to an error of total tendency as follows:

\begin{matrix} (4) & \begin{aligned} x_{e} (t_{i}; Δ t) & \equiv x_{f} (t_{i}; Δ t) - x_{a} (t_{i} + Δ t) \\ = \{\overline{{(\frac{\partial x}{\partial t})}_{f, t_{i}}} - \overline{{(\frac{\partial x}{\partial t})}_{a, t_{i}}}\} Δ t \end{aligned} \end{matrix}

where x is a given variable, t_i is an initialized time in a given single case, and Δt is a forecast length. The subscripts e, f, and a denote error, forecast, and analysis, respectively. The overbars are the same as defined in Eqs. (2), (3). In this study, the total tendency of the analysis averaged over a given forecast length is approximated by that of GLN, and we refer to a difference in budget components between CNTL and GLN as “quasi-error”. As a result, the validation of the momentum and thermal budgets can decompose a quasi-error in the total tendencies into individual budget components.

The atmospheric budgets of GLN should be interpreted with caution because they have an error against the analysis. Firstly, nudged variables in nudging experiments still have errors against the MetUM analysis. The nudging in some cases is not able to relax sufficiently towards the forcing data with a moderate relaxation timescale especially if the error grows rapidly at a very short timescale. In addition, the vertical velocity, required by the budget analysis but not directly nudged, is driven by the horizontal wind and temperature nudging forcing. Therefore, there should be a non-negligible error in the individual budget components in GLN, making it difficult to precisely evaluate the residual term as a consequence. Secondly, since the 6 h MetUM analyses used as a forcing data of the nudging are linearly interpolated into each model time step as part of the nudged simulation methodology, the temporal interpolation leads to an underestimation of the intra-6-hour variability, and therefore results in underestimated transient eddy components and consequently overestimated residual terms. Although these deficiencies might affect a quasi-error in budget components of CNTL against GLN to some extent, it is reasonable to use GLN as the best estimate of truth.

The interpretation of differences in the residual term among experiments is slightly complex. It is attributed, by definition, to differences in the other resolved terms because the residual term in each experiment is determined by the sum of the other terms to close the budget. From another perspective, the residual term is expected to correspond to the sum of the physics parametrization tendencies and the nudging forcing. The quasi-error in the residual term against GLN could provide some implications for deficiencies in the parametrizations through a comparison between the residual term and the individual parametrization tendencies.

Unlike the budget diagnostics, a difference in individual model tendencies between CNTL and GLN does not show a quasi-error in the model tendencies. The physics parametrization schemes themselves used in CNTL and GLN are identical and have substantial deficiencies in principle. The parametrization tendencies in GLN are calculated using the deficient scheme with nudged grid-box mean fields which are much closer to the analysis than CNTL. The difference in parametrization tendencies between two experiments represents a difference in the responses of the parametrization scheme to different grid-box mean fields.

https://wcd.copernicus.org/articles/6/1539/2025/wcd-6-1539-2025-f01

Figure 1Zonal-mean errors of the CNTL forecast at a forecast lead time of (a, b) Day 1, (c, d) Day 5, and (e, f) Day 15 averaged over the 90 cases from 1 December 2018 to 28 February 2019 in (a, c, e) zonal wind velocity [m s⁻¹], and (b, d, f) temperature [K]. Contour indicates the forecasts, and colour indicates mean errors against the MetUM analysis. The black horizontal lines A, B, and C are inserted for later reference (Figs. 2, 12, 13, and 14; see the text in detail).

Process-based diagnostics using atmospheric budget analysis and nudging technique to identify sources of model systematic errors

2.1 Model description

2.2 Nudging

2.3 Experimental design

2.4 Atmospheric zonal momentum and thermal budgets

2.5 Atmospheric budgets and model tendencies

2.6 Validation of atmospheric budgets

4.1 Zonal momentum budgets

4.2 Thermal budgets

5.1 Zonal momentum and thermal budgets

5.2 Nudging sensitivity experiments