the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Considerable yet contrasting regional imprint of circulation change on summer temperature trends across the Northern hemisphere mid-latitudes
Peter Pfleiderer
Anna Merrifield
István Dunkl
Homer Durand
Enora Cariou
Julien Cattiaux
Gustau Camps-Valls
Sebastian Sippel
Rising summer temperatures and more frequent heat extremes are well-documented outcomes of anthropogenic climate change. However, the extent to which atmospheric circulation changes contribute to these trends remains contested. Regional differences across the northern mid-latitudes suggest that circulation plays a role, yet robustly quantifying its contribution over multiple decades is very challenging. We address this by systematically testing statistical and machine learning methods that decompose temperature signals into a thermodynamic and a dynamic contribution against climate model simulations. Specifically, we use unforced simulations with circulation nudged to match a forced simulation that includes anthropogenic emissions and land-use change. We apply decomposition methods to the forced simulations and compare their estimates of circulation-induced trends with those found in nudged circulation simulations. Our analysis reveals that most methods accurately identify the sign of circulation-induced changes in temperature, although they consistently underestimate their magnitude. Despite this limitation, the results demonstrate that circulation changes have made a substantial contribution to summer temperature trends across the northern mid-latitudes. In Europe, a hotspot region, we estimate that up to half of the observed summer warming between 1979 and 2023 can be attributed to circulation trends. Furthermore, circulation trends have contributed to warmer summer temperatures over Western North America, Central Siberia, Mongolia, Central China, and northeastern Canada. Yet, circulation changes have cooled summer temperatures over Eastern and Central North America, Eastern China, and Central Asia. Overall, our results, based on multiple methods, confirm a circumglobal mid-latitude pattern of considerable, yet contrasting, contributions of circulation changes to summer temperature trends.
- Article
(10056 KB) - Full-text XML
- BibTeX
- EndNote
European summers have warmed considerably due to anthropogenic climate change, driving major ecological changes and societal impacts. This intensification of warm seasons is attributed directly to human activity (Seneviratne et al., 2021). The main reason for this warming is of a thermodynamic nature: warm seasons and heat extremes occurring in a warmer atmosphere are also warmer. However, also various other effects can contribute to regional trends in summer heat, including changes in atmospheric circulation (Teng et al., 2022; Rousi et al., 2022; Vautard et al., 2023; Singh et al., 2023), or feedbacks due to land–atmosphere interactions (Seneviratne et al., 2006). As a result, regional trends in summer temperatures differ strongly across the world and even across the northern hemispheric mid-latitude land area.
In the mid-latitudes, large-scale circulation is a crucial driver for heat extremes (Rousi et al., 2022; Röthlisberger and Papritz, 2023) and there is great interest in understanding to what extent atmospheric circulation contributed to individual events (Cattiaux et al., 2010; Sippel et al., 2024; Zeder and Fischer, 2023), trends in heat extremes (Rousi et al., 2022; Singh et al., 2023) or seasonal temperatures (Teng et al., 2022). Forced changes in jet stream position and strength (Dong et al., 2022; Rousi et al., 2022; Woollings et al., 2023; Shaw and Miyawaki, 2024), changes in storm track intensity (Coumou et al., 2015; Chemke and Coumou, 2024) and the resulting changes in weather pattern frequencies (Horton et al., 2015; Hanna et al., 2018; Fabiano et al., 2021) are likely to affect local climate conditions (Pfleiderer et al., 2019). Over the observational record, these forced changes, however, are small compared to internal climate variability (Eyring et al., 2021). Estimating the contribution of atmospheric circulation changes to local temperature trends and quantifying the extent to which these changes are due to forced or internal variability is crucial for our understanding of past summer temperature trends (Merrifield et al., 2017; Teng et al., 2022; Vautard et al., 2023) and extreme events (Terray, 2021).
In some regions, the observed trends are falling outside the range of model-simulated expected trends (e.g., Western Europe, Teng et al., 2022; Rousi et al., 2022; Vautard et al., 2023; Kornhuber et al., 2024). Potentially, this indicates that model-simulated low-frequency variability in large-scale atmospheric circulation is too weak, or that a forced change in circulation is missing in the models, notwithstanding the broad uncertainty across models (Shepherd, 2014). The missing low-frequency hypothesis is also challenging to assess because the available observations are relatively short. Understanding past circulation changes and their contribution to temperature trends may provide an opportunity to constrain future changes in summer temperature trends, acknowledging that future circulation changes remain a huge source of uncertainty (Topál and Ding, 2023; Fereday et al., 2018).
Identifying the contribution of atmospheric circulation to temperature trends is not straightforward (Deser et al., 2016). Two main approaches are commonly used. The first applies statistical or machine learning methods to decompose observed or simulated temperature trends into thermodynamic and circulation-driven components, often referred to as dynamical adjustment (Deser et al., 2016; Smoliak et al., 2015; Sippel et al., 2019). The second uses nudged circulation simulations, in which the circulation is prescribed and the thermodynamic component is removed (Wehrli et al., 2018). Both options have their limitations: Nudged circulation simulations are limited by the representation of physical mechanisms in the climate model used and might be subject to inconsistencies introduced by the nudging (discussed in Sect. 4). On the other hand, most statistical decomposition methods are designed to capture the relationship between daily circulation patterns and daily temperatures. They do indeed capture day-to-day variability very well. Good skill is obtained on monthly or inter-annual time scales as well (Smoliak et al., 2015; Sippel et al., 2019; Cariou et al., 2025). Whether they can adequately capture a long-term trend is, however, more challenging to test, because processes determining long-term trends may be distinctly different from those that determine short-term circulation variability, and much fewer verification samples are available. Moreover, benchmarks for circulation-induced long-term trends have not been available to date, and to our knowledge, no systematic comparison of dynamical adjustment methods has been conducted.
In this study, we present a comprehensive assessment of circulation-induced summer temperature trends across the northern mid-latitudes (30–60° N) using both statistical decomposition methods and nudged circulation simulations. We focus on two key questions. First, can statistical-empirical methods reliably estimate circulation-driven long-term trends when tested against a climate model benchmark?
To address this, we use a set of CESM2 nudged circulation simulations specifically designed to provide a reference for comparing circulation-induced trends with those derived from statistical methods. Second, we identify circulation-induced summer temperature trends across the northern hemispheric mid-latitudes in observations using four different statistical methods, as well as in CESM2 simulations that are nudged to the ERA5 circulation but driven without anthropogenic forcing.
We use simulations of the fully coupled Community Earth System Climate Model, Version 2 (CESM2) (Danabasoglu et al., 2020a), including simulations from the CESM2 large ensemble (Rodgers et al., 2021). Decomposition methods are first tested on nudged circulation CESM2 simulations. In Sect. 3.2 we then apply decomposition methods to the European Centre for Medium-Range Weather Forecasts (ECMWF) Reanalysis v5 (ERA5) (Hersbach et al., 2020) for the period 1979–2023.
2.1 CESM2 nudged circulation simulations driven with CESM2 horizontal winds
To derive a benchmark for evaluating the decomposition methods, we use nudged circulation simulations conducted with the fully coupled CESM2 (Danabasoglu et al., 2020a). First, three standard simulations following historical greenhouse gas emissions, aerosol emissions, and land use changes from 1850 to 2014, and anthropogenic forcings following the scenario SSP3-70 (O'Neill et al., 2016) from then onward (referred to as “hist+SSP370”) are created. These simulations follow the protocol of the CESM2 large ensemble (Rodgers et al., 2021). The initial conditions for these runs are taken from a long piControl (e.g., no anthropogenic greenhouse gas and aerosol emissions, as well as no land use change) simulation and they are separated by 100 years (namely year 1300, 1400 and 1500) to guarantee independent ocean states.
In a second step, for each of these simulations, a nudged circulation simulation is created for piControl forcing. Each of these simulations starts with the same initial conditions as its corresponding hist+ssp370 simulation. 6 hourly global meridional and horizontal winds are nudged globally and throughout the atmosphere (at all vertical levels) to their corresponding hist+ssp370 simulation. The nudging is achieved via a regular relaxation procedure described in the handbook of the Community Atmosphere Model version 6 (CAM6) (camdoc, 2025). The nudging procedure has been used in previous studies (Topál and Ding, 2023). These simulations are referred to as “piControl-nudged”, as they lack direct anthropogenic forcing. Still, through the nudging of atmospheric circulation, any potential anthropogenic forcing on horizontal wind fields is present alongside the internal circulation variability of the atmosphere from the hist+ssp370 simulation. Because land–atmosphere coupling influences near-surface circulation differently under varying climate forcings, it is recommended to apply nudging only at higher altitudes, allowing surface winds to evolve freely when extreme events are to be studied (Wehrli et al., 2018; Merrifield et al., 2019). Although these effects may influence individual events, the spatial patterns of long-term circulation trends remain robust regardless of whether near-surface winds are nudged (Singh et al., 2025).
2.1.1 Conceptual interpretation of CESM2 nudged circulation simulations as a benchmark for dynamical adjustment
In a free-running, fully coupled climate simulation, local and regional temperatures are shaped by both thermodynamic forcing, here represented by global mean surface temperature (GMST) as a proxy for the large-scale thermodynamic background conditions, and atmospheric circulation variability, as well as their interactions (e.g., Deser et al., 2016, Fig. 1). Statistical dynamical adjustment methods seek to separate these influences, but their skill is challenging to evaluate within a coupled system. To address this, we use “piControl-nudged” simulations as benchmarks in which circulation varies while thermodynamic forcing is fixed at pre-industrial levels. This design allows us to isolate and evaluate the circulation contribution independently of thermodynamic trends. Conceptually (Fig. 1), circulation variability is “inherited” from the parent simulation and is expected to dominate local and regional temperature responses. This inherited variability includes both internal fluctuations and possible forced circulation changes. Atmospheric circulation can affect GMST (arrow a in Fig. 1). Additionally, some internal variability unrelated to atmospheric circulation may remain, such as ocean-driven GMST fluctuations (arrow b in Fig. 1). Note that the decomposition into “thermodynamic” and “circulation-induced” changes is a simplification that overlooks important mechanisms for local temperatures. In Sect. 4, we discuss the implications of this simplification in more detail.
2.1.2 Illustration of CESM2 nudged circulation simulations
As expected, day-to-day variability in the nudged circulation run is closely related to its freely running counterpart from which the wind fields originated. As shown in Fig. A1, in the early period (left column of Fig. A1), geopotential height at 500 hPa and surface air temperature are nearly identical in the hist+ssp370 and the piControl-nudged run. In a warmer climate, day-to-day variability remains highly correlated, but geopotential height and surface air temperatures are uniformly shifted to higher values.
Interpreting the GMST signal in the piControl-nudged runs is not straightforward. Even without external forcing, small GMST trends emerge over 40 year periods (Fig. 2b and c). These trends likely reflect the combined influence of atmospheric circulation and internal ocean variability (see Fig. 1).
Figure 2(a) Global mean surface temperature in hist+ssp370 runs (orange) and in piControl-nudged runs (green). Thick lines are smoothed with a 50 year rolling mean. (b) GMT trend over the period 1979–2023 (gray bar in a). (c) GMT trend over the period 2025–2075 (blue bar in a). The cooler histogram displays 500 trends of the same length from piControl, while the warmer histogram displays 100 trends from the CESM2 large ensemble.
A comparison of 1979–2023 GMST trends in the piControl-nudged simulations with their corresponding freely running hist-ssp370 simulations indicates that both thermodynamic and dynamic contributions are relevant (Fig. 2b). Run 1 is the simulation with the highest GMST trend in the freely running configuration (0.25 K per decade) and near zero trend in the piControl-nudged configuration. Runs 2 and 3 have weaker GMST trends than Run 1 in the freely running hist-ssp370 scenario, and both show a cooling trend in the piControl-nudged scenario. The differences between freely running hist-ssp370 simulations and their corresponding piControl-nudged simulations are not constant, indicating that there is indeed an influence of low frequency internal variability that is not controlled by atmospheric circulation. Therefore, we do not expect the circulation component estimated from the freely running forced simulation to match the trend found in the piControl-nudged simulation exactly. In some decomposition methods (ridge regression and DEA), we can account for the effect of these GMST trends in the piControl-nudged simulations. In general, we assume that the impact of GMST on local temperatures is relatively homogeneous around the mid-latitudes of the northern hemisphere. Thus we expect that the spatial pattern relative to the mid-latitudinal mean is well captured.
2.2 CESM2 nudged circulation simulations driven with ERA5 winds
To evaluate the use of the decomposition methods on observed circulation patterns, we created an additional benchmark simulation by nudging CESM2 to the horizontal wind fields from the reanalysis data ERA5 (Hersbach et al., 2020). This method relies on the same relaxation procedure and anthropogenic greenhouse gas and aerosol forcing as described before, with some modifications to the setup. Here, not only the piControl, but also the “hist+SSP370” simulations are nudged to ERA5 horizontal wind fields between 1940 and 2024. There are two further differences in the simulation setup as opposed to the already described nudged circulation simulations. First, to account for forcing-induced variability at the boundary layer, the atmosphere is only nudged above 700 hPa, similar to Wehrli et al. (2018). Second, the model is run in an atmosphere only configuration (AMIP) with prescribed ocean conditions from the Met Office Hadley Centre's sea ice and sea surface temperature dataset (HadISST; Rayner et al., 2003). For the piControl simulation, the forced component was removed from HadISST using low-frequency pattern filtering (Wills et al., 2020). Sea ice concentration (SIC) was then estimated with a random forest model trained on the SST–SIC relationship in HadISST. To capture seasonal hysteresis, separate models were trained for the freezing season (September–January) and the melting season (February–August), and applied to piControl SSTs to generate corresponding SIC values. This approach reproduces daily temperature anomalies in the northern hemisphere well. However, nudging to observed winds and SSTs also introduces a temperature bias by displacing the model from its own climatology.
2.3 Dynamical adjustment methods
We test four methods designed to disentangle the dynamic effect from the thermodynamic effect: (1) ridge regression, (2) constructed circulation analogues, (3) direct effect analysis (DEA), and a neural network UNET (4).
Note that we apply the methods exactly as they were designed and used in other publications, and therefore different proxies for atmospheric circulation are used by different methods. We do not expect that the choice of variable to represent atmospheric circulation significantly affects the results. In Fig. G1, we show a sensitivity analysis for the ridge regression. In Sect. 4, we discuss differences between the methods and how they might affect the decomposition in more detail.
2.3.1 Ridge regression
Ridge regression is a regularized linear method that can deal with high-dimensional predictors (in our case, many correlated spatial locations with streamfunction values, Φj). For each grid cell, we train a ridge regression model to predict daily mean temperatures in summer (June–July–August, hereafter referred to as JJA), Y, using GMST (yearly averaged) and the streamfunction Φ at 500 hPa in all grid cells G within a 40°×40° rectangular area centered on that grid cell:
where the streamfunction (Φ) is related to horizontal eastward (u) and northward (v) wind speeds as and .
Ridge regression mitigates overfitting by introducing a penalty for model complexity, achieved through the shrinkage of regression coefficients related to the streamfunction covariates. Shrinkage is determined by the sum of squared regression coefficients (known as L2 regularization) and a regularization parameter λ, which controls the degree of shrinkage. The shrinkage term is added to the residual sum of squares (RSS) for the minimization:
As a result, ridge regression solves a joint minimization problem, producing small but nonzero regression coefficients that are relatively evenly distributed among correlated predictors. The regularization parameter λ dictates the extent of shrinkage and is selected via cross-validation or knowledge about the noise variance. Notably, the intercept of the linear model as well as the GMST covariate remain excluded from the shrinkage.
2.3.2 Constructed atmospheric circulation analogues technique
The atmospheric circulation analogue technique, introduced in Deser et al. (2016), is a linear dynamical adjustment method. It is designed to provide empirically derived estimates of climate trends induced by “dynamics” or atmospheric circulation patterns. It achieves this through the re-construction of monthly mean climate fields by linear regression with coefficients derived from a field representative of atmospheric circulation (here, sea-level pressure). The method has been used for a variety of applications, including trend assessments, variability analysis, performance weighting, and extreme event attribution (Deser et al., 2016; Lehner et al., 2017; Merrifield et al., 2017; Guo et al., 2019; Terray, 2021).
The method is based on the construction of a target monthly mean sea level pressure (SLP) field (e.g., January 1980) using analogues. In the January 1980 example, analogues are SLP fields from other Januaries between 1850 and 2014 that resemble the target SLP pattern. The method is applied to every month in the record and proceeds as follows. First, the Euclidean distance between all SLP fields from the period of 1850–2014 and the target SLP field is computed. Euclidean distance is calculated at each grid point and averaged over the Northern hemisphere domain (20–90° N, 0–360° E). The Na=80 closest SLP fields to the target are considered analogues. From the 80 analogue choices, the target month is reconstructed using randomly selected subsets of Ns=50 analogues. The process of choosing 50 out of 80 analogues and reconstructing the target SLP is repeated Nr=100 times to obtain an average best estimate result. Na, Ns, and Nr values are consistent with those used for the method's application to observations in Deser et al. (2016).
The target SLP field Xh is reconstructed through multivariate linear regression. The weight assigned to each SLP analogue, β, is computed through a singular value decomposition of a column vector matrix Xc containing the 50 selected analogues and can also be estimated using a Moore–Penrose pseudoinverse:
where β weights are applied to the corresponding monthly mean temperature fields, i.e., those from the same month as the SLP analogue. The weighted linear combination of these fields defines the dynamic component of temperature for the target month. Before weighting, a quadratic trend is removed from the full temperature record at each grid point to approximate the anthropogenic warming signal (Deser et al., 2016; Lehner et al., 2017). This is done to approximate the unforced relationship between SLP and temperature one would expect to find in a preindustrial control simulation. The dynamic component of temperature is also computed Nr=100 times for each month. Results are averaged, and once every month, the record is dynamically adjusted as described; we obtain a dynamic monthly mean temperature timeseries.
Note that the Moore–Penrose pseudoinverse implicitly deploys a “hard” regularization (i.e., kills directions with singular value exactly zero). In contrast, the previous ridge (Tikhonov) regularization imposes an explicit “soft” regularization (i.e., damps unstable directions even if singular values are just small, not zero).
We use the term “analogue” to refer to a month with an SLP field close to the SLP target. The Euclidean distance selection metric does not require an analogue to match the target month spatially over the whole domain. This step is necessary because, with fewer than 200 available analogues, it is improbable to find a perfect match for any target month. Van Den Dool (1994) estimated that it would take on the order of 1030 years to find two Northern Hemisphere circulation patterns similar within observational uncertainty. As a result, the method relies on imperfect analogues, which can introduce spurious features or bias the amplitude of the estimated dynamic temperature trends.
In this study, analogues are selected from the free-running hist+SSP370 simulations to dynamically adjust (1) each hist+SSP370 simulation and (2) ERA5. Each hist+SSP370 simulation is dynamically adjusted using the “leave-one-out” approach (Deser et al., 2016; Lehner et al., 2017). In the leave-one-out approach, for each month, e.g., June 1900, analogues are selected from all other Junes in the simulation's 1850–2014 period except 1900. The leave-one-out approach is used for the comparison between the hist+SSP370 and piControl-nudged simulations. In the second approach, analogues are selected from the entire 1850–2014 period of each of the hist+SSP370 simulations and used to dynamically adjust ERA5. The resulting three dynamical components of ERA5 are shown in Fig. F1 and are averaged to produce the circulation-induced trend estimates in Fig. 4.
2.3.3 DEA
We employ Direct Effect Analysis (DEA), a recently developed causal representation learning method, to separate the dynamical influence of atmospheric circulation (represented by Z500 EOFs) from the thermodynamical influence of GMST on temperature. The approach aims at disentangling an outcome variable Y into a direct effect component, Ydir, which represents the part of Y directly caused by some causal factor Z, and an orthogonal component, Yorth, which corresponds to the part of Y unaffected by Z. Both Z and Y may be influenced by other variables X, which act as confounders, and it is thus necessary to control for these covariates to get a correct estimate of the direct effect of Z on Y. This learned representation of Y can be seen as the result of encouraging conditional independence between Z and Y while controlling for X (Durand et al., 2025).
In this context, the outcome Y represents the temperature field – Yorth being its dynamical component – which is a random vector with one dimension per grid cell. The predictor Z is the monthly mean GMST, used as a proxy for thermodynamic temperature changes. As covariates X, we include the leading Empirical Orthogonal Functions (EOFs) of atmospheric circulation (Z500), denoted where J is selected through 5-fold cross-validation to maximize the R2 score.
Similar to the ridge regression approach described above, we assume the following linear model:
and get the optimal regression parameter matrix using a least squares algorithm. We obtain a matrix B whose columns bi encode how GMST and each EOF influence temperature across grid cells. We emphasize that this is a multivariate regression problem, where Y is a mutlivariate random vector – not a single variable – representing temperature values across multiple grid cells. Each dimension of Y corresponds to one spatial location.
To isolate the dynamical component Yorth of Y, we remove the part aligned with the GMST-related direction, as captured by b1. This is achieved using the linear transformation , where . This transformation ensures that Yorth remains unaffected by any interventions on GMST, and thus represents the dynamical component of Y.
2.3.4 UNET
The final method used in this paper is a convolutional neural network, a UNET structure, recently proposed by Cariou et al. (2025) to link temperature variations to atmospheric circulation. The UNET architecture was initially introduced by Ronneberger et al. (2015) for biomedical image segmentation. It consists of two main components: an encoder and a decoder. The encoder extracts global features from the input (in this case, circulation maps) by progressively reducing spatial resolution while increasing feature depth through convolution and max-pooling layers. The decoder then reconstructs the image using transposed convolutions. Symmetry between the two parts, combined with skip connections, allows the network to preserve and effectively reuse encoded information.
We use this architecture (see Fig. E1) to estimate the part of daily temperature variations (T′, the output) which can be explained by the large-scale circulation, described by the sea level pressure (SLP, the input). Thus, we can write the UNET model as
We follow the methodology described in Cariou et al. (2025). Still, we extend the analysis to a larger spatial domain and train the UNET on daily data from 1850 to 2100 from 8 CESM2 transient simulations (80 % of the data are randomly selected for training, and the remaining 20 % are used for validation). These simulations belong to the CESM2 ensemble, but differ from the three members used to build the nudging experiment (referred to as run 1, run 2 and run 3) in order to prevent overfitting by the UNET. Since the UNET is trained on transient runs (historical and SSP), we must consider climate change in the relationship Eq. (5). The SLP is not detrended, assuming that in the CESM2 model, the forced responses in the SLP is small compared to the daily variability. This assumption is supported by Figs. 2 and 3, which show that the three piControl-nudged simulations do not exhibit significant common trends. However, the forced response is substantial in terms of temperature. Thus, the detrending is made following Rigal et al. (2019): temperature anomalies (T′) are obtained by removing an estimate of the daily non-stationary normal containing both the mean seasonal cycle, which is not circulation-explained, and the climate change signal. The trained model is then tested on three CESM2 piC-nudged runs, with SLP maps standardized using the same values as in the training process.
Figure 3Trend in JJA temperatures over the period 1979–2023 in piControl-nudged (a, f, k) and predicted trends from different decomposition methods. For runs 1 (a–e), 2 (f–j), and 3 (k–o). Estimates from the ridge regression (b, g, l), the analogues (c, h, m), DEA (d, i, n), and UNET (e, j, o). Areas where the predicted trend differs in sign from the piControl-nudged run are highlighted by black hatching.
For ERA5, we use the UNET that was previously trained on CESM2 and retrain it on ERA5 data from 1940 to 1978. This process is known as a fine-tuning method. This two-step approach is motivated by the limited amount of ERA5 data available for training. Pre-training on CESM2 simulations allows the network to learn robust large-scale circulation-temperature relationships from a wide range of situations. The subsequent fine-tuning on ERA5 then adjusts these learned features to the characteristics of ERA5 data studied. SLP maps are standardized with mean and standard deviation calculated on this training period, and the non-stationary normal is computed thanks to an estimate of the forced response obtained with Qasmi and Ribes (2022) method. The inference is then done over the 1979–2023 period.
We evaluate each statistical method's estimate of circulation-induced mid-latitude JJA temperature trends. Estimated trends are derived from CESM2 free-running hist+ssp370 simulations and compared against trends in CESM2 piControl-nudged simulations y, which serve as the benchmark. Method performance is assessed using four skill metrics:
-
The percentage of grid-cells for which the trend sign is correctly identified. This metric provides a general sense of whether the method can correctly capture the sign of the trend, which may be sufficient in specific contexts – for example, in climate change detection.
-
Pearson correlation (pattern correlation across the mid-latitudes). Pearson correlation reflects how well the method captures the spatial pattern of the trend. Some methods may systematically over- or underestimate the magnitude of trends, yet still accurately reproduce their spatial distribution.
-
The coefficient of determination (). R2 is a widely used metric for spatial comparisons, as it accounts for the variance at each location and indicates how much of the observed variability is explained by the prediction. Yet, it is, in contrast to Pearson correlation, sensitive to any bias in the estimated average (Kvålseth, 1985); and hence, a statistical method may show a good spatial Pearson's correlation in its estimates but a poor R-squared score.
-
The regression slope between predicted and benchmark trend estimates. The regression slope indicates whether the method tends to overestimate or underestimate the magnitude of trends.
Note that all metrics are directly applied to the gridded data without any area weighting.
3.1 Evaluation of circulation-induced trends in the historical period (1979–2023) in CESM2 nudged-circulation simulations
Over the period 1979–2023, JJA temperature trends in the piControl-nudged simulations range from −0.35 to 0.35 K per decade (Fig. 3a, d and g). These trends are organized in large regional clusters of alternating signs. Furthermore, the trend patterns differ considerably between the three piControl-nudged runs, indicating that in CESM2, circulation-induced trends are dominated by internal variability and that in CESM2, forced circulation changes are minor. Overall, JJA temperature trends are slightly stronger in run 1, which is likely due to the positive GMT trends in the piControl-nudged runs during this period (see Fig. 2).
Note that most of these trends are not statistically significant (see Fig. B1). Since these trends mostly reflect internal climate variability, it is expected that, from a statistical point of view, the circulation-induced temperature changes at one location are not differentiable from noise. The spatially consistent trend patterns indicate that, although lacking statistical significance, these trends contain valuable information and are worth evaluating.
Using the ridge regression trained on a forced simulation to predict the trends based on the streamfunction of the forced simulation and the GMT of the piControl-nudged run, we get a similar trend pattern as in the piControl-nudged run (Fig. 3b, e and h). Over the mid-latitudinal land area, half of the variability in local temperature trends in the piControl-nudged run is explained by the ridge regression model (compare R2 score in Table 1). For three-quarters of the grid-cells, the sign of the predicted trend is correct, and grid-cells for which the sign of the trend is not indicated correctly are mostly grid-cells with small trends in the piControl-nudged simulation (and the prediction).
Table 1Evaluation metrics comparing trends in piControl-nudged simulations to estimates of circulation induced trends from statistical decomposition methods for land grid-cells between 30–60° N and the period 1979–2023. First block: percentage of grid-cells with correctly predicted signs. Second block: Pearson correlation coefficient. Third block: Coefficient of determination. Fourth block: regression slope (as shown in Fig. C1). See Table D1 for the same evaluation over the period 2025–2075.
The analogue method reveals a positive correlation between predicted and simulated mid-latitude land trends, with a similar percentage of correctly identified signs in Run 1 and slightly lower skill in Runs 2 and 3. Importantly, the analogue method only captures the circulation-driven component of the trend and does not account for GMST contributions in the piControl-nudged simulations. Consequently, performance is lower in runs 2 and 3, where substantial negative GMST trends were simulated. It is important to note, however, that an offset largely influences the relatively poor R2 score in the mean circulation trend (overestimated warming), while the spatial pattern itself shows a rather good resemblance and Pearson correlation compared to the benchmark simulation (Fig. 3).
The DEA method performs well in estimating the sign of circulation-induced trends with 76 % accuracy (see Table 1). Despite the relatively strong correlation between the trend maps (r=0.64), the coefficient of determination is close to zero. Estimates of circulation-induced trends from DEA cover the full range of simulated piControl-nudged trends, including very high and very low trends. This is reflected by a relatively high regression slope between predicted and simulated trends (Fig. C1 and last block in Table 1).
The UNET is performing the best of all tested methods here. With UNET, 84 % of trend signs are predicted correctly; it has the highest Pearson correlation coefficient (0.86) and the highest coefficient of determination (0.66). In comparison to the DEA and ridge regression, UNET tends to predict weaker circulation-induced trends and rarely exceeds magnitudes of 0.2 K per decade. As shown in Fig. C1, this leads to a systematic underestimation of the trend magnitude compared to the piControl-nudged simulation.
The evaluation over a different period (2025–2075) yields similar skill metrics and confirms the above-discussed results (see Figs. D1, D2 and Table D1).
Overall, UNET is the most accurate method for explaining the variance in mid-latitude boreal summer temperature trends. The ridge regression and UNET tend to decompose temperature trends into a regionally smoothed pattern of circulation-induced temperature trends, with a lower likelihood of predicting a strong trend of the wrong sign. DEA and the analogue method project strong trends of the wrong sign in some regions. DEA appears to be more helpful in estimating the potential magnitude of circulation-induced trends, whereas UNET is relatively conservative, estimating trends that are generally too weak. The analogue method also shows skill in predicting the sign of trend, but appears typically less trustworthy then other decomposition methods.
3.2 Identification of the circulation-induced boreal summer temperature trend (1979–2023) in reanalysis
Across both the ERA5 reanalysis and the CESM2 simulations nudged to ERA5 winds, all decomposition methods reveal similar circulation-induced trend patterns for 1979–2023. Over Eurasia, a wave-like structure emerges: strong warming over Central and Eastern Europe (around 30° E), cooling over Kazakhstan and western Siberia (60–90° E), and warming again over Mongolia, eastern Siberia, and Central China (90–120° E), extending toward the Kamchatka Peninsula (Fig. 4).
Figure 4JJA mean temperature trends in ERA5 over the period 1979–2023 (a) decomposed in the circulation-induced (left) and thermodynamic (right) contribution for the ridge regression (d, e), DEA (f, g), analogues (h, i), UNET (j, k) and estimates from CESM2 simulation with horizontal winds nudged to ERA5 winds (b, c).
Across North America, we observe a dipole pattern with positive trends in the western part and negative trends in the central and eastern parts, with positive trends again in the outermost northeastern parts. All decomposition methods as well as the CESM2 simulations nudged to ERA5 winds (Fig. 4j) agree on this broad trend pattern with only little regional deviations. The trend pattern identified with the statistical method is in good agreement with the pattern found in Teng et al. (2022).
Based on our evaluation of decomposition methods against dedicated nudged simulations in the CESM2 setup, we would suggest giving more weight to the results from UNET regarding the sign of circulation-induced trends. For example, this would imply that the positive circulation induced trends over northern North America in the ridge regression are probably wrong (compare low skill of ridge regression in this region, Fig. 3).
Moreover, there remains ambiguity on the magnitude of the above-described trend pattern. The ridge regression, DEA, and UNET suggest a circulation-induced trend of up to 0.3 K per decade over eastern Europe. At the same time, the piControl simulations from CESM2, which were nudged to ERA5 winds, show stronger circulation-induced trends of up to 0.6 K per decade. In other regions, the nudged simulations and DEA exhibit stronger trends, followed by the ridge regression and analogues, while UNET suggests somewhat weaker trends. From the evaluation of the decomposition methods, we know that all methods have indeed a tendency to underestimate the magnitude of circulation-induced trends somewhat, suggesting that the circulation-induced trend over eastern Europe could be around 0.5 K per decade as indicated by the nudged simulations.
In summary, our study confirms the highly variable mid-latitude boreal summer trend pattern found in Singh et al. (2023), Teng et al. (2022), and Vautard et al. (2023) with five independent methods (four statistical methods and a nudged circulation simulation driven by ERA5 horizontal wind fields). The trend pattern highlights several regional warming hotspots where circulation has made a major positive contribution (Teng et al., 2022). While total boreal summer temperature trends are positive across the NH mid-latitudes, circulation has driven cooling in large areas – most notably Central and Eastern North America, Central Eurasia, and, to a lesser extent, coastal eastern China. In these regions, the moderately positive total trends reflect a compensation between circulation-induced cooling and thermodynamic warming.
Many studies have sought to isolate circulation-induced components in climate time series, often referred to as dynamical adjustment (Smoliak et al., 2015; Deser et al., 2016; Guo et al., 2019; Cariou et al., 2025; Singh et al., 2023; Saffioti et al., 2017; Lehner et al., 2017). The core assumption is that circulation variability, primarily driven by internal processes, dominates temperature variability in many regions, while thermodynamic contributions can be derived as the residual (e.g. Deser et al., 2016). This separation of dynamic and thermodynamic components provides a powerful framework for climate attribution (e.g. Shepherd, 2014).
However, while different statistical methods for obtaining circulation-induced components in climate time series are routinely evaluated on short time scales, the estimation of circulation-induced decadal trends has remained a challenge for the climate community and will likely continue to do so. This is because of five main reasons.
First, statistical methods have been found to perform very well on short time scales of day-to-day, month-to-month, or inter-annual variability (Cariou et al., 2025; Smoliak et al., 2015; Sippel et al., 2019). Yet, the difference in the performance on long (that is, trend) time scales versus short time scales has not been quantified so far. Nevertheless, dynamical adjustment has been widely applied on the time scales of trends. A reduced performance on long time scales is expected, and indeed found in this study, because shorter time scales are dominated to the largest extent by circulation-induced variability, whereas on longer time scales other processes are becoming more dominant, such as land–atmosphere interactions (e.g. Merrifield et al., 2017) or long-term warming, both of which may not be straightforward to account for.
Second, and partly related to the previous point, designing a method comparison for the identification of circulation-induced time series is challenging. This is because it is not immediately apparent what the components are that the signal is decomposed into, and which relevant mechanisms can be attributed to these components. In this study, we decompose a trend in local temperatures into a “circulation-induced” component and a “thermodynamic” component without specifying to which of these components changes in other important factors, such as soil moisture or aerosol concentrations, are attributed (see Fig. 5 for the example of land–atmosphere feedbacks). Therefore, the different methods also estimate different trends for the “thermodynamic” component as shown in Fig. 4. The different statistical methods evaluated here were initially developed for similar but slightly different research questions: The analogue method, for instance, was designed to separate the “thermodynamic signal” from “circulation-induced variability.” Yet, it has been shown that summer land–atmosphere interactions remain largely in the residual, thermodynamic component due to the method's setup (Merrifield et al., 2017). On the other hand, machine learning methods such as the UNETs may implicitly identify land–atmosphere interactions as part of circulation variability, if circulation carries an imprint of land–atmosphere variability. In Table 2 we summarize our thoughts on the treatment of land–atmosphere feedbacks in the different approaches.
Figure 5Conceptual illustration of causal relationships influencing local temperatures in a forced climate as in Fig. 1 but with an additional driver of local temperature. In this study we aim at decomposing the influence on local temperature into thermodynamic (GMST) and circulation induced contributions (in blue). Land–atmosphere interactions is a driver we do not explicitly model (in orange).
Third, designing a benchmark for the circulation-induced component of climate time series, such as summer temperatures, is a challenging task. In this study, we use a piControl nudged-circulation approach, where a climate model is nudged to the horizontal winds of a forced transient simulation. This setup provides circulation-induced changes within an otherwise unforced climate simulation. This might lead to inconsistencies between wind fields and their drivers. For example, Arctic amplification and the resulting reduced equator to pole temperature gradient in the hist+ssp370 simulations might lead to weaker westerlies in the northern hemispheric mid-latitudes that would be imposed on the pre-industrial climate. There may also be factors of residual climate variability (such as ocean variability) or feedbacks between circulation and other factors, such as land–atmosphere coupling, that could still affect thermodynamical processes on climate over land (see Table 2). Additionally, summer temperatures in the nudged circulation simulations might be affected by nudging in other seasons. For example, circulation changes can influence soil moisture in late spring which would then have an impact on summer temperatures. Statistical decomposition methods do not use this information. Consequently, we have to admit that the nudged simulations are not a perfect benchmark. Further analysis is required to understand how these limitations affect our estimates of circulation-induced trends and whether a better-suited benchmark test could be designed.
(Merrifield et al., 2017)(Merrifield et al., 2017)(Merrifield et al., 2019)Table 2Expectations on how land–atmosphere interactions might influence the decomposition into “circulation-induced” and “thermodynamic” contributions.
Regarding land–atmosphere interactions, we conclude that the effect on our estimates of circulation-induced trends varies between methods. This increases our confidence in the signals all methods agree on (e.g., circulation-induced warming over Europe). At the same time, there is no systematic (and consistent) difference in how statistical decomposition methods might be affected by land–atmosphere interactions in comparison to how land–atmosphere interactions might influence the nudged simulations. Therefore, the effect of land–atmosphere interactions cannot explain the systematic underestimation of the magnitude of circulation-induced trends in statistical decomposition methods (as compared to the nudged simulations).
Fourth, we present an evaluation of decomposition methods based on one set of nudged simulations from one Earth system model (CESM2). Despite the well-documented performance of CESM2, this is a flaw, as the strength of the links between atmospheric circulation patterns, GMST, and local temperatures might be misrepresented in the model. A follow-up study using multiple ESMs to create a benchmarking dataset would be crucial to constrain our estimates of circulation-induced temperature trends further. The use of such a multi-model ensemble would require adapting the different reconstruction methods slightly. In particular, the UNET would need to be pre-trained on a collection of multi-model data, rather than just CESM2. Preliminary tests conducted on Western Europe suggest that this does not degrade the quality of the reconstruction, especially when the fine-tuning step is applied to early ERA5 data.
Finally, in addition to combining multiple lines of evidence, our study emphasizes the importance of benchmarking efforts for statistical and machine learning approaches. Without evaluating the results against nudged circulation simulations, one would conclude that different decomposition methods project similar trend patterns, with some estimates exhibiting a stronger version of the trend pattern than others. Evaluating which magnitude of the trend pattern is the most likely/plausible is challenging from the statistical analysis alone. Concluding that all decomposition methods applied to observations might be underestimating the magnitude of the trend pattern would be impossible.
Overall, our study demonstrates that statistical methods can effectively identify and separate circulation-induced temperature trends from residual thermodynamic trends. However, their performance declines on climatic timescales compared to shorter timescales. This uncertainty should be carefully considered in future studies that use such estimates for attribution or to constrain projections.
In summary, our analysis targeted two specific research objectives and revealed two distinct findings: First, we evaluated whether statistical-empirical methods can accurately estimate circulation-induced long-term trends in the NH mid-latitudes during boreal summer (and a residual dominated by thermodynamic trends) against a specifically designed climate model benchmark of nudged circulation simulations. Four different statistical methods were tested, and we demonstrated that each of these methods can generally identify the large-scale pattern of circulation variability and changes, even though they are typically trained and validated on short time scales (daily to seasonal). However, the methods showed differences in their ability to reproduce the spatial trend pattern from the nudged circulation benchmark. With three-quarters of correctly estimated signs of trends and coefficients of determination above 50 %, the ridge regression and the UNET methods are performing sufficiently well for the purpose. The UNET has the overall highest scores in most tested skill metrics. However, the UNET method tends to produce underdispersive results, that is, the magnitude of particularly strong circulation trends is often underestimated (irrespective of the sign). DEA and circulation analogues have similar skill in predicting the sign of circulation-induced trends. Still, due to the low coefficient of determination, we would refrain from interpreting the magnitude of regional trends estimated from these methods. Overall, identifying circulation-induced trends on climate time scales in the context of dynamical adjustment studies is possible. Still, it does imply larger uncertainties than for the application on shorter time scales, which needs to be considered in future applications of the techniques.
Our second objective was to identify circulation-induced boreal summer temperature trends across the northern mid-latitudes using four statistical methods and CESM2 simulations nudged to ERA5 circulation without anthropogenic forcing. Large-scale summer circulation trends and their temperature impacts have been widely debated (Teng et al., 2022; Chemke and Coumou, 2024; Rousi et al., 2022; Vautard et al., 2023). Analyzing ERA5 for 1979–2023, we find positive circulation contributions to summer warming over Europe, western North America, and Mongolia. In contrast, a wave-like pattern of circulation-induced cooling appears over Central Eurasia (west Siberia and Kazakhstan) and Central to Eastern North America.
Beyond strengthening confidence in circulation-induced temperature changes, our evaluation also highlights systematic limitations of statistical decomposition methods. Improving understanding of their performance will enhance our ability to attribute regional climate trends. Isolating individual components of historical change is likely to yield a stronger attribution signal, particularly for regional climate change. Focusing on dynamical and thermodynamic changes separately is advantageous, as there are significant differences in the uncertainties of forced changes in these components (Shepherd, 2014). While several attribution studies of circulation changes have been published (Coumou et al., 2015; Chemke and Coumou, 2024; Dong et al., 2022), uncertainties remain large, especially when it comes to attributing the downstream impacts of atmospheric circulation changes. Being able to more robustly decompose a trend into a circulation-induced and a thermodynamic component should also help attribute circulation-induced temperature trends more effectively.
Finally, separating dynamical and thermodynamic components offers a pathway to constrain near-term climate projections using observation-based constraints. The thermodynamic constraint should be straightforward to identify, as it is mainly forced. There are different possibilities to constrain based on the dynamical component. With the assumption that circulation-induced trends over the past decades were primarily due to internal climate variability, one would expect a reversal of the observed trend pattern over the coming decades. If circulation-induced temperature change is forced, both circulation-induced and thermodynamic trends would continue. Due to the considerable uncertainty in the forced circulation-induced changes, a storyline approach would be appropriate, explicitly treating different assumptions about forced atmospheric circulation changes and evaluating the potential outcomes of these scenarios (Shepherd, 2019; Liné et al., 2024).
Figure C1Predicted trends against trends from piControl-nudged for the period 1979–2023 over land grid-cells between 30° N and 60° N from all runs (1,2,3) for the ridge regression (a), DEA (b), analogues (c), and UNET (d). The bulk of the data is represented by a Gaussian kernel density estimate (shadings) while extreme trends are shown as scatter points.
Figure D1Same as Fig. 3 but for the period 2025–2075.
Figure G1Estimates of the circulation induced trends from the ridge regression over the period 1979–2023 in the freely running forced CESM2 simulations. Estimates from the ridge regression using streamfunction at 500 hPa as a covariate for circulation (a, c, e). The same, but with geopotential height at 500 hPa as anomalies to the global mean geopotential height at 500 hPa (b, d, f).
The code required to reproduce this study is available at https://github.com/peterpeterp/circ_contribution_to_JJA_trends.git (last access: 1 December 2025; https://doi.org/10.5281/zenodo.17857756, Pfleiderer, 2025).
The nudged CESM2 simulations used for the evaluation of statistical decomposition methods is available on https://doi.org/10.5281/zenodo.18172330 (Beyerle and Sippel, 2026). Data from the CESM2 large ensemble can be accessed through https://doi.org/10.26024/kgmp-c556 (Danabasoglu et al., 2020b). The CESM2 piControl simulation can be accessed through the Earth System Grid Federation (ESGF) CMIP6 archive at https://doi.org/10.26050/WDCC/AR6.C6CMNRCES2pc (Danabasoglu et al., 2023). ERA5 data can be accessed through Copernicus Climate Data Store https://cds.climate.copernicus.eu/datasets (last access: 18 November 2024; Hersbach et al., 2020).
PP and SS conceived the study. PP wrote the manuscript with contributions from SS and all other authors. PP created all figures. PP contributed the results of the ridge regression. AM contributed the results for the constructed analogues. HD contributed the results for the DEA. EC and JC contributed the results for UNET. ID contributed the CESM2 simulations nudged to ERA5 winds.
The contact author has declared that none of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
We thank Urs Beyerle for producing the nudged CESM2 simulations that were used to evaluate the statistical decomposition methods. We thank all the scientists, software engineers, and administrators who contributed to the development of CESM. We acknowledge the use of ERA5 reanalysis data provided by the Copernicus Climate Change Service (C3S). We acknowledge the CESM2 Large Ensemble Community Project for providing the model output used in this study. S.S. and I.D. acknowledge funding from the German Research Foundation's Heinz Maier-Leibnitz Prize 2024 for early-career researchers.
This research has been supported by the Horizon 2020 (grant no. 101137682) and the Bundesministerium für Bildung und Forschung (grant no. 01LP2323C).
This paper was edited by Camille Li and reviewed by two anonymous referees.
Beyerle, U. and Sippel, S.: Data used in “Considerable yet contrasting regional imprint of circulation change on summer temperature trends across the Northern hemisphere mid-latitudes” (10.5194/egusphere-2025-2397), Zenodo [data set], https://doi.org/10.5281/zenodo.18172330, 2026. a
camdoc: Camdoc Documentation: 9 Physics Modifications via the Namelist, CESM Development Team, https://ncar.github.io/CAM/doc/build/html/users_guide/physics-modifications-via-the-namelist.html#nudging, last access: 21 May 2025. a
Cariou, E., Cattiaux, J., Qasmi, S., Ribes, A., Cassou, C., and Doury, A.: Linking European Temperature Variations to Atmospheric Circulation With a Neural Network: A Pilot Study in a Climate Model, Geophysical Research Letters, 52, e2024GL113540, https://doi.org/10.1029/2024GL113540, 2025. a, b, c, d, e
Cattiaux, J., Vautard, R., Cassou, C., Yiou, P., Masson-Delmotte, V., and Codron, F.: Winter 2010 in Europe: A Cold Extreme in a Warming Climate: COLD WINTER 2010 IN EUROPE, Geophysical Research Letters, 37, https://doi.org/10.1029/2010GL044613, 2010. a
Chemke, R. and Coumou, D.: Human Influence on the Recent Weakening of Storm Tracks in Boreal Summer, npj Climate and Atmospheric Science, 7, 1–8, https://doi.org/10.1038/s41612-024-00640-2, 2024. a, b, c
Coumou, D., Lehmann, J., and Beckmann, J.: The Weakening Summer Circulation in the Northern Hemisphere Mid-Latitudes, Science, 348, 324–327, https://doi.org/10.1126/science.1261768, 2015. a, b
Danabasoglu, G., Lamarque, J.-F., Bacmeister, J., Bailey, D. A., DuVivier, A. K., Edwards, J., Emmons, L. K., Fasullo, J., Garcia, R., Gettelman, A., Hannay, C., Holland, M. M., Large, W. G., Lauritzen, P. H., Lawrence, D. M., Lenaerts, J. T. M., Lindsay, K., Lipscomb, W. H., Mills, M. J., Neale, R., Oleson, K. W., Otto-Bliesner, B., Phillips, A. S., Sacks, W., Tilmes, S., van Kampenhout, L., Vertenstein, M., Bertini, A., Dennis, J., Deser, C., Fischer, C., Fox-Kemper, B., Kay, J. E., Kinnison, D., Kushner, P. J., Larson, V. E., Long, M. C., Mickelson, S., Moore, J. K., Nienhouse, E., Polvani, L., Rasch, P. J., and Strand, W. G.: The Community Earth System Model Version 2 (CESM2), Journal of Advances in Modeling Earth Systems, 12, e2019MS001916, https://doi.org/10.1029/2019MS001916, 2020a. a, b
Danabasoglu, G., Deser, C., Rodgers, K., and Timmermann, A.: CESM2 Large Ensemble, NSF National Center for Atmospheric Research [data set], https://doi.org/10.26024/kgmp-c556, 2020b. a
Danabasoglu, G., Lawrence, D., Lindsay, K., Lipscomb, W., and Strand, G.: IPCC DDC: NCAR CESM2 model output prepared for CMIP6 CMIP piControl, World Data Center for Climate (WDCC) at DKRZ [data set], https://doi.org/10.26050/WDCC/AR6.C6CMNRCES2pc, 2023. a
Deser, C., Terray, L., and Phillips, A. S.: Forced and Internal Components of Winter Air Temperature Trends over North America during the Past 50 Years: Mechanisms and Implications*, Journal of Climate, 29, 2237–2258, https://doi.org/10.1175/JCLI-D-15-0304.1, 2016. a, b, c, d, e, f, g, h, i, j
Dong, B., Sutton, R. T., Shaffrey, L., and Harvey, B.: Recent Decadal Weakening of the Summer Eurasian Westerly Jet Attributable to Anthropogenic Aerosol Emissions, Nature Communications, 13, 1148, https://doi.org/10.1038/s41467-022-28816-5, 2022. a, b
Durand, H., Varando, G., and Camps-Valls, G.: Learning Causal Response Representations through Direct Effect Analysis, Proceedings of the Forty-first Conference on Uncertainty in Artificial Intelligence, 21–25 July 2025, Proceedings of Machine Learning Research, edited by: Chiappa, S. and Magliacane, S., PMLR, 1135–1166, https://proceedings.mlr.press/v286/durand25a.html (last access: 30 September 2025), 2025. a
Eyring, V., Gillett, N. P., Achuta Rao, K. M., Barimalala, R., Barreiro Parrillo, M., Bellouin, N., Cassou, C., Durack, P. J., Kosaka, Y., McGregor, S., Min, S., Morgenstern, O., and Sun, Y.: Human influence on the climate system, in: Climate change 2021: The physical science basis. Contribution of working group I to the sixth assessment report of the intergovernmental panel on climate change, edited by: Masson-Delmotte, V., Zhai, P., Pirani, A., Connors, S. L., Péan, C., Berger, S., Caud, N., Chen, Y., Goldfarb, L., Gomis, M. I., Huang, M., Leitzell, K., Lonnoy, E., Matthews, J. B. R., Maycock, T. K., Waterfield, T., Yelekçi, O., Yu, R., and Zhou, B., Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 423–552, https://doi.org/10.1017/9781009157896.005, 2021. a
Fabiano, F., Meccia, V. L., Davini, P., Ghinassi, P., and Corti, S.: A regime view of future atmospheric circulation changes in northern mid-latitudes, Weather Clim. Dynam., 2, 163–180, https://doi.org/10.5194/wcd-2-163-2021, 2021. a
Fereday, D., Chadwick, R., Knight, J., and Scaife, A. A.: Atmospheric Dynamics Is the Largest Source of Uncertainty in Future Winter European Rainfall, Journal of Climate, 31, 963–977, https://doi.org/10.1175/JCLI-D-17-0048.1, 2018. a
Guo, R., Deser, C., Terray, L., and Lehner, F.: Human Influence on Winter Precipitation Trends (1921–2015) over North America and Eurasia Revealed by Dynamical Adjustment, Geophysical Research Letters, 46, 3426–3434, https://doi.org/10.1029/2018GL081316, 2019. a, b
Hanna, E., Fettweis, X., and Hall, R. J.: Brief communication: Recent changes in summer Greenland blocking captured by none of the CMIP5 models, The Cryosphere, 12, 3287–3292, https://doi.org/10.5194/tc-12-3287-2018, 2018. a
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.: The ERA5 Global Reanalysis, Quarterly Journal of the Royal Meteorological Society, 146, 1999–2049, https://doi.org/10.1002/qj.3803, 2020. a, b, c
Horton, D. E., Johnson, N. C., Singh, D., Swain, D. L., Rajaratnam, B., and Diffenbaugh, N. S.: Contribution of Changes in Atmospheric Circulation Patterns to Extreme Temperature Trends, Nature, 522, 465–469, https://doi.org/10.1038/nature14550, 2015. a
Kornhuber, K., Bartusek, S., Seager, R., Schellnhuber, H. J., and Ting, M.: Global Emergence of Regional Heatwave Hotspots Outpaces Climate Model Simulations, Proceedings of the National Academy of Sciences, 121, e2411258121, https://doi.org/10.1073/pnas.2411258121, 2024. a
Kvålseth, T. O.: Cautionary Note about R2, The American Statistician, 39, 279–285, https://doi.org/10.2307/2683704, 1985. a
Lehner, F., Deser, C., and Terray, L.: Toward a New Estimate of “Time of Emergence” of Anthropogenic Warming: Insights from Dynamical Adjustment and a Large Initial-Condition Model Ensemble, Journal of Climate, 30, 7739–7756, https://doi.org/10.1175/JCLI-D-16-0792.1, 2017. a, b, c, d
Liné, A., Cassou, C., Msadek, R., and Parey, S.: Modulation of Northern Europe Near-Term Anthropogenic Warming and Wettening Assessed through Internal Variability Storylines, npj Climate and Atmospheric Science, 7, 1–14, https://doi.org/10.1038/s41612-024-00759-2, 2024. a
Merrifield, A., Lehner, F., Xie, S.-P., and Deser, C.: Removing Circulation Effects to Assess Central U.S. Land-Atmosphere Interactions in the CESM Large Ensemble, Geophysical Research Letters, 44, 9938–9946, https://doi.org/10.1002/2017GL074831, 2017. a, b, c, d, e, f
Merrifield, A. L., Simpson, I. R., McKinnon, K. A., Sippel, S., Xie, S.-P., and Deser, C.: Local and Nonlocal Land Surface Influence in European Heatwave Initial Condition Ensembles, Geophysical Research Letters, 46, 14082–14092, https://doi.org/10.1029/2019GL083945, 2019. a, b
O'Neill, B. C., Tebaldi, C., van Vuuren, D. P., Eyring, V., Friedlingstein, P., Hurtt, G., Knutti, R., Kriegler, E., Lamarque, J.-F., Lowe, J., Meehl, G. A., Moss, R., Riahi, K., and Sanderson, B. M.: The Scenario Model Intercomparison Project (ScenarioMIP) for CMIP6, Geosci. Model Dev., 9, 3461–3482, https://doi.org/10.5194/gmd-9-3461-2016, 2016. a
Pfleiderer, P.: Peterpeterp/circ_contribution_to_JJA_trends: V1.1, Zenodo [code], https://doi.org/10.5281/zenodo.17857756, 2025. a
Pfleiderer, P., Schleussner, C.-f., Kornhuber, K., and Coumou, D.: Summer Weather Becomes More Persistent in a 2 °C World, Nature Climate Change, https://doi.org/10.1038/s41558-019-0555-0, 2019. a
Qasmi, S. and Ribes, A.: Reducing Uncertainty in Local Temperature Projections, Science Advances, 8, eabo6872, https://doi.org/10.1126/sciadv.abo6872, 2022. a
Rayner, N. A., Parker, D. E., Horton, E. B., Folland, C. K., Alexander, L. V., Rowell, D. P., Kent, E. C., and Kaplan, A.: Global Analyses of Sea Surface Temperature, Sea Ice, and Night Marine Air Temperature since the Late Nineteenth Century, Journal of Geophysical Research: Atmospheres, 108, https://doi.org/10.1029/2002JD002670, 2003. a
Rigal, A., Azaïs, J.-M., and Ribes, A.: Estimating Daily Climatological Normals in a Changing Climate, Climate Dynamics, 53, 275–286, https://doi.org/10.1007/s00382-018-4584-6, 2019. a
Rodgers, K. B., Lee, S.-S., Rosenbloom, N., Timmermann, A., Danabasoglu, G., Deser, C., Edwards, J., Kim, J.-E., Simpson, I. R., Stein, K., Stuecker, M. F., Yamaguchi, R., Bódai, T., Chung, E.-S., Huang, L., Kim, W. M., Lamarque, J.-F., Lombardozzi, D. L., Wieder, W. R., and Yeager, S. G.: Ubiquity of human-induced changes in climate variability, Earth Syst. Dynam., 12, 1393–1411, https://doi.org/10.5194/esd-12-1393-2021, 2021. a, b
Ronneberger, O., Fischer, P., and Brox, T.: U-Net: Convolutional Networks for Biomedical Image Segmentation, arXiv [preprint], https://doi.org/10.48550/arXiv.1505.04597, 2015. a
Röthlisberger, M. and Papritz, L.: Quantifying the Physical Processes Leading to Atmospheric Hot Extremes at a Global Scale, Nature Geoscience, 16, 210–216, https://doi.org/10.1038/s41561-023-01126-1, 2023. a
Rousi, E., Kornhuber, K., Beobide-Arsuaga, G., Luo, F., and Coumou, D.: Accelerated Western European Heatwave Trends Linked to More-Persistent Double Jets over Eurasia, Nature Communications, 13, 3851, https://doi.org/10.1038/s41467-022-31432-y, 2022. a, b, c, d, e, f
Saffioti, C., Fischer, E. M., and Knutti, R.: Improved Consistency of Climate Projections over Europe after Accounting for Atmospheric Circulation Variability, Journal of Climate, 30, 7271–7291, https://doi.org/10.1175/JCLI-D-16-0695.1, 2017. a
Seneviratne, S. I., Lüthi, D., Litschi, M., and Schär, C.: Land–Atmosphere Coupling and Climate Change in Europe, Nature, 443, 205–209, https://doi.org/10.1038/nature05095, 2006. a
Seneviratne, S. I., Zhang, X., Adnan, M., Badi, W., Dereczynski, C., Di Luca, A., Vicente-Serrano, S. M., Wehner, M., and Zhou, B.: Chapter 11: Weather and Climate Extreme Events in a Changing Climate, in: Climate Change 2021: The Physical Science Basis, https://doi.org/10.1017/9781009157896.013, 2021. a
Shaw, T. A. and Miyawaki, O.: Fast Upper-Level Jet Stream Winds Get Faster under Climate Change, Nature Climate Change, 14, 61–67, https://doi.org/10.1038/s41558-023-01884-1, 2024. a
Shepherd, T. G.: Atmospheric Circulation as a Source of Uncertainty in Climate Change Projections, Nature Geoscience, 7, 703–708, https://doi.org/10.1038/ngeo2253, 2014. a, b, c
Shepherd, T. G.: Storyline Approach to the Construction of Regional Climate Change Information, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 475, https://doi.org/10.1098/rspa.2019.0013, 2019. a
Singh, J., Sippel, S., and Fischer, E. M.: Circulation Dampened Heat Extremes Intensification over the Midwest USA and Amplified over Western Europe, Communications Earth & Environment, 4, 1–9, https://doi.org/10.1038/s43247-023-01096-7, 2023. a, b, c, d
Singh, J., Sippel, S., Gu, L., Knutti, R., and Fischer, E.: Externally Forced Circulation Changes Amplify Mid-Latitude Regional Heat Extremes in Climate Model Nudged-Circulation Experiments, https://doi.org/10.21203/rs.3.rs-7299009/v2, 2025. a
Sippel, S., Meinshausen, N., Merrifield, A., Lehner, F., Pendergrass, A. G., Fischer, E., and Knutti, R.: Uncovering the Forced Climate Response from a Single Ensemble Member Using Statistical Learning, Journal of Climate, 32, 5677–5699, https://doi.org/10.1175/JCLI-D-18-0882.1, 2019. a, b, c
Sippel, S., Barnes, C., Cadiou, C., Fischer, E., Kew, S., Kretschmer, M., Philip, S., Shepherd, T. G., Singh, J., Vautard, R., and Yiou, P.: Could an extremely cold central European winter such as 1963 happen again despite climate change?, Weather Clim. Dynam., 5, 943–957, https://doi.org/10.5194/wcd-5-943-2024, 2024. a
Smoliak, B. V., Wallace, J. M., Lin, P., and Fu, Q.: Dynamical Adjustment of the Northern Hemisphere Surface Air Temperature Field: Methodology and Application to Observations, Journal of Climate, 28, 1613–1629, https://doi.org/10.1175/JCLI-D-14-00111.1, 2015. a, b, c, d
Teng, H., Leung, R., Branstator, G., Lu, J., and Ding, Q.: Warming Pattern over the Northern Hemisphere Midlatitudes in Boreal Summer 1979–2020, Journal of Climate, 35, 3479–3494, https://doi.org/10.1175/JCLI-D-21-0437.1, 2022. a, b, c, d, e, f, g, h
Terray, L.: A dynamical adjustment perspective on extreme event attribution, Weather Clim. Dynam., 2, 971–989, https://doi.org/10.5194/wcd-2-971-2021, 2021. a, b
Topál, D. and Ding, Q.: Atmospheric Circulation-Constrained Model Sensitivity Recalibrates Arctic Climate Projections, Nature Climate Change, 13, 710–718, https://doi.org/10.1038/s41558-023-01698-1, 2023. a, b
Van Den Dool, H. M.: Searching for Analogues, How Long Must We Wait?, Tellus A, 46, 314–324, https://doi.org/10.1034/j.1600-0870.1994.t01-2-00006.x, 1994. a
Vautard, R., Cattiaux, J., Happé, T., Singh, J., Bonnet, R., Cassou, C., Coumou, D., D'Andrea, F., Faranda, D., Fischer, E., Ribes, A., Sippel, S., and Yiou, P.: Heat Extremes in Western Europe Increasing Faster than Simulated Due to Atmospheric Circulation Trends, Nature Communications, 14, 6803, https://doi.org/10.1038/s41467-023-42143-3, 2023. a, b, c, d, e
Wehrli, K., Guillod, B. P., Hauser, M., Leclair, M., and Seneviratne, S. I.: Assessing the Dynamic Versus Thermodynamic Origin of Climate Model Biases, Geophysical Research Letters, 45, 8471–8479, https://doi.org/10.1029/2018GL079220, 2018. a, b, c
Wills, R. C. J., Battisti, D. S., Armour, K. C., Schneider, T., and Deser, C.: Pattern Recognition Methods to Separate Forced Responses from Internal Variability in Climate Model Ensembles and Observations, Journal of Climate, 33, 8693–8719, https://doi.org/10.1175/JCLI-D-19-0855.1, 2020. a
Woollings, T., Drouard, M., O'Reilly, C. H., Sexton, D. M. H., and McSweeney, C.: Trends in the Atmospheric Jet Streams Are Emerging in Observations and Could Be Linked to Tropical Warming, Communications Earth & Environment, 4, 1–8, https://doi.org/10.1038/s43247-023-00792-8, 2023. a
Zeder, J. and Fischer, E. M.: Quantifying the statistical dependence of mid-latitude heatwave intensity and likelihood on prevalent physical drivers and climate change, Adv. Stat. Clim. Meteorol. Oceanogr., 9, 83–102, https://doi.org/10.5194/ascmo-9-83-2023, 2023. a
- Abstract
- Introduction
- Data and Methods
- Results
- Discussion
- Conclusions and Outlook
- Appendix A: Nudged circulation plots
- Appendix B: Significance of circulation induced trends
- Appendix C: More evaluation plots
- Appendix D: Period 2025–2075
- Appendix E: UNET architecture
- Appendix F: Analogues stochastic
- Appendix G: Ridge regression with geopotential heigt at 500 hPa
- Code availability
- Data availability
- Author contributions
- Competing interests
- Disclaimer
- Acknowledgements
- Financial support
- Review statement
- References
- Abstract
- Introduction
- Data and Methods
- Results
- Discussion
- Conclusions and Outlook
- Appendix A: Nudged circulation plots
- Appendix B: Significance of circulation induced trends
- Appendix C: More evaluation plots
- Appendix D: Period 2025–2075
- Appendix E: UNET architecture
- Appendix F: Analogues stochastic
- Appendix G: Ridge regression with geopotential heigt at 500 hPa
- Code availability
- Data availability
- Author contributions
- Competing interests
- Disclaimer
- Acknowledgements
- Financial support
- Review statement
- References