Modeling HadCRUT5 with CO2 and without CO2 • Watts Up With That?
By Andy May
I hate statistics, as many of you know. Some people think statistics and/or statistical models that meet standard statistical criteria are facts. The IPCC can be like that. They statistically model global surface temperatures with models of volcanic and anthropogenic forcing and compare the model to one with only volcanic forcing. Then they turn to us, with a straight face, and say the comparison shows anthropogenic forcing is driving all the warming. What about solar? Oh, they considered that, they say, and the Sun makes no difference; see their chart in figure 1 from AR6. Solar is assumed to be zero and volcanism is small, thus the model assumes all recent warming is due to humans, then draws the same conclusion in a perfect example of circular reasoning. But what if the solar forcing is not zero? What difference does that make?
Figure 1. The IPCC AR6 assumed forces affecting global surface warming translated to degrees C. From AR6 WG1, page 961.
Numerous papers have been published that show the Sun could have more impact on global temperatures and climate change than assumed by the IPCC. We must remember that statistical models are not evidence or theories; they aren’t even proper hypotheses. They are just a tool to test the validity of ideas. A hypothesis might come out of a statistical model, but proof never will. If a model repeatedly predicts the future accurately, then it is evidence the hypothesis is correct, but it isn’t proof. The IPCC presents their statistical climate model with the plots shown in figure 2.
Figure 2 is quite busy, but what it says, in brief, is that they assume that natural warming (heavy green line) is zero, which makes, under their assumptions, all warming due to human activities. The WG1 AR6 report is 2,391 pages long, but figure 2, modified slightly from what they display on page 441, really encapsulates everything it proposes. The rest is filler.
Figure 2. The IPCC model shows their greenhouse gas warming hypothesis with this graph. This is after IPCC AR6 WG1 figure 3.9b (page 441). The vertical axis is the temperature anomaly relative to 1850-1900.
There are numerous problems with figure 2, but we will focus on the comparison between the anthropogenic + natural models, in orange, and the observations in black. First of all, the orange is not one model, but the average of many selected models. The range of model calculations (5th to 95th percentile) is shown with light orange shading. The range is quite large. If they had confidence in their models, wouldn’t they choose the best one and use it? If they don’t trust the models, why try to use them as evidence that the Sun has no influence, and all the warming is due to human activities? Why use the models to confidently predict a man-made climate catastrophe? AR6 WGII Summary for Policymakers (p 12-20) reports high confidence in many future catastrophes based on model results. Why high confidence, if the models are so imprecise that they must be averaged? Second, they use thick lines to try to obscure the differences between the black and orange lines, but the differences are significant, especially from 1935 to 1976 and from 1980 to 2000. The model average between 1920 and 1960 looks almost hand-drawn because it is so straight relative to rising temperatures until 1944 and falling temperatures afterward.
So, let’s take a different approach. The classical paleoclimate literature, pre-IPCC, mostly thought that solar variability dominated climate change. Over time the study of the cosmogenic isotopes 14C in tree rings and 10Be in ice cores has led to accepted proxy records of the Sun’s output that go back thousands of years (see the discussion of Carbon-14 and Beryllium-10 here). These isotopes are created in the atmosphere when galactic cosmic rays make it through the solar magnetic field and impact the atmosphere. When solar output is high, its magnetic field is stronger than when it is low. Thus, low concentrations of 14C and 10Be suggest a strong solar output and vice versa. Since 1700 sunspot records provide a more accurate view of solar activity.
Studies of 14C, 10Be, and sunspot records have uncovered five major long-term solar cycles. These are the Hallstatt (or Bray) cycle of about 2,400 years, the Eddy cycle of about 1,000 years, the de Vries (or Suess) cycle of about 210 years, the Feynman (or Gleissberg) cycle of about 105 years, and the Pentadecadal cycle of about 50 years. All the cycle periods are approximate, and they may vary over geological time. Some may not like my use of the term “cycles,” since our understanding of the cycle periods and the strength or power of each cycle is poor. Perhaps the term oscillation would be better, but understand that I fully appreciate how poorly we understand these cycles and use the term only for convenience and not necessarily according to the precise definition of the word.
The Sun is a dynamo and generates a magnetic field that controls the variations in its output over time. Such a dynamo will have cycles, we have shown they exist and affect Earth’s climate, but the details are sketchy. What astrophysicists and paleoclimatologists have done is observe the Sun and solar impacts on Earth’s climate and recognized in-phase patterns of both solar activity and climate impacts. We discuss these observed (but only approximate) patterns in the post and correlate them to HadCRUT5. Cycles are also observed in other stars that are like our Sun.
There are also shorter periods of solar variability, like the sunspot cycle which has a varying period and asymmetrical shape that averages about 11 years. Finally, we have the ENSO cycle, also with a varying period, that is driven, in part, by solar activity. To cover the shorter solar cycles we include the SILSO sunspot record and the ERSST Niño 3.4 (ENSO) record from KNMI.
If we ignore the IPCC assumption that solar activity has played no role in climate change since 1750, as suggested in figure 1, it is possible to investigate the correlation of these well-established cycles or oscillations and one of the global surface temperature records used in AR6, the HadCRUT5 record. Unfortunately, the HadCRUT5 global surface temperature record only goes back to 1850, but it is an instrumental record, and preferable to proxies. The data used to build HadCRUT5 is poor prior to 1958, so we will also investigate the even shorter period of more accurate data from 1958 to 2023.
We used statistical multiple regression to see how well these cycles and data can predict HadCRUT5. We understand going in, that even if we can build a multiple regression model with a high R2 (Coefficient of determination or the square of the correlation coefficient), we haven’t proven anything. We also understand that while global average surface temperature is an important metric of climate change, it is not the only important metric. Other metrics, such as mid-latitude wind speed and direction, as well as surface temperature trends at the poles, and in the tropics (especially in the middle troposphere) are also important. The purpose of this post is simply to show that the IPCC’s choice to characterize the correlation of the trends in the logarithm of CO2 concentration and global average surface temperature as “proof” or “evidence” that CO2 and other human greenhouse gas emissions drive climate change is not very solid. In fact, it is probably wrong. Other reasonable correlations are possible, and arguably better.
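To make the comparison concrete, here is a minimal sketch in Python of fitting an ordinary least-squares multiple regression twice, once with and once without one slowly rising predictor, and reporting R2 for each. The series are synthetic placeholders, not HadCRUT5 or any dataset used in this post, and the names `fit_r2`, `slow_cycle`, `fast_cycle`, and `trend_like` are hypothetical. The models actually used here are in the R code in the supplementary material.

```python
import numpy as np

def fit_r2(X, y):
    """Ordinary least squares with an intercept; returns (coefficients, R^2)."""
    A = np.column_stack([np.ones(len(y)), X])     # prepend an intercept column
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)  # least-squares solution
    resid = y - A @ beta
    ss_res = np.sum(resid ** 2)                   # residual sum of squares
    ss_tot = np.sum((y - y.mean()) ** 2)          # total sum of squares
    return beta, 1.0 - ss_res / ss_tot

# Synthetic stand-ins (assumptions, NOT the real data)
rng = np.random.default_rng(0)
t = np.arange(1850, 2024, dtype=float)
slow_cycle = np.cos(2 * np.pi * (t - 1680) / 1000.0)  # hypothetical long cycle
fast_cycle = np.cos(2 * np.pi * (t - 1850) / 50.0)    # hypothetical short cycle
trend_like = np.log(280 + 0.8 * (t - 1850))           # hypothetical rising series
y = (0.5 * slow_cycle + 0.1 * fast_cycle + 0.3 * trend_like
     + rng.normal(0, 0.05, t.size))

X_all = np.column_stack([slow_cycle, fast_cycle, trend_like])
X_sub = np.column_stack([slow_cycle, fast_cycle])

_, r2_all = fit_r2(X_all, y)
_, r2_sub = fit_r2(X_sub, y)
print(f"R2 with all predictors: {r2_all:.3f}")
print(f"R2 without trend_like:  {r2_sub:.3f}")
```

Adding a predictor to a least-squares model with an intercept can never lower R2, so a small gain by itself says little about whether the added predictor matters.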
Figure 3 is a plot of the independent or predictor variables used in our regression study. They have been normalized to scales of -3 to +3 by dividing the larger variables (Log(CO2) and sunspots) by their mean to better compare the variables to one another. In addition, we divided the sunspot number by its standard deviation to help make it comparable in scale to other variables.
Figure 3. The input series used in this multiple regression study. The y axis scale is an index, and the curves cannot be compared quantitatively.
Unfortunately, our period is too short to properly evaluate some of the stronger climate cycles, like the Hallstatt (light blue) and Eddy (orange) cycles. These two cycles bottomed in the Little Ice Age and their periods are so long they almost appear as straight lines, but they are increasing like the HadCRUT5 record. The logarithm of CO2 is also nearly a straight line, and very slightly increasing. The CO2 data are interpolated yearly averages to avoid the seasonal wiggles.
The ENSO 3.4, sunspots (SN Norm), and CO2 (Log CO2 Norm) records used in the study are from well-known datasets. The longer-term solar cycles are created using a sinusoid function of the form:
Where the cosine argument is in radians, f=frequency, t=time, and the offset is used to align the sine wave with assumed cycle lows (cold periods) from Ilya Usoskin and Joan Feynman. For more on this transform, used in Fourier analysis, see David Evans’ paper here. These lows are not precise and must be estimated from the available data. The actual values used, and the precise functions are in the supplementary materials which are linked at the end of this post.
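As an illustration only, the Python sketch below builds a unit-amplitude cosine from a period and an assumed cycle low. The exact functional form and offsets used in this study are in the supplementary materials, so the way the offset enters here (a shift of pi so that the minimum falls on the chosen low year) is an assumption. The function name `cycle` and the low year of 1900 are hypothetical placeholders.

```python
import numpy as np

def cycle(t, period_years, low_year):
    """Unit-amplitude cosine with minima at low_year + k * period_years."""
    f = 1.0 / period_years  # frequency in cycles per year
    # Adding pi shifts the cosine so that it equals -1 at t == low_year
    return np.cos(2.0 * np.pi * f * (t - low_year) + np.pi)

t = np.arange(1850, 2024, dtype=float)  # yearly time axis

# A period of about 210 years is taken from the text;
# the low year below is a placeholder, not an estimate from the literature.
example_cycle = cycle(t, period_years=210.0, low_year=1900.0)
```

With this convention the series reaches its minimum in the low year and its maximum half a period later.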
The Multiple Regression Model
I performed a number of regressions with the variables plotted in figure 3 and various subsets of them. In every case where I could tell, the statistically most important single variable, judging from AIC, sum of squares, and R2, was the logarithm of CO2. However, all the variables were significant, and CO2 compared to the impact of all the others combined was small, as we will see. AIC ranks the input predictors for the 1958 case as follows: Log_CO2, Nino_3_4, Hallstatt, Eddy, Pentadecadal, sunspots, and finally de Vries. AIC is based on the sum of squares, so it can be problematic in autocorrelated series like these. The plots below give you a feel for the relative importance of the main variables, which is hard (maybe impossible) to calculate statistically with any precision, mainly due to the brief period of our instrumental data and the long periods of the important solar cycles. The next four plots are for the whole instrumental record, 1850 to 2023. Figure 4 includes all the variables in the study.
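For readers who want to see how an AIC ranking of predictors can be produced, here is a minimal Python sketch. It assumes the common least-squares form AIC = n ln(RSS/n) + 2k, which is correct only up to an additive constant, and it ranks predictors by how much AIC rises when each one is dropped. The data are synthetic white noise, so the autocorrelation caveat above does not apply to them. The predictor names and the helpers `ols_rss` and `aic_least_squares` are hypothetical. This is not the procedure in the supplementary R code.

```python
import numpy as np

def ols_rss(X, y):
    """Fit OLS with an intercept; return (residual sum of squares, n_params)."""
    A = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    r = y - A @ beta
    return float(r @ r), A.shape[1]

def aic_least_squares(rss, n, k):
    """AIC for a Gaussian least-squares fit, up to an additive constant."""
    return n * np.log(rss / n) + 2 * k

# Synthetic, independent predictors (assumptions for illustration only)
rng = np.random.default_rng(1)
n = 200
predictors = {
    "strong": rng.normal(size=n),
    "weak": rng.normal(size=n),
    "irrelevant": rng.normal(size=n),
}
y = 2.0 * predictors["strong"] + 0.3 * predictors["weak"] + rng.normal(0, 1.0, n)

names = list(predictors)
drop_one_aic = {}
for name in names:
    kept = [predictors[p] for p in names if p != name]  # leave one predictor out
    rss, k = ols_rss(np.column_stack(kept), y)
    drop_one_aic[name] = aic_least_squares(rss, n, k)

# The predictor whose removal raises AIC the most ranks first
ranking = sorted(names, key=lambda p: drop_one_aic[p], reverse=True)
print(ranking)
```

Because every drop-one model here has the same number of parameters, the ranking reduces to a ranking by residual sum of squares, which is one reason AIC inherits the weaknesses of sum-of-squares measures.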
Figure 4. A model with all series, including log(CO2). The fine gray line is the monthly HadCRUT5 data, and the blue line is smoothed with an 11-year moving average. The vertical scale is the HadCRUT5 temperature anomaly in degrees C, relative to 1961-1990. The orange line is the model.
Figure 5 uses all the variables except Log_CO2. In both figures the blue line is the smoothed HadCRUT5 record, and the fine gray line is the monthly HadCRUT5 data. The orange line is the model. We can see that Log_CO2 visually adds little to the match between observations and the model. Significant improvement is visible around 1940, otherwise the two models are about the same.
Figure 5. Plot of the regression with log(CO2) removed from the list of predictors. The R2 drops to 0.84 and there is noticeable deterioration in the fit between 1935 and 1947. The fine gray line and the blue line are as before. The vertical scale is the HadCRUT5 temperature anomaly in degrees C, relative to 1961-1990.
Figure 6 compares the model that uses Log_CO2 to the model that only uses the solar related variables. The two models are similar. The only noticeable differences are before 1940 when CO2 was supposedly not very important. It is possible that the differences are due to data quality. As we will see, the data prior to 1958 was lower in quality than the data after that date.
Figure 6. A comparison of the “no CO2” versus “with CO2” models. All other predictors are in both models. The vertical scale is the HadCRUT5 temperature anomaly in degrees C, relative to 1961-1990.
In figure 7 we model HadCRUT5 with only CO2. While the R2 is 0.8 and the model generally follows HadCRUT5, the model lacks the granularity and detail that is apparent in figures 5 and 6. The IPCC calls the granularity natural variability and dismisses it as statistical “noise” that is random. Notice the P-value doesn’t change. The P-value is of little use in models like this that have a lot of observations and produce good matches; it is not a good measure of model quality.
Figure 7. HadCRUT5 is modeled with only CO2. The vertical scale is the HadCRUT5 temperature anomaly in degrees C, relative to 1961-1990.
Next, we repeat the above four plots using a new model that only uses the data between 1958 and the present day. This is the longest period possible with good data. To get another upward step change in data quality we would need to move to 2005 when the ARGO array became sufficiently large to produce better data on ocean temperatures than we can get from ships. But only 17 years of good ocean data is not long enough to judge the influence of the longer solar cycles.
Figure 8 shows a good visual match between observations and a model with all the variables. It also has an R2 of 0.9, which would be impressive if the variables were independent and not autocorrelated. The mismatch between 1992 and 1995 is probably due to the Pinatubo eruption in 1991, which was not incorporated into this model.
Figure 8. A model with all predictors, including CO2, from 1958 to the present day. The Pinatubo eruption is identified. The vertical scale is the HadCRUT5 temperature anomaly in degrees C, relative to 1961-1990.
Figure 9. A model from 1958 with all predictors except CO2. The vertical scale is the HadCRUT5 temperature anomaly in degrees C, relative to 1961-1990.
Figure 9 is the model with all variables except for CO2. The match is still good, but there are differences in detail suggesting that adding CO2 makes a difference. The large difference just after 1992 is probably due to the influence of the Mt. Pinatubo eruption in the summer of 1991. The effect of the eruption lasted several years. With the exception of the Pinatubo eruption, the model is almost as good as the model that includes CO2, at least visually.
Figure 10. Models with and without CO2 over the 1958 to present period. The vertical scale is the HadCRUT5 temperature anomaly in degrees C, relative to 1961-1990.
Figure 10 compares the models with and without CO2 directly, and except for the period right around the Mt. Pinatubo eruption, the match is excellent. I’m not saying that Pinatubo had an effect before it erupted, just that the large impact of the eruption on the HadCRUT5 record (see Figure 11) could have distorted the two regressions differently in that period. Possibly the addition of CO2 makes a small difference, but it isn’t apparent in this plot anywhere except around the eruption.
Figure 11. A model with only CO2 as a predictor from 1958 to the present. The vertical scale is the HadCRUT5 temperature anomaly in degrees C, relative to 1961-1990.
Figure 11 shows a model using only the logarithm of CO2. There is a general correspondence of temperature and CO2, but a great deal of detail is missing that we see in the other models. We can argue that the variation of the HadCRUT5 record around the orange model in figure 11 is not random noise if it can be modeled with solar cycles.
A word on statistics
The risk in evaluating regression statistics of models of autocorrelated series is most easily seen by considering that any two monotonically increasing time series, for example CO2 and temperature since 1850, will appear to correlate, even if they are unrelated. This is why I often hate statistics. Too often statistical measures of fit, like R2, or computed statistical probabilities are used to gaslight readers into believing something that isn’t true. Your first judgment of a correlation should be made with a plot of the data versus the model; the second should be a plot of the residuals. Are the residuals evenly dispersed about zero, or do they have a trend? All the residual plots for the top models in this post are trendless, as they should be.
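The point about monotonically increasing series can be demonstrated with a short Python sketch. The two series below are generated independently of each other and are synthetic placeholders, not CO2 or temperature data. Each simply rises over time. The names `series_a`, `series_b`, and `detrend` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(42)
t = np.arange(174, dtype=float)  # 174 steps, like yearly data from 1850 to 2023

# Two series built independently; the only thing they share is that both rise
series_a = 0.01 * t + rng.normal(0, 0.1, t.size)  # noisy straight line
series_b = np.log(280.0 + 0.8 * t)                # smooth, slightly curved rise

r_raw = np.corrcoef(series_a, series_b)[0, 1]
print(f"correlation of the raw series: {r_raw:.2f}")

# Lag-1 autocorrelation: how strongly each value depends on the previous one
lag1 = np.corrcoef(series_a[:-1], series_a[1:])[0, 1]
print(f"lag-1 autocorrelation of series_a: {lag1:.2f}")

def detrend(x):
    """Subtract the best-fit straight line in t from x."""
    slope, intercept = np.polyfit(t, x, 1)
    return x - (slope * t + intercept)

# With the shared upward trend removed, what is left barely correlates
r_detrended = np.corrcoef(detrend(series_a), detrend(series_b))[0, 1]
print(f"correlation after detrending:  {r_detrended:.2f}")
```

The raw correlation is high only because both series trend upward. Once the straight-line trend is removed from each, little correlation remains, which is why a plot of the residuals is a more honest check than R2 alone.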
The main point is to trust your eyes; statistical measures of the fits are secondary. Sometimes the obvious is correct. To illustrate this point, I used a stepwise regression to order the models. To generate these four models, I removed the top variable (according to its AIC) and reran the regression with the remaining variables until the visual model did not match HadCRUT5 very well. The procedure suggests the most important variables are Log_CO2, Hallstatt, and Eddy. The four acceptable stepwise regression models are plotted in figure 12.
Figure 12. The four best forward stepwise regression models. The vertical scale is the HadCRUT5 temperature anomaly in degrees C, relative to 1961-1990.
The first stepwise model (All) chose the variables listed in the figure. The variables are listed in order of importance according to their AIC scores. The best models, visually, are “All” and “no CO2,” and it is hard to tell the difference between the two. Notice that when CO2 was removed from the selection list, more variables were chosen.
After Hallstatt is removed, the list of chosen variables shrinks, but the model visually degrades a lot. Once Eddy is removed, the model becomes very poor. The top variable, by AIC, is Log_CO2, but when Log_CO2 is removed from the model (the green curve) the match to HadCRUT5 is still good. Other models were also evaluated in this fashion, but these three are the best.
The variables that came out consistently on the bottom, according to AIC, were the Pentadecadal cycle and sunspots. However, removing these variables always caused the model to visually deteriorate unacceptably. Thus, AIC, while useful, is not a good sole criterion for the value of variables or models. Always look at the plots.
There are several logical conclusions from this study.
1. A successful model can be built using only solar cycles, ENSO, and the sunspot record.
2. Adding CO2 to the model described in (1) above adds a little to the fit, mostly in short intervals, like from 1935 to 1940 and in the middle 1990s around the Pinatubo eruption.
3. Standard statistical measures, like AIC, R2, or the P-value, cannot be used as the sole measure of the success of the model. Evaluating the plots is critical.
This study shows that solar variability, at least statistically, correlates to HadCRUT5 at least as well as CO2. HadCRUT5 is one of the main global average surface temperature records used by the IPCC to measure climate change, and their conclusion, as stated in the AR6 Technical Summary, is:
“Taken together with numerous formal attribution studies across an even broader range of indicators and theoretical understanding, this underpins the unequivocal attribution of observed warming of the atmosphere, ocean, and land to human influence.”
AR6 TS, page 63, emphasis added.
This is incorrect, and the result of their unsupported assertion that the Sun has no influence on climate. They should seriously investigate the influence of solar variability on climate change. I expected to have to deal with lagged solar effects on climate in this study going in. Possible multi-year lags between solar events and related climatic effects are mentioned in many papers (example here, other examples are cited in Eichler, et al.), but the observation/model matches in this post were all achieved with no lags.
I would like to thank Charley May and David Evans for their help with this post, although if there are any errors, they are mine alone.
Download the bibliography here.
Download the supplementary material here. You will find the R code to create all the models and Excel code to make the main models. Not all the R models can be made in Excel. To run the models in Excel you will need the “Analysis ToolPak” and the “Solver” Add-in. These are found under File/Options/Add-ins.
(IPCC, 2021, p. 961) ↑
See especially: (Connolly et al., 2021), (Hoyt & Schatten, 1997), (Soon, Connolly, & Connolly, 2015), (Usoskin I. , 2017), (Usoskin, Gallet, Lopes, Kovaltsov, & Hulot, 2016), (Scafetta N. , 2023), (Vahrenholt & Lüning, 2015), and (Judge, Egeland, & Henry, 2020). ↑
(Hoyt & Schatten, 1997) and (Bray, 1968) ↑
14C is the Carbon-14 isotope. Except for nuclear bombs, it is only created in the atmosphere by galactic cosmic rays, which increase when the Sun is less active. It has been used as a proxy for solar activity for many decades. It is stored in tree rings, which provide a convenient and accurate date for each 14C concentration. (Cain & Suess, 1976) and (Cain W. , 1975). ↑
10Be is an isotope of Beryllium that is created by cosmic rays and is also inversely correlated with solar activity. It is stored in ice cores. (Beer, Blinov, Bonani, & al., 1990). ↑
(Beer, Blinov, Bonani, & al., 1990) and (Hoyt & Schatten, 1997, p. 174) ↑
(Bray, 1968) ↑
(Delaygue & Bard, 2011) ↑
(Bray, 1968) ↑
(Abreu, Beer, & Ferriz-Mas, 2010) ↑
Joan Feynman studied this centennial cycle and the pentadecadal cycle for many years. She called it the Gleissberg cycle, but since many have used the name Feynman cycle, we continue with that name here (Feynman & Ruzmaikin, 2014). See also (Peristykh & Damon, 2003). ↑
The Pentadecadal cycle was first recognized by Rudolf Wolf in 1862 (Peristykh & Damon, 2003). He recognized that two or three high cycles were often followed by two or three low cycles. More formal recognition of the cycle was made by (Feynman & Ruzmaikin, 2014) and (Clilverd, Clarke, Ulich, Rishbeth, & Jarvis, 2006). ↑
(Peristykh & Damon, 2003) ↑
(Judge, Egeland, & Henry, 2020) and (Baliunas, et al., 1995) ↑
(Peristykh & Damon, 2003) ↑
(Roy, 2014) ↑
1958 was the International Geophysical Year (IGY), which led to gathering much higher quality climate and climate-related data. It is notable that the late S. Fred Singer was one of the organizers of this project and that it was organized in James Van Allen’s living room in 1950. According to Van Allen, it was his wife’s (Abigail) chocolate cake that sealed the deal that day. (Korsmo, 2007). ↑
(McKitrick & Christy, 2018) and (McKitrick & Christy, 2020) ↑
Temperature varies with the logarithm to the base 2 of CO2 concentration, which means each doubling of CO2 produces the same linear increase in temperature. As CO2 concentration increases, the effect of each additional increment on surface temperature decreases. (Romps, Seeley, & Edman, 2022) and (Wijngaarden & Happer, 2020) ↑
ENSO 3.4 is from ERSST, which only goes back to 1854. 1850 through 1853 are filled in with the Webb, 2022 ONI. The sunspot number is from SILSO, and the CO2 concentration data are from NASA and NOAA. The CO2 record is interpolated yearly averages to avoid the seasonal changes. ↑
(Evans, 2013) ↑
(Usoskin, Gallet, Lopes, Kovaltsov, & Hulot, 2016) and (Usoskin I. , 2017) ↑
(Feynman & Ruzmaikin, 2014) ↑
(Evans, 2013) ↑
AIC stands for Akaike Information Criterion. It estimates the information lost by using the regression model in the place of the measurements. Like R2, it is based on the sum of squares and is susceptible to inflation (making variables and models look better than they actually are) due to autocorrelation. The Wikipedia article on this metric is helpful, see here. The lower the AIC value the better the model. ↑
All the input time series used in these multiple regression models are autocorrelated, which simply means each value in the series is highly dependent on its previous values; the values are not independent of one another, as required by the rules of regression. This artificially inflates the statistical measures often used to evaluate the quality of a regression, such as the R2 value shown in some of the plots. ↑