It has taken 17 months to get a comment published pointing out the obvious errors in the Scafetta (2022) paper in GRL.
Back in March 2022, Nicola Scafetta published a short paper in Geophysical Research Letters (GRL) purporting to show through ‘advanced’ means that ‘all models with ECS > 3.0°C overestimate the observed global surface warming’ (as defined by ERA5). We (me, Gareth Jones and John Kennedy) wrote a note up within a couple of days pointing out how wrongheaded the reasoning was and how the results did not stand up to scrutiny.
At the time, GRL had a policy not to accept comments for publication. Instead, they had a somewhat opaque and, I assume, rarely used, process by which you could submit a complaint about a paper and upon review, the editors would decide whether a correction, amendment, or even retraction, was warranted. We therefore submitted our note as a complaint that same week.
For whatever reason (and speculation may abound), the original process hit the buffers after a few months, which possibly contributed to a reassessment in December 2022 by the GRL editors and AGU of their policy regarding comments. Henceforth, they would be accepted! [Note this is something many people had been wanting for some time]. After some back and forth on how exactly this would work (including updating the GRL website to accept comments), we reformatted our note as a comment, and submitted it formally on December 12, 2022. We were assured by the editor-in-chief and publications manager that this would be a ‘streamlined’ and ‘timely’ review process.
With respect to our comment, that appeared to be the case: It was reviewed, received minor comments, was resubmitted, and accepted on January 28, 2023.
But there it sat for 7 months!
The issue was that the GRL editors wanted to have both the comment and a reply appear together. However, the reply had to pass peer review as well, and that seems to have been a bit of a bottleneck. But while the reply wasn’t being accepted, our comment sat in limbo. Indeed, the situation inadvertently gives the criticized author(s) an effective delaying tactic since, as long as a reply is promised but not delivered, the comment doesn’t see the light of day. After multiple successive reassurances that it would just take a few weeks longer, Scafetta’s reply was finally accepted (through exhaustion?) and we were finally moved into production on August 18, 2023. The comment (direct link) and reply (direct link) appeared online on September 21, 2023.
All in all, it took 17 months, two separate processes, dozens of emails, and who knows how much internal deliberation for an official comment to get into the journal pointing out issues that were obvious as soon as the paper came out.
Why bother?
This is a perennial question. Why do we need to correct the scientific record in formal ways when we have abundant blogs, PubPeer, and social media, to get the message out? Clearly, many people who come across a technical paper in the literature won’t instantly be aware of criticisms of it on Twitter, or someone’s blog. Not everyone has installed the PubPeer browser extension for getting notified when a paper you are looking at or citing has comments [though you really should!]. And so, since journals remain extremely reluctant to point to third party commentary on their published papers, going through the journals’ own process seems like it’s the only way to get a comment or criticism noticed by the people who are reading the original article. Without that, people may read and cite the material without being aware of the criticisms (having said that, the original Scafetta paper has so far amassed only 4 citations, half of which are from Scafetta himself (not counting this comment and reply). For comparison, our Hausfather et al (2022) commentary on ‘hot models’ in CMIP6 which came out in May 2022 has been cited 118 times).
The odd thing about how long this has taken is that the substance of the comment was produced extremely quickly (a few days) because the errors in the original paper were both commonplace and easily demonstrated. The time, instead, has been entirely taken up by the process itself. It shouldn’t be beyond the wit of people to reduce that burden considerably. [Perhaps, let us know in the comments if more recent experiences with GRL have improved?].
I’ve previously discussed other ideas, such as short-form journals that could be devoted to post-publication peer reviews and extensions, but this has not (yet!) gained much traction, though AGU has suggested that the online Earth and Space Sciences could be used as such a platform.
What should have happened?
Every claim in the Scafetta paper was wrong and wrongly reasoned. As soon as the journal was notified of that, and had the original process reviewed by independent editors and reviewers, it should have been clear that it would never have passed competent peer review. At that point, the author could have been given a chance to amend the paper to pass a new review, and if they were unwilling or unable to do so, the paper should have been retracted. The COPE guidelines are clear that retraction is warranted in the case of unreliable results resulting from major errors. It does no-one any good to have incorrectly argued claims, inappropriate analyses, and unsupported claims in the literature. It would not have appeared in GRL if it had been competently reviewed, and so why should it remain, now that it has been?
Of course, people sometimes get a bit bent out of shape when their papers are retracted or even if that is threatened, and some authors have gone as far as instigating legal action for defamation against the journals for pursuing it or the newspapers reporting it. I am, however, unaware of any such suit succeeding. Authors do not have any right to be published in their journal of choice, and the judgement of a journal in deciding what does or does not get published in its pages is (and should be) pretty much absolute.
So was the reply worth waiting 7 months for?
Nope. Not in the slightest.
He spends most of the response arguing incorrectly about the accuracy of the ERA5 surface temperatures – something that isn’t even in question: they could be perfect and it wouldn’t affect the point we were making. His confusion is that he thinks that the specific realization of the internal variability that the real world followed is the same as the forced component of the temperature trends that we would hope to capture with climate models. It is not. We discussed this in some detail in a subsequent post when he first made this error in his 2023 Climate Dynamics paper. To be specific, the observed temperature record can be thought of as consisting of a climatological trend, internal variability with a mean of zero, plus structural uncertainty related to how well the observational estimate matches the real world:

$$T_{obs}(t) = F(t) + \xi(t) + \epsilon(t)$$

where $F$ is the climatological trend, $\xi$ the internal variability and $\epsilon$ the structural uncertainty, with $F$ assumed to be constant by definition over each decade, and so […]

The $\sigma_\xi$ can be estimated from the decadal sample and for GISTEMP or ERA5 it’s around 0.05ºC, while $\sigma_\epsilon$ is much smaller (0.016ºC or so). So the 95% confidence interval on the decadal change due to internal variability is therefore around […] ºC. With models you can actually run an ensemble and estimate this more directly, and for consistency, the two methods should be comparable. Curiously though, the 95% ensemble spread for the models (with 3 or more simulations) has quite a wide range from 0.05ºC to a whopping 0.42ºC (EC-Earth, a definite outlier), though the model mean is a more reasonable 0.17ºC.
Curiously Scafetta associates the structural uncertainty in annual temperature anomalies in the ERA5 reanalysis with the uncertainty in the in situ surface temperature analyses (like GISTEMP or HadCRUT5) – products that use a totally different methodology whose error characteristics aren’t obviously related at all. In any case, it’s a very small number and the uncertainty in our estimate of the climatological trend is totally dominated by the variance due to the specific realization of the weather. Also curious is his insistence that the calculation of an internal variability component can’t be fundamental because he gets a different number using the monthly variations as opposed to the annual ones. He seems unaware of the influence of auto-correlation.
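The role of auto-correlation can be illustrated with a toy example. The sketch below is not an analysis of ERA5 or any other product: it generates a synthetic monthly AR(1) series (the coefficient of 0.9, the noise scale, the seed and all names are arbitrary assumptions) and compares the naive standard error of a 10-year mean computed from monthly values, treated as if they were independent, with the one computed from annual means.

```python
import random
import statistics


def ar1_series(n, phi, sigma, rng):
    """Generate n values of a zero-mean AR(1) process (illustrative only)."""
    x = 0.0
    out = []
    for _ in range(n):
        x = phi * x + rng.gauss(0.0, sigma)
        out.append(x)
    return out


rng = random.Random(0)   # fixed seed so the run is repeatable
n_years = 200            # long synthetic record, an arbitrary choice
monthly = ar1_series(12 * n_years, phi=0.9, sigma=0.05, rng=rng)

# Annual means of the synthetic monthly series.
annual = [statistics.fmean(monthly[12 * i:12 * (i + 1)]) for i in range(n_years)]

# Naive standard error of a 10-year mean, wrongly treating months as independent.
se_from_monthly = statistics.stdev(monthly) / (120 ** 0.5)

# Standard error of a 10-year mean estimated from annual means instead.
se_from_annual = statistics.stdev(annual) / (10 ** 0.5)

print(f"from monthly values: {se_from_monthly:.4f}")
print(f"from annual means:   {se_from_annual:.4f}")
```

Because successive months in such a series are strongly correlated, 120 monthly values carry far fewer than 120 independent pieces of information, so the two estimates differ. That difference reflects the sampling assumption, not a flaw in the concept of an internal variability term. (Annual means are not perfectly independent either; the sketch only shows the direction of the effect.)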
Also amusing is his excuse for not looking at the full ensemble in assessing the consistency of specific models. He claims to have taken three runs from each model. But he’s being very disingenuous here. His ‘three runs’ were the ensemble means from three scenarios (the different SSPs from 2015 to 2020), which a) barely differ from each other in forcing because of the minor differences in GHG concentrations over a mere five years and, b) are the ensemble means (at least for the runs with multiple ensemble members)! It is possible that he isn’t aware of what he actually did since it is not stated clearly either in the original paper or in this reply, but it is obvious from comparing his results from the SSP2-4.5 scenarios with our Figure 1. For instance, for NCAR CESM2, there are six simulations with deltas of [0.788, 0.861, 0.735, 0.653, 0.682, 0.795] ºC (using the period definition in the original paper) and an ensemble mean change of 0.752ºC. Scafetta’s value for this model is … 0.75ºC. Similarly, for NorESM2-LM, the individual runs have changes of 0.772, 0.632, & 0.444ºC, with an ensemble mean of 0.616ºC. Scafetta’s number? You guessed it, 0.62ºC. It is simply not possible to estimate the ensemble spread from only using the ensemble means. Another oddity of this methodology is that the spread for the models with many ensemble members is much smaller than the spread for models with only a single simulation since for these models you actually do sample some of the internal variability with the three scenarios. For instance, CanESM5 (50 ensemble members) has a spread of 0.03ºC across the three scenarios, and IPSL-CM6A-L (11 ensemble members) has no spread at all! Meanwhile MCM-UA-1-0, and HadGEM3-GC31-LL (with only single runs) have spreads in Scafetta’s table of 0.11ºC, and 0.17ºC respectively. [All that effort put in to running initial condition ensembles for nought!]
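To make the ensemble-mean point concrete, here is a minimal Python sketch that uses only the per-run changes quoted above for CESM2 and NorESM2-LM. The names are illustrative, and using the sample standard deviation and the min-to-max range as measures of spread is an assumption made for this example rather than the statistic used in the comment itself.

```python
from statistics import fmean, stdev

# Per-run changes (deg C) as quoted in the text above.
runs = {
    "CESM2": [0.788, 0.861, 0.735, 0.653, 0.682, 0.795],
    "NorESM2-LM": [0.772, 0.632, 0.444],
}


def summarize(deltas):
    """Return (ensemble mean, sample standard deviation, min-to-max range)."""
    return fmean(deltas), stdev(deltas), max(deltas) - min(deltas)


summary = {model: summarize(deltas) for model, deltas in runs.items()}

for model, (mean, sd, spread) in summary.items():
    print(f"{model}: mean={mean:.3f}  sd={sd:.3f}  range={spread:.3f}")

# If only the ensemble mean is available (here repeated three times as a
# stand-in for three near-identical scenario means), the apparent spread
# collapses to zero, whatever the real run-to-run variability was.
collapsed = [summary["CESM2"][0]] * 3
collapsed_sd = stdev(collapsed)
print(f"spread recovered from ensemble means alone: {collapsed_sd:.3f}")
```

The means round to the 0.75ºC and 0.62ºC values quoted above, while the run-to-run spread, which is what a consistency test needs, is only available from the individual simulations.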
Thus the two points that we made in our comment – that he misunderstood the uncertainty in the climatological trends in the observations and that he didn’t utilize the spread in the model ensembles, and that this fatally compromises his conclusions – stand even more clearly now. The additional spin he now wants to put on his results, defining a new concept called, apparently, a ‘macro-GCM’, has the internal consistency of whipped cream. None of it rescues his patently incorrect conclusions.
I have absolutely no expectation that this episode will encourage Scafetta to improve his analyses. He’s been doing this kind of thing for almost two decades now. He is too wedded to the conclusions that he wants to let little things like the actual results or consistency intrude. I am slightly more confident that processes at GRL may improve, and the recent change to allow comments is very welcome. Hopefully, this exchange might be helpful for other researchers thinking about the appropriate way to compare models to observations (for instance, it made an appearance in Jain et al, 2023).
The last paragraph in our comment sums it up:
In critiquing the tests in this particular paper, we are not suggesting that hindcast comparisons should not be performed, nor are we claiming that all models in the CMIP6 archive perform equally well. […] However, the claims in Scafetta (2022) are simply not supported by an appropriate analysis and should be withdrawn or amended.
Schmidt et al, 2023
Let us all try to do better in future.
The problem with Scafetta is that he uses a pasta approach: he throws everything against the wall to see if anything sticks. How many papers has he written that have attributed planetary influences, sunspot cycles, orbital resonances, etc. to climate? Yet, none of these are self-consistent. He’s even written about correlation to COVID-19!
In conventional science research circles, you’re given a couple of chances to prove your worthiness. After that, you’re considered a pariah and shunned thereafter.
I realize it’s difficult to make judgments because none of the models can be routinely debunked, as controlled experiments are not available to easily falsify the results — yet all the editors have to do is look at a scientist’s track record to evaluate their sincerity and diligence in how they publish their research results.
With that said, let us all embrace the new-and-improved pasta approach — applying machine learning! The difference here is that the practitioners actually know how to do cross-validation to determine if the ML models have any practicality. Should be fun times ahead.
As the editor-in-chief of the AMS Journal of Physical Oceanography, I have also expressed my dismay at the current “comment and reply” protocol here, where basically the original author can “stonewall” the process. I’d suggest (and I will suggest) that the process be changed to a finite (and fairly short) deadline for the “reply”, after which the (approved) comment is published. A later “reply” might also be published (after review), and the two would then likely be linked, at least online. My initial suggestion for the “short deadline” would be the same as for reviewers: 3 or 4 weeks, max. The clock starts upon approval of the comment.
[Response: Agreed. That would be a big improvement. – gavin]
I would have thought that a critical comment could be peer-reviewed quickly, published without a reply, and then republished *with* the eventual reply. This puts the onus on the author to respond quickly, rather than rewarding them for stonewalling.
A more incremental improvement would be: when a comment is submitted, the comment is reviewed and the original authors are notified. When the comment is accepted, a response is solicited with a deadline of one month. The response is reviewed, but rather than being revised, the reviews are published along with the response.
That’s an interesting story by Trebino, but the thing I don’t get is this passage:
It seems obvious that the physicist Trebino has invented something. If the device actually works as described in his original paper, that’s enough to validate his research, and whatever someone else is criticizing should be devalued. That’s the concept often known as “the proof is in the pudding”
In the end, science is self-correcting — the fight is usually over who gets the credit.
Yes, science is self correcting, but typically with a lag of more than a generation.
Companies come and go on a whiff of news
Horse Puckey! Most scientific errors are found and corrected on a timescale of months. It is only when the theory also has some evolving to do that it can take longer–and it should, as doing science with a slightly wrong theory is often less dangerous than doing it with a theory you don’t understand.
Ray said:
Welcome to the world of geophysics. In just about any other scientific discipline, say solid-state physics for example, round-trip analysis is relatively quick. Especially when it involves a controlled experiment. Recall how quickly the recent pseudo-finding of room-temperature superconductivity resolved itself. Other scientists tried to replicate the findings and were rapidly able to come to an understanding of the mechanism, within a week or two IIRC. Alas, nothing is that quick in geophysics because there are no controlled experiments to provide a means of falsification. Everything is slowly chewed on because apparently the only factor that matters is the long waiting time for the results of predictions to dribble in.
A personal case in point: I have a model of QBO originally presented at an AGU meeting in 2016 and published in 2018. Perhaps no one is criticizing it because it continues to explain the QBO behavior better than any other model out there. From an article published last month:
So it’s really a waiting game to sort things out. Unlike Scafetta, I’m holding steady and not shooting randomly at anything that moves. Like I said in an earlier comment and from what I was taught, you don’t get a do-over.
KW said:
Interesting that the “gatekeeper” of the original QBO model (see above comment of mine) is the notorious Richard Lindzen. Back in the 1960’s when he took up explaining QBO as a research topic, he apparently went through all the possible forcing causes, which you can find from his papers. He dismissed the obvious cause of tides:
Yet, Lindzen never considered that tides act non-linearly with the annual cycle, thus generating sidebands that aren’t normally considered in conventional tidal analysis. That is the basis of my hypothesis, that the lunar tidal factor with the only symmetry that can effect a wavenumber=0 behavior such as QBO is the 27.212 day lunar Draconic cycle. And that cycle will create a frequency sideband that matches that of the average QBO period, and will also well approximate the square-wave-like shape. See Pukite(2018)
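The arithmetic behind the sideband claim can be sketched as follows. This only reproduces the aliasing calculation for a 27.212-day cycle sampled against an annual cycle; the year length of 365.242 days and the function name are assumptions added here, and the sketch says nothing about whether the proposed mechanism is physically correct.

```python
def aliased_period_years(cycle_days, year_days=365.242):
    """Period (in years) of the sideband produced when a short cycle
    beats against an annual cycle. Illustrative helper, not from the source."""
    cycles_per_year = year_days / cycle_days
    # The sideband frequency is the distance from the nearest whole
    # number of cycles per year.
    frac = cycles_per_year % 1.0
    alias_freq = min(frac, 1.0 - frac)
    return 1.0 / alias_freq


period = aliased_period_years(27.212)
print(f"sideband period: {period:.2f} years ({12 * period:.1f} months)")
```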
The point is that scientific influencers such as Lindzen may make assertions that prevent advances for the span of their careers, as other researchers decline to pursue these paths fearing they are dead-ends.
I believe the changing of the guard will be the application of machine learning, which based on the way it works will ignore subjective advice and instead plow through all the combinations so as to match the climate patterns. An example is that NVIDIA is looking for a “Senior AI Research Scientist for Climate & Weather Prediction” to join their team:
They will certainly clean things up, if not shake up the status quo.
I expect Scafetta to write garbage like this; it’s what he does. But perhaps a bigger issue for GRL is to address how this paper got accepted in the first place. Maybe the referees who approved it should be banned from acting as reviewers in the future, and their names and misdeeds made public.
Yes, me too: I also go for AD-HOMINEM methods in such and similar cases, wherever there is a SENSOR behind the Iron Curtain or Big Arsh (shortened B.A.) sitting secretly in key social or media positions.
It is a traditional and most efficient method of getting rid of Trolls: shine a light on them, then guess and publish their full name and address.
It is called corrupt or mafia-like wherever the traditional, quite ugly concept of Trolls is less known.
In a newspaper or journal, at least here in Scandinavia and as far as I know also in Germany and Britain, it would be considered absurd if critical comments on an article couldn’t be published until the author(s) of the criticized article had replied. In fact, such a practice is seen as intolerable and condemned as a kind of editorial partisanship and even corruption.
If you need evidence to see how this kind of bad science is picked up and used by the climate science denial industry, look at an article in ‘The Daily Sceptic’ of 3 November 2021. Headlined ‘Dodgy Climate Models should be Discarded’, the article continues:
‘A devastating indictment of the accuracy of climate models is contained in a paper just published by the highly credentialed Physicist Nicola Scafetta from the University of Naples. Professor Scafetta analysed 38 of the main models and found that most had over-estimated global warming over the last 40 years and many of them should be “dismissed and not used by policymakers”.
At the heart of the climate model problem is determining the equilibrium climate sensitivity (ECS). This is defined in climate science as the increase in the global mean surface temperature that follows a doubling of atmospheric CO2. Nobody knows what this figure is – the science for this crucial piece of the jigsaw is missing, unsettled you may say. So guesses are made and they usually range from 1C to as high as 6C. Models that use a higher figure invariably run hot and Professor Scafetta has proved them to be the least accurate in their forecasts.
More detailed research into this by Professor William Happer at Princeton has led him to conclude that a very low ECS, suggesting gentle if any warming, occurs when CO2 rises above the current atmospheric level of 420 parts per million. Far from being harmful, the extra CO2 is highly beneficial for plant growth and food. Slightly warmer temperatures can also be desirable. Homo Sapiens started in the tropics and only ventured out when the ice age started to lift – we like being warm and far more people die of the cold than the heat.
Failing to discuss the science behind climate change and simply blaming it all on humans is not science, it is anti-science, leading to faith-based green ideology. A plea for a more scientific approach was made two years ago by Professor Scaffeta along with a group of over 70 Italian scientists, including many distinguished academics, in a direct plea to Italian politicians. They stated that the human responsibility for climate change observed in the last century was “unjustifiably exaggerated and catastrophic predictions are not realistic”. Signatories of the letter included Antonino Zichichi, Professor emeritus of Physics and the discoverer of nuclear antimatter, and Renato Angelo Ricci, also an emeritus Professor of Physics and former President of the Italian Society of Physics. In total it was signed by 48 science professors. Needless to say it went unreported in the mainstream media at the time.’
Ah, the mainstream media! I am surprised they didn’t bring in the WEF. The Daily Sceptic is the go-to source for many climate deniers. You can prove Scafetta wrong all you like: the public doesn’t read the journals, it goes to online trash dumps like this for its information. I agree that the work of refutation has to be done anyway.
Heh. ‘The Daily Sceptic’ is rated by the Media Bias Chart as “Skews Right” to “Hyper-Partisan Right” on the Bias axis, and “Wide Variation in Reliability” to “Contains Misleading Info” (the least-reliable category) on the Reliability axis.
The Media Bias/Fact Check website says:
Overall, we rate the Daily Sceptic a far-right biased quackery level pseudoscience website that frequently publishes false and misleading information regarding covid-19 and science in general.
Sounds like an “online trash dump”, alright! How do we compete with that?
Dr. Schmidt,
Thank you for taking the time and trouble to publish the refutation of Scafetta. I know you’d much rather be working on your own interests and your own papers, but we need this kind of thing. Thank you for being involved.
My profuse thanks also, Gavin, for keeping us up to date on not only the right way to do climate science, but the wrong way as well!
Agree, thank you Gavin.
Has anybody been fired at GRL for publishing incorrect data and conclusions? At this point in time, no “alternative facts” in climate science or anything are allowed.
One way to enliven those who practice to disinform by publishing in predatory or pay-for-play journals is to drop in on them as they celebrate their work and dismiss their critics on YouTube.
Watch Willie Soon and Anthony Watts toss the Scafetta “What Retraction ?” climate ball around at
Where, unlike WUWT, Watts can’t cut or censor comments at will.
Have you ever considered, that Scafetta isn’t actually “wedded to his conclusions” in the sense that he would believe them? That, instead, he is just satisfying a demand for links to ‘studies’ and ‘papers’ that can be injected into the discussion to stir further confusion, so to speak?
That he’s just another “drug dealer” of the climate catastrophe who “just cares for his customers” and satisfies their demand for distraction and consolation?
More garbage by Scafetta in this paper with a November publication date, “Empirical assessment of the role of the Sun in climate change using balanced multi-proxy solar records”, Geoscience Frontiers, Volume 14, Issue 6, 2023,
shorter Scafetta: “I’m making up stuff transformed from sunspot data that kinda reflects the AGW signal and then ascribing 80% of the AGW signal to the made-up stuff.”
pretty disturbing that this kind of stuff makes it through the peer review process. JGR has three reviewers, does GRL?
Beyond me why Scafetta’s work is being cited in “State of the Climate in 2022” as if the climastrologer’s work is in any way credible.
“Several GCMs also exceed the likely range of estimates of climate sensitivity (Forster et al. 2021)—the global surface warming response to a doubling of atmospheric carbon dioxide—which in turn contributes to overestimates of historical warming (Scafetta 2023).
Scafetta, N., 2023: CMIP6 GCM ensemble members versus global surface temperatures. Climate Dyn., 60, 3091–3120”