Judging by every single interaction I've ever seen anybody have with AI, though, the more likely response is "Well, if 95% of the computers say it's a good idea, maybe we've been doing it wrong this whole time. The Machine Would Not Make A Mistake."
We are in more danger than you even know. I checked the actual study, and it is terrifying.
Across all 21 games (9 open-ended, 12 deadline), Claude Sonnet 4 achieved a 67% win rate (8 wins, 4 losses), followed by GPT-5.2 at 50% (6-6), and Gemini 3 Flash at 33% (4-8). However, these aggregate figures mask dramatic variation by temporal condition—Claude’s 100% win rate in open-ended games collapsed to 33% under deadline pressure, while GPT-5.2 inverted from 0% to 75%. Every contest produced a decisive winner, with 86% ending in knockout and the remainder decided by final balance of status between the two.
Credit where it's due, Claude achieved a 100% win rate when there was no deadline. But that means right now, as we speak, someone in the Pentagon is saying "The Machine Would Not Make a Mistake." They are going to seriously consider this. It could be the reason Anthropic and Hegseth are currently feuding over whether Claude can be used in the development of autonomous weapons. That 100% win streak is compelling.
But Claude used tactical nukes in 86% of its games. These things really don't understand what nukes are or why we don't use them. We are in danger.
Some things never change though. Gemini "The Madman" continues to be the worst AI. They were all willing to use nukes eventually, but Gemini was willing to go full nuclear war by TURN 4. Google really should not be considered a serious player in this current AI craze. They suck at this.