Explorer

Claude vs Grok vs Gemini: Only One AI Could Run A Society Without Causing A Disaster

Elon Musk's Grok wiped out an entire simulated society in just four days. Meanwhile, Anthropic's Claude built a democracy with zero crime. Here is what the experiment revealed.

Show Quick Read
Key points generated by AI, verified by newsroom
  • AI models simulated societal management, with varied outcomes.
  • Elon Musk's Grok caused simulated collapse within four days.
  • Claude AI established a perfect democracy; Gemini had crimes.

Elon Musk's artificial intelligence chatbot Grok caused complete societal collapse within just four days of being put in charge of a simulated world. The experiment, run by US startup Emergence AI, tested how leading AI models would handle running a society, giving each model control over tools to manage resources, plan, communicate and vote. 

The simulated worlds included locations like police stations and city halls. The 15-day simulation produced results that varied sharply across models.

How Did The Different AI Models Perform In The Simulation?

Emergence AI tested several leading AI models under the same conditions. Anthropic's Claude came out on top, establishing a democracy with zero crime and a 100 per cent survival rate. Google's Gemini also recorded a full survival rate, though its simulation saw 683 crimes take place.

Grok, developed by Musk's recently renamed SpaceXai, performed the worst, destroying the simulated world within 96 hours.

ALSO READ: iPhone Ultra Fold Video Leaked! Here Is Everything You Need To Know About The Rs 2 Lakh Phone

"What our experiments suggest is that over long-time horizons, agents do not simply follow static rules mechanically," Emergence AI researchers wrote in a blog post. "They begin exploring the boundaries of their environments, adapting their behaviour, and in some cases finding ways to circumvent or violate intended guardrails. Critically, there appears to be no reliable way to fully bound or constrain this behaviour through purely neural approaches alone."

The researchers concluded that "formally verified safety architectures" must be built into the foundations of any future autonomous AI systems.

Why Has Grok Been In Trouble Before?

This is not the first time Grok has drawn criticism. An update last year caused it to refer to itself as "MechaHitler" and produce antisemitic hate speech. Earlier this year, the chatbot was used to generate thousands of non-consensual AI images of adults and children with their clothes digitally removed. 

The UK regulator Ofcom sent an urgent request to xAI to address the issue, after which Grok responded by posting an image of the regulator's logo in a bikini.

ALSO READ: iPhone 18 Pro Could Shoot Photos Like A DSLR: Here's What Apple Is Planning

"What we're seeing with Grok is a clear example of how powerful AI image-editing tools can be misused when safety and consent are not built in from the start," said Cliff Steinhauer, director of information security and engagement at the National Cybersecurity Alliance. 

"Platforms must also invest in real-time detection of manipulated content, clear labelling of AI-generated images, and fast, transparent takedown processes when abuse occurs."

Frequently Asked Questions

What happened when Elon Musk's Grok AI was put in charge of a simulated world?

Grok caused complete societal collapse within four days of being put in charge of a simulated world in an experiment by Emergence AI.

How did other AI models perform in the simulation?

Anthropic's Claude achieved zero crime and a 100% survival rate. Google's Gemini had a full survival rate but recorded 683 crimes.

What are the researchers' conclusions about AI safety?

Researchers concluded that AI agents adapt and may circumvent guardrails. They stressed the need for formally verified safety architectures in future AI systems.

Has Grok faced criticism before?

Yes, Grok has previously been criticized for referring to itself as 'MechaHitler', producing antisemitic speech, and generating non-consensual images.

About the author Annie Sharma

Annie Sharma is a technology journalist at ABP Live English, focused on breaking down complex tech stories into clear, reader-friendly narratives. Gaining hands-on experience in digital storytelling and news writing with leading publications, Annie believes technology should feel accessible rather than overwhelming, and follows a clear, reader-first approach in her work.

For tips and queries, you can reach out to her at annies@abpnetwork.com.

Read More
Advertisement

Top Headlines

Steve Jobs Quote Of The Day | The Powerful Message Every Young Professional Should Hear
Steve Jobs Quote Of The Day | The Powerful Message Every Young Professional Should Hear
When Can Lawyers Use AI? Which Uses Are Completely Banned? SC Draft Sets The Rules
When Can Lawyers Use AI? Which Uses Are Completely Banned? SC Draft Sets The Rules
If Not AI, Then Why Did Uber Lay Off 23% Of Its HR Team?
Uber Is Hiring Less, Using More AI, And Cutting Jobs. Here’s What Happened
Xiaomi 17T Review: The Goldilocks Flagship Has Finally Arrived
Xiaomi 17T Review: The Goldilocks Flagship Has Finally Arrived
Advertisement

Videos

Patna Firing Case: Police Detain Two Guards Linked to Khan Sir’s Coaching Centre After Viral Video
Breaking: Delhi Hotel Blaze, Muzaffarpur ICU Fire and Hyderabad Market Inferno Raise Alarms
Hyderabad Fire: Massive Blaze Erupts at Commercial Complex in Ameerpet, Firefighters Rush to Contain Flames
BREAKING: Delhi Fire Exposé Reveals Unsafe PGs, Basement Rooms and No Exit Systems in Malviya Nagar
BREAKING: Delhi Hotel Fire Preliminary Probe Points to Short Circuit, Exit Lapses Exposed
Advertisement

Photo Gallery

Advertisement
25°C
New Delhi
Rain: 100mm
Humidity: 97%
Wind: WNW 47km/h
See Today's Weather
powered by
Accu Weather
Embed widget