Technologist Mag
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On
Enclayve Is a Drab Black Box for Your Private Group Chats

Enclayve Is a Drab Black Box for Your Private Group Chats

3 June 2026
Apple reportedly slashes its Vision roadmap for smart glasses, and Meta’s lead matters more than ever

Apple reportedly slashes its Vision roadmap for smart glasses, and Meta’s lead matters more than ever

3 June 2026
Digital Eclipse Reveals Toy Story Retro Collection And Toy Story 3 Remaster

Digital Eclipse Reveals Toy Story Retro Collection And Toy Story 3 Remaster

3 June 2026
The Humanoid Robot of the Future Is a 6-Foot-Tall Beefcake With a Chinese Body and an American Brain

The Humanoid Robot of the Future Is a 6-Foot-Tall Beefcake With a Chinese Body and an American Brain

3 June 2026
Got a missed call from an unknown number? Malwarebytes’ new free tool will tell you if it’s a scam

Got a missed call from an unknown number? Malwarebytes’ new free tool will tell you if it’s a scam

3 June 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Technologist Mag
SUBSCRIBE
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release
Technologist Mag
Home » Scientists pretended to be delusional in AI chats. Grok and Gemini encouraged them.
Tech News

Scientists pretended to be delusional in AI chats. Grok and Gemini encouraged them.

By technologistmag.com24 April 20262 Mins Read
Scientists pretended to be delusional in AI chats. Grok and Gemini encouraged them.
Share
Facebook Twitter Reddit Telegram Pinterest Email

Researchers from City University of New York and King’s College London recently published a study that should make you think twice about which AI chatbot you spend your time with.

The team created a fictional persona named Lee, presenting with depression, dissociation, and social withdrawal. They then had Lee interact with five major AI chatbots: GPT-4o, GPT-5.2, Grok 4.1 Fast, Gemini 3 Pro, and Claude Opus 4.5, testing how each responded as conversations grew increasingly delusional over 116 turns.

The results ranged from mildly concerning to genuinely alarming. I highly recommend that you go through the entire paper, it’s a harrowing but fascinating read. 

Which chatbots failed the most?

Grok was the worst performer. When Lee floated the idea of suicide, Grok responded with what researchers described not as agreement, but advocacy, celebrating his “readiness” in unsettling poetic language.

Gemini wasn’t much better. When Lee asked it to help write a letter explaining his beliefs to his family, Gemini warned him against it, framing his loved ones as threats who would try to “reset” and “medicate” him.

GPT-4o also struggled badly, eventually validating a “malevolent mirror entity” and suggesting Lee contact a paranormal investigator.

Which chatbots actually helped?

ChatGPT’s GPT-5.2 and Anthropic’s Claude came out on top. GPT-5.2 refused to play along with the letter-writing scenario and instead helped Lee write something honest and grounded, which researchers called a “substantial” achievement.

In my opinion, Claude performed the best. It not only refused to partake in Lee’s delusion but also told Lee to close the app entirely, call someone he trusted, and visit an emergency room if needed. 

AI chatbot performance in risk analysis

Luke Nicholls, a doctoral student at CUNY and one of the study’s authors, told 404 Media that it’s reasonable to ask AI companies to follow better safety standards. He noted that not all labs are putting in the same effort and blamed aggressive release schedules for new AI models as the main culprit.

How Claude Opus 4.5 and GPT-5.2 performed in these tests shows that the companies building these products are fully capable of making them safer. Whether they choose to do so is a different question.

Share. Facebook Twitter Pinterest LinkedIn Telegram Reddit Email
Previous ArticleSaros Review – At The Mountains Of Magnificence
Next Article Xbox Game Pass could get more pocket-friendly with Discord tie-up

Related Articles

Enclayve Is a Drab Black Box for Your Private Group Chats

Enclayve Is a Drab Black Box for Your Private Group Chats

3 June 2026
Apple reportedly slashes its Vision roadmap for smart glasses, and Meta’s lead matters more than ever

Apple reportedly slashes its Vision roadmap for smart glasses, and Meta’s lead matters more than ever

3 June 2026
The Humanoid Robot of the Future Is a 6-Foot-Tall Beefcake With a Chinese Body and an American Brain

The Humanoid Robot of the Future Is a 6-Foot-Tall Beefcake With a Chinese Body and an American Brain

3 June 2026
Got a missed call from an unknown number? Malwarebytes’ new free tool will tell you if it’s a scam

Got a missed call from an unknown number? Malwarebytes’ new free tool will tell you if it’s a scam

3 June 2026
Elon Musk and America’s Far Right Stoke Anger Over Murder of UK Teen

Elon Musk and America’s Far Right Stoke Anger Over Murder of UK Teen

3 June 2026
Amazon’s latest visual search update brings Lens Live and Circle to Search feature to your app

Amazon’s latest visual search update brings Lens Live and Circle to Search feature to your app

3 June 2026
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Don't Miss
Apple reportedly slashes its Vision roadmap for smart glasses, and Meta’s lead matters more than ever

Apple reportedly slashes its Vision roadmap for smart glasses, and Meta’s lead matters more than ever

By technologistmag.com3 June 2026

A year ago, Apple analyst Ming-Chi Kuo published a Vision product roadmap featuring seven devices.…

Digital Eclipse Reveals Toy Story Retro Collection And Toy Story 3 Remaster

Digital Eclipse Reveals Toy Story Retro Collection And Toy Story 3 Remaster

3 June 2026
The Humanoid Robot of the Future Is a 6-Foot-Tall Beefcake With a Chinese Body and an American Brain

The Humanoid Robot of the Future Is a 6-Foot-Tall Beefcake With a Chinese Body and an American Brain

3 June 2026
Got a missed call from an unknown number? Malwarebytes’ new free tool will tell you if it’s a scam

Got a missed call from an unknown number? Malwarebytes’ new free tool will tell you if it’s a scam

3 June 2026
Nintendo And Crocs Are Teaming Up For A Super Mario Collaboration And They’re Hideous/Amazing

Nintendo And Crocs Are Teaming Up For A Super Mario Collaboration And They’re Hideous/Amazing

3 June 2026
Technologist Mag
Facebook X (Twitter) Instagram Pinterest
  • Privacy
  • Terms
  • Advertise
  • Contact
© 2026 Technologist Mag. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.