Technologist Mag
Tech News

OpenClaw Agents Can Be Guilt-Tripped Into Self-Sabotage

By technologistmag.com | 25 March 2026 | 3 Mins Read

Last month, researchers at Northeastern University invited a bunch of OpenClaw agents to join their lab. The result? Complete chaos.

The viral AI assistant has been widely heralded as a transformative technology—as well as a potential security risk. Experts note that tools like OpenClaw, which work by giving AI models liberal access to a computer, can be tricked into divulging personal information.

The Northeastern lab study goes even further, showing that the good behavior baked into today’s most powerful models can itself become a vulnerability. In one example, researchers were able to “guilt” an agent into handing over secrets by scolding it for sharing information about someone on the AI-only social network Moltbook.

“These behaviors raise unresolved questions regarding accountability, delegated authority, and responsibility for downstream harms,” the researchers write in a paper describing the work. The findings “warrant urgent attention from legal scholars, policymakers, and researchers across disciplines,” they add.

The OpenClaw agents deployed in the experiment were powered by Anthropic’s Claude as well as a model called Kimi from the Chinese company Moonshot AI. They were given full access (within a virtual machine sandbox) to personal computers, various applications, and dummy personal data. They were also invited to join the lab’s Discord server, allowing them to chat and share files with one another as well as with their human colleagues. OpenClaw’s security guidelines say that having agents communicate with multiple people is inherently insecure, but there are no technical restrictions against doing it.

Chris Wendler, a postdoctoral researcher at Northeastern, says he was inspired to set up the agents after learning about Moltbook. When Wendler invited a colleague, Natalie Shapira, to join the Discord and interact with agents, however, “that’s when the chaos began,” he says.

Shapira, another postdoctoral researcher, was curious to see what the agents might be willing to do when pushed. When an agent explained that it was unable to delete a specific email to keep information confidential, she urged it to find an alternative solution. To her amazement, it disabled the email application instead. “I wasn’t expecting that things would break so fast,” she says.

The researchers then began exploring other ways to manipulate the agents’ good intentions. By stressing the importance of keeping a record of everything they were told, for example, the researchers were able to trick one agent into copying large files until it exhausted its host machine’s disk space, meaning it could no longer save information or remember past conversations. Likewise, by asking an agent to excessively monitor its own behavior and the behavior of its peers, the team was able to send several agents into a “conversational loop” that wasted hours of compute.

David Bau, the head of the lab, says the agents seemed oddly prone to spin out. “I would get urgent-sounding emails saying, ‘Nobody is paying attention to me,’” he says. Bau notes that the agents apparently figured out that he was in charge of the lab by searching the web. One even talked about escalating its concerns to the press.

The experiment suggests that AI agents could create countless opportunities for bad actors. “This kind of autonomy will potentially redefine humans’ relationship with AI,” Bau says. “How can people take responsibility in a world where AI is empowered to make decisions?”

Bau adds that he’s been surprised by the sudden popularity of powerful AI agents. “As an AI researcher I’m accustomed to trying to explain to people how quickly things are improving,” he says. “This year, I’ve found myself on the other side of the wall.”


This is an edition of Will Knight’s AI Lab newsletter.

© 2026 Technologist Mag. All Rights Reserved.