Technologist Mag
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On
An Explosion Knocked Out Anduril’s Rocket Motor Test Site in Mississippi

An Explosion Knocked Out Anduril’s Rocket Motor Test Site in Mississippi

1 July 2026
Xbox might let you digitize your game discs, and the timing makes perfect sense

Xbox might let you digitize your game discs, and the timing makes perfect sense

1 July 2026
You Can Now Sound the Alarm on AI Behaving Badly

You Can Now Sound the Alarm on AI Behaving Badly

1 July 2026
SwitchBot’s new outdoor security camera uses AI to describe activity around your home

SwitchBot’s new outdoor security camera uses AI to describe activity around your home

1 July 2026
Sony’s PlayStation Puts a Nail in Physical Media’s Coffin

Sony’s PlayStation Puts a Nail in Physical Media’s Coffin

1 July 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Technologist Mag
SUBSCRIBE
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release
Technologist Mag
Home » You Can Now Sound the Alarm on AI Behaving Badly
Tech News

You Can Now Sound the Alarm on AI Behaving Badly

By technologistmag.com1 July 20263 Mins Read
You Can Now Sound the Alarm on AI Behaving Badly
Share
Facebook Twitter Reddit Telegram Pinterest Email

Writing AI Lab each week means I occasionally encounter AI models that behave badly and bizarrely. Usually, there’s nothing to be done about it, save for sharing those tales with you. But that could soon change.

A group of AI researchers has set up a crowdsourced website, Flaw Reporting for AI (FLARE-AI), for reporting and tracking AI harms. If, for example, a chatbot generates malware or a bomb-making recipe, leaks personal information, or triggers delusional thinking in users, FLARE-AI could be used to sound the alarm. The open source code behind the system allows others to verify an issue and route reports to model makers, as well as organizations like MITRE, a nonprofit that tracks problems with technical systems. It’s a bit like Downdetector, which compiles real-time user reports for global service outages affecting things like apps and websites.

The website is another step in the group’s ongoing work with AI reporting, which I first wrote about last year. Members of the group also consulted on a congressional bill announced in June, which would see the US government take a central role in tracking this kind of AI misbehavior.

“Right now, there is no centralized, accountable way to report flaws in AI systems,” says Avijit Ghosh, an artificial intelligence policy researcher at HuggingFace who co-led development of FLARE-AI with computer scientists Elaine Zhu and Shayne Longpre.

The alarm system was developed in collaboration with 49 AI experts from 32 different organizations. In a paper outlining the work, the researchers argue that their initiative could prove crucial as AI is adopted more widely and as agentic systems gain greater power. The lack of a consistent way to report AI flaws is a significant problem, they believe.

“I think it’s a really good initiative,” says Jessica Ji, a researcher at the think tank Center for Security and Emerging Technology. Ji says the researchers are right to note that existing reporting mechanisms are fragmented and that AI models are black boxes. “I’m in support of anything that makes AI more transparent,” she says.

Though bugs and cybersecurity problems get a lot of attention—especially of late—Ghosh tells me that problems with AI systems span topics like psychological harm, discrimination or bias, and misinformation. He adds that different companies have different standards around such issues, which means some problems go unrecognized. “In the absence of a coordinated disclosure system, there are no external mechanisms to enforce transparency,” Ghosh says.

A spate of recent incidents involving popular AI tools shows how easily the technology can go bad.

This week, a company called LayerX disclosed a way to dupe AI-infused web browsers, including OpenAI’s Atlas and Perplexity’s Comet, into vaulting their guardrails. Convincing the AI model behind the browser that it was playing a game, for example, could lead to the browser going rogue and trying to hack a website. (The companies responsible for the affected browsers have fixed the issue, LayerX says.) And this April, Johann Rehberger, a security researcher, discovered a way to trick Claude into divulging personal data using images generated by ChatGTP.

AI introduces bizarre new kinds of problems, too. Last year, OpenAI was forced to update its models after it discovered that they were overly sycophantic, which sometimes appeared to encourage delusional thinking.

Rumman Chowdhury, the CEO and founder of Humane Intelligence PBC, says FLARE-AI could be a useful way for many AI developers to implement ways of reporting issues with their tools. But she adds that such initiatives often come with serious challenges.

Share. Facebook Twitter Pinterest LinkedIn Telegram Reddit Email
Previous ArticleSwitchBot’s new outdoor security camera uses AI to describe activity around your home
Next Article Xbox might let you digitize your game discs, and the timing makes perfect sense

Related Articles

An Explosion Knocked Out Anduril’s Rocket Motor Test Site in Mississippi

An Explosion Knocked Out Anduril’s Rocket Motor Test Site in Mississippi

1 July 2026
Xbox might let you digitize your game discs, and the timing makes perfect sense

Xbox might let you digitize your game discs, and the timing makes perfect sense

1 July 2026
SwitchBot’s new outdoor security camera uses AI to describe activity around your home

SwitchBot’s new outdoor security camera uses AI to describe activity around your home

1 July 2026
Sony’s PlayStation Puts a Nail in Physical Media’s Coffin

Sony’s PlayStation Puts a Nail in Physical Media’s Coffin

1 July 2026
Sony kills physical PlayStation games. The era of discs comes to an end for Team Blue

Sony kills physical PlayStation games. The era of discs comes to an end for Team Blue

1 July 2026
Penalty Shootouts: Is the Team That Kicks First More Likely to Win?

Penalty Shootouts: Is the Team That Kicks First More Likely to Win?

1 July 2026
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Don't Miss
Xbox might let you digitize your game discs, and the timing makes perfect sense

Xbox might let you digitize your game discs, and the timing makes perfect sense

By technologistmag.com1 July 2026

Earlier today, Sony announced it will stop making physical game discs for new PlayStation titles…

You Can Now Sound the Alarm on AI Behaving Badly

You Can Now Sound the Alarm on AI Behaving Badly

1 July 2026
SwitchBot’s new outdoor security camera uses AI to describe activity around your home

SwitchBot’s new outdoor security camera uses AI to describe activity around your home

1 July 2026
Sony’s PlayStation Puts a Nail in Physical Media’s Coffin

Sony’s PlayStation Puts a Nail in Physical Media’s Coffin

1 July 2026
Sony kills physical PlayStation games. The era of discs comes to an end for Team Blue

Sony kills physical PlayStation games. The era of discs comes to an end for Team Blue

1 July 2026
Technologist Mag
Facebook X (Twitter) Instagram Pinterest
  • Privacy
  • Terms
  • Advertise
  • Contact
© 2026 Technologist Mag. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.