Technologist Mag
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On

The Director of a Raunchy 3-Hour Dracula Movie Says AI Is Gross and Slimy. That’s Why He Used It

29 October 2025

The Best Seiko 5 Sports Watches

29 October 2025

Extropic Aims to Disrupt the Data Center Bonanza

29 October 2025

Ex-L3Harris Cyber Boss Pleads Guilty to Selling Trade Secrets to Russian Firm

29 October 2025

AI Agents Are Terrible Freelance Workers

29 October 2025
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Technologist Mag
SUBSCRIBE
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release
Technologist Mag
Home » AI Agents Are Terrible Freelance Workers
Tech News

AI Agents Are Terrible Freelance Workers

By technologistmag.com29 October 20253 Mins Read
Share
Facebook Twitter Reddit Telegram Pinterest Email

Even the best artificial intelligence agents are fairly hopeless at online freelance work, according to an experiment that challenges the idea of AI replacing office workers en masse.

The Remote Labor Index, a new benchmark developed by researchers at data annotation company Scale AI and the Center for AI Safety (CAIS), a nonprofit, measures the ability of frontier AI models to automate economically valuable work.

The researchers gave several leading AI agents a range of simulated freelance work and found that even the best could perform less than 3 percent of the work, earning $1,810 out of a possible $143,991. The researchers looked at several tools and found the most capable to be Manus from a Chinese startup of the same name, followed by Grok from xAI, Claude from Anthropic, ChatGPT from OpenAI, and Gemini from Google.

“I should hope this gives much more accurate impressions as to what’s going on with AI capabilities,” says Dan Hendrycks, director of CAIS. He adds that while some agents have improved significantly over the past year or so, that does not mean that this will continue at the same rate.

Spectacular AI advances have led to speculation about AI soon surpassing human intelligence and replacing vast numbers of workers. In March, Dario Amodei, CEO of Anthropic, suggested that 90 percent of coding work would be automated within a matter of months.

Previous waves of AI have inspired misplaced predictions about job displacement, for example concerning the imminent replacement of radiologists with AI algorithms.

The researchers generated a range of freelance tasks through verified Upwork workers. The tasks span a range of work including graphic design, video editing, game development, and administrative chores like scraping data. They combined a description of each job with a directory of files needed to perform the work and an example of a finished project produced by a human.

Hendrycks says that while AI models have gotten better at coding, math, and logical reasoning in recent years, they still struggle to use different tools and to perform complex tasks that involve numerous steps. “They don’t have long-term memory storage and can’t do continual learning from experiences. They can’t pick up skills on the job like humans,” he says.

The analysis offers a counterpoint to a benchmark of economic work offered in September by OpenAI called GDPval, which purports to measure economically valuable work. According to GDPval, frontier AI models such as GPT-5 are approaching human abilities on 220 tasks across a range of office jobs. OpenAI did not provide a comment.

Share. Facebook Twitter Pinterest LinkedIn Telegram Reddit Email
Previous ArticleThe Microsoft Azure Outage Shows the Harsh Reality of Cloud Failures
Next Article Ex-L3Harris Cyber Boss Pleads Guilty to Selling Trade Secrets to Russian Firm

Related Articles

The Director of a Raunchy 3-Hour Dracula Movie Says AI Is Gross and Slimy. That’s Why He Used It

29 October 2025

The Best Seiko 5 Sports Watches

29 October 2025

Extropic Aims to Disrupt the Data Center Bonanza

29 October 2025

Ex-L3Harris Cyber Boss Pleads Guilty to Selling Trade Secrets to Russian Firm

29 October 2025

The Microsoft Azure Outage Shows the Harsh Reality of Cloud Failures

29 October 2025

Save $30 on This All-Clad Nonstick Frying Pan Set

29 October 2025
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Don't Miss

The Best Seiko 5 Sports Watches

By technologistmag.com29 October 2025

Whether you fully realized it at the time, you’ve almost certainly seen a Seiko 5…

Extropic Aims to Disrupt the Data Center Bonanza

29 October 2025

Ex-L3Harris Cyber Boss Pleads Guilty to Selling Trade Secrets to Russian Firm

29 October 2025

AI Agents Are Terrible Freelance Workers

29 October 2025

The Microsoft Azure Outage Shows the Harsh Reality of Cloud Failures

29 October 2025
Technologist Mag
Facebook X (Twitter) Instagram Pinterest
  • Privacy
  • Terms
  • Advertise
  • Contact
© 2025 Technologist Mag. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.