Technologist Mag

AI Agents Are Terrible Freelance Workers

By technologistmag.com | 29 October 2025 | 3 Mins Read

Even the best artificial intelligence agents are fairly hopeless at online freelance work, according to an experiment that challenges the idea of AI replacing office workers en masse.

The Remote Labor Index, a new benchmark developed by researchers at data annotation company Scale AI and the Center for AI Safety (CAIS), a nonprofit, measures the ability of frontier AI models to automate economically valuable work.

The researchers gave several leading AI agents a range of simulated freelance work and found that even the best could perform less than 3 percent of it, earning $1,810 out of a possible $143,991. The most capable agent was Manus, from a Chinese startup of the same name, followed by Grok from xAI, Claude from Anthropic, ChatGPT from OpenAI, and Gemini from Google.

“I should hope this gives much more accurate impressions as to what’s going on with AI capabilities,” says Dan Hendrycks, director of CAIS. He adds that while some agents have improved significantly over the past year or so, that does not mean that this will continue at the same rate.

Spectacular AI advances have led to speculation about AI soon surpassing human intelligence and replacing vast numbers of workers. In March, Dario Amodei, CEO of Anthropic, suggested that 90 percent of coding work would be automated within a matter of months.

Previous waves of AI have inspired misplaced predictions about job displacement, for example concerning the imminent replacement of radiologists with AI algorithms.

The researchers sourced a set of freelance tasks from verified Upwork workers. The tasks span graphic design, video editing, game development, and administrative chores like scraping data. Each task pairs a description of the job with a directory of the files needed to perform the work and an example of a finished project produced by a human.

Hendrycks says that while AI models have gotten better at coding, math, and logical reasoning in recent years, they still struggle to use different tools and to perform complex tasks that involve numerous steps. “They don’t have long-term memory storage and can’t do continual learning from experiences. They can’t pick up skills on the job like humans,” he says.

The analysis offers a counterpoint to GDPval, a benchmark released in September by OpenAI that purports to measure economically valuable work. According to GDPval, frontier AI models such as GPT-5 are approaching human abilities on 220 tasks across a range of office jobs. OpenAI did not provide a comment.

© 2026 Technologist Mag. All Rights Reserved.
